LoRa adaptors

Vibrant green numbers on a computer screen, showcasing binary code and data streams.

LoRa in the IOT community are the low power long range wireless standards devices that are utilized to send a signal over large distances. However, in the AI field LoRa implies Low rank adaptation – which is relatively more efficient compute way to fine tune pretrained models (LLM, vision transformers, diffusion models).

With a pretrained model is general and to improve its performance on a particular task a process called fine-tuning needs to be performed. Now, it is easier, to fine tune a model for a task rather than develop a brand new model from scratch. Not only easier but also computationally less intense especially since restart with all the data and resources are not required to make a new model for a new task.

Large models by definition are large and fine tuning on the new dataset is computation expensive, needs data and is harder to share. To accommodate fine tuning other methods have been tried – such as bottleneck adaptors, prefix/prompt tuning which involve modifying all model weights which take resources and can lead to issues like forgetting wherein the model loses the capability that existed before in the model.

LoRa (low rank adaptation) focuses on a select set of parameters to modify. This is called low-rank decomposition to adapt the large model but freeze the pretrained model weights intact. Low rank matrices called matrix A and matrix B. Matrix A is for the changes that are needed and matrix B helps in projecting matrix A changes back to the original parameter space. Thus, knowledge in the core model is unchanged but now the new model can adapt to new tasks.

These matrices are placed in attention layers in transformers or in cross-attention and self attention in diffusion models.

The workflow is simple – freeze the base model → inject LoRa modules into specific layer, train LoRa parameters on yoru dataset. Then you deploy either as merged with base model or as separate modules for multi-task adaptability.

As example, it can be used for modifying a LLM for domain specific knowledge, transfer a new style in image generation.

Similar Posts

  • DeepSpot

    Kalin Klonchev – the winner of a competition for AI based data analysis from Broad in 2024 had also created a tool called DeepSpot. Worth looking at for spot analysis of H&E sections by converting a full H&E slide pictures to “spots” which are analyzed. Some good links: DeepSpot paper: https://www.medrxiv.org/content/10.1101/2025.02.09.25321567v1 DeepSpot GitHub repository: https://github.com/ratschlab/DeepSpot…

  • |

    Reinforcement learning

    Reinforcement learning is a method that drives learning and memory in primitive species such as birds, humans and other living species to its use in machine learning. It is used to influence the behavior of us humans on social media to its use to train machines. The essential components were initiated by BF Skinner 20th…

  • AI job search

    With AI enabling many activities in the job hunt process, it is expected that many job applicants and executive at hiring companies use AI based tools for the process. The “automated” process used to be enabled by keywords which was the way that the candidates were selected from a large pool but with the availability…

  • Tools for critique of art – open source

    Good publication to review material: CognArtive: Large Language Models for Automating Art Analysis and Decoding Aesthetic Elements at https://arxiv.org/html/2502.04353v1 And the wonderful visualization to analyze the data over the years that was derived from WikiArt. https://cognartive.github.io Autocritic: an open-source system that uses classic art theory to evaluate images and steer generative systems. It distills historical…

  • AI Automations

    The AI automations have only increased. There is one interesting one that has been receiving publicity. Check it out: https://knowledgework.ai It takes notes while the person is working and becomes the second brain. Privacy and access may be of concern but capability is available with AI tools.

  • | | |

    Biotech companies

    Small Biotechs: Diagonal Tx: Clustering antibodies that mimic the action of the ligand and bypass the need for the ligand and receptor. This mutation that is created makes standard AI models not useful and so need a new method. This restores new ALK1 signaling in Hereditary Hemorrhagic Telangiectasia. It also treates LoF mutations in ALK1…