The latest iteration of artificial intelligence uses basic models. Such basic models or “generalist” models can be used for numerous downstream tasks without particular training rather than building AI models that address specific tasks one at a time. For example, the massive pretrained language models GPT-3 and GPT-4 have revolutionized the basic model of AI. LLM can use low or zero try learning to apply your knowledge to new tasks for which it has not yet been taught. Multitasking learning, which allows LLM to accidentally learn from implicit tasks in its training corpus, is partly to blame for this.
Although LLM has demonstrated proficiency in few-shot learning in several disciplines, including computer vision, robotics, and natural language processing, its generalization to problems that cannot be observed in more complex fields has yet to be fully examined. like biology. It is necessary to understand the parties involved and the underlying biological systems to infer unobserved biological reactions. Most of this information is found in the free text literature, which could be used to train LLMs, while structured databases only encapsulate a small amount. Researchers at the University of Texas, the University of Massachusetts Amherst, and the University of Texas Health Science Center believe that LLMs, which draw on prior knowledge from the unstructured literature, could be a creative approach to prediction challenges. biology where there is a lack of structured data. and small sample sizes.
A crucial problem in such a few shot biological prediction is the prediction of drug pair synergy in cancers that have not been well explored. Drug combinations in therapy are now common practice to manage difficult-to-treat conditions such as cancer, infectious infections, and neurological disorders. Combination therapy frequently offers therapeutic results superior to treatment with a single drug. Drug discovery and development research has increasingly focused on predicting the synergy of drug pairs. Drug pair synergy describes how the use of two drugs together has a greater therapeutic impact than the use of each separately. Due to the many potential combinations and the complexity of the underlying biological systems, it cannot be easy to predict the synergy of drug pairs.
Several computational techniques have been created to anticipate the synergy of drug pairs, particularly using machine learning. Large data sets of results from in vitro experiments for drug combinations can be used to train machine learning algorithms to find trends and forecast the probability of synergy for a new drug pair. A relatively small amount of experimental data is accessible for some tissues, such as bone and soft tissue. Rather, most of the data refer to common forms of cancer in selected tissues, such as breast and lung cancer. The volume of training data available for drug pair synergy prediction is limited by the expensive and physically demanding nature of obtaining cell lines from these tissues. Machine learning models dependent on large data sets may need help training.
Early investigations ignored the biological and cellular variations of these tissues and extrapolated the synergy score to cell lines from other tissues based on relational or contextual information. By using various large-dimensional data, such as genomic or chemical profiles, another line of research has attempted to reduce the disparity between tissues. Despite promising findings in some tissues, these techniques should be used in tissues with sufficient data to modify your model with the many parameters for those high-dimensional properties. They want to address the aforementioned problem that LLMs face in this paper. They state that the scientific literature still contains useful information on cancer types with sparse organized data and inconsistent characteristics.
It is not easy to manually collect forecast data on such biological things from the literature. Using past information from the scientific literature stored in LLM is its novel strategy. They created a model that turns prediction work into a natural language inference challenge and generates answers based on the knowledge embedded in LLMs, called the few-shot drug-pair synergy prediction model. Their experimental findings show that their LLM-based low-shot prediction model outperformed robust tabular prediction models in most scenarios and achieved considerable accuracy even in zero-shot configurations. Demonstrating high potential in “generalist” biomedical artificial intelligence, this extraordinary few-shot prediction performance in one of the most difficult biological prediction tasks has vital and timely relevance to a large biomedical community.
review the Paper. Don’t forget to join our 20k+ ML SubReddit, discord channel, and electronic newsletter, where we share the latest AI research news, exciting AI projects, and more. If you have any questions about the article above or if we missed anything, feel free to email us at [email protected]
🚀 Check out 100 AI tools at AI Tools Club
Aneesh Tickoo is a consulting intern at MarktechPost. She is currently pursuing her bachelor’s degree in Information Science and Artificial Intelligence at the Indian Institute of Technology (IIT), Bhilai. She spends most of her time working on projects aimed at harnessing the power of machine learning. Her research interest is image processing and she is passionate about creating solutions around her. She loves connecting with people and collaborating on interesting projects.