Researchers from Future House and Oxford created BioPlanner: an automated AI approach to assess and train the protocol planning capabilities of biology LLMs

Large language models (LLMs) generally face difficulties with multi-step problems and long-term planning, which is an important step in designing scientific experiments. Recent research presents a method, Bioplanner, that addresses the challenge of automating the generation of accurate protocols for scientific experiments. Researchers from Align to Innovate, the Francis Crick Institute, Future House and the University of Oxford introduced an automated assessment framework along with a dataset, BIOPROT1, which provides a solution to improve the planning capabilities of LLMs. BIOPROT1 is specifically focused on biology protocols. Researchers seek to expand the concept to other fields of science.

The generation of scientific protocols poses a significant challenge due to several reasons: variability in descriptions, sensitivity to small details, and the need for established metrics for evaluation. Traditional methods in biological research are time-consuming and carry risks of error. The BIOPROT1 dataset is presented, comprising biological protocols from Protocols.io, filtered and translated into pseudocode. The approach involves using a model that teaches LLMs to generate admissible actions and pseudocode for a protocol and evaluating the LLM's ability to reconstruct the pseudocode from a high-level description to enumerate admissible pseudocode functions.

Bioplanner uses GPT-4 to convert natural language protocols into pseudocode. First, it provides a structured representation that facilitates evaluation. The framework defines a set of protocol-specific pseudofunctions. This generates pseudocode and evaluates the performance of the model in reconstructing the pseudocode. Researchers explore multiple tasks, including next-step prediction, complete protocol generation, and feature recovery, using random input features and feedback loops for error detection. The BIOPROT1 dataset is verified and experiments show that pseudocode representations allow for more robust evaluation metrics. This successfully overcame the challenges associated with n-gram overlays and contextual embeddings.

Bioplanner addresses the critical problem of automating scientific experiment protocols by utilizing advanced language models. Evaluation of the method on the BIOPROT1 dataset shows the effectiveness of using pseudocode representations for more accurate and robust evaluation of LLMs. As expected, GPT-4 shows superior performance compared to GPT -3.5 on various tasks, indicating advances in long-term planning and multi-step problem solving. Real-world validation, where an LLM-generated protocol is successfully executed in a laboratory, underlines the practical utility of the proposed method.

Review the Paper. All credit for this research goes to the researchers of this project. Also, don't forget to follow us on Twitter. Join our 36k+ ML SubReddit, 41k+ Facebook community, Discord channeland LinkedIn Grabove.

If you like our work, you will love our Newsletter..

Don't forget to join our Telegram channel

Pragati Jhunjhunwala is a Consulting Intern at MarktechPost. She is currently pursuing B.tech from the Indian Institute of technology (IIT), Kharagpur. She is a technology enthusiast and has a keen interest in the scope of data science software and applications. She is always reading about the advancements in different fields of ai and ML.

<!– ai CONTENT END 2 –>

(Free ai Event) 'Meet SingleStore Pro Max, the Powerhouse Edition' (January 24, 2024, 10am PST)

Researchers from Future House and Oxford created BioPlanner: an automated AI approach to assess and train the protocol planning capabilities of biology LLMs

Technical Terrence Team

Navigating Single Stock Futures Amid Inflation Data

Leave a Reply Cancel reply

Recommended.

Redis changes license and acquires Speedb to go beyond its core in-memory database

Data is the basis of linguistic models | by Cameron R. Wolfe, Ph.D. | October 2023

47.36 million ETH staked in the Beacon ETH2 deposit contract

Daily Crunch: After raising $3M seeds, global fintech platform Payday plans to license Canada, UK

CoinEx Launches Automatic Investment Plan to Help Prudent and Beginning Investors Invest in Cryptocurrencies with Ease – Press Release Bitcoin News

Categories

Important Links

Researchers from Future House and Oxford created BioPlanner: an automated AI approach to assess and train the protocol planning capabilities of biology LLMs

Related

Technical Terrence Team

Navigating Single Stock Futures Amid Inflation Data

Leave a Reply Cancel reply

Recommended.

Redis changes license and acquires Speedb to go beyond its core in-memory database

Data is the basis of linguistic models | by Cameron R. Wolfe, Ph.D. | October 2023

47.36 million ETH staked in the Beacon ETH2 deposit contract

Daily Crunch: After raising $3M seeds, global fintech platform Payday plans to license Canada, UK

CoinEx Launches Automatic Investment Plan to Help Prudent and Beginning Investors Invest in Cryptocurrencies with Ease – Press Release Bitcoin News

Categories

Important Links

Get daily news updates to your inbox!