Basic models can be applied to a wide variety of downstream tasks after being trained on large and varied data sets. From textual questions answering to visual descriptions and gameplay, individual models can now achieve cutting-edge performance. Growing data sets, larger models, and improved model architectures have given rise to new possibilities for basic models.
Due to the complexity of medicine, the difficulty of collecting broad and diverse medical information, and the novelty of this discovery, these models have not yet infiltrated medical AI. Most medical AI models use a task-specific modeling technique. Images must be manually labeled to train a model to analyze chest X-rays for pneumonia. A human must write a radiological report when this algorithm detects pneumonia. This hyper-focused, tag-based methodology produces rigid models that can only perform the tasks on the training dataset. To adapt to new tasks or data distributions for the same goal, these models sometimes require retraining on a new data set.
Developments such as multimodal architectures, self-supervised learning techniques, and learning-in-context capabilities have made possible a new class of sophisticated basic medical models called GMAIs. Their “generalist” tag suggests they will replace more specialized models for specific medical tasks.
Researchers from Stanford University, Harvard University, the University of Toronto, Yale University School of Medicine, and the Scripps Translational Research Institute identify three essential qualities that distinguish GMAI models from traditional medical AI models. .
- A GMAI template can be easily adapted to a new task simply by declaring the job in English (or another language). Models can tackle novel challenges after they have been presented (dynamic task specification), but before requiring retraining.
- GMAI models can take data from various sources and generate output in various formats. GMAI models will explicitly reflect medical knowledge, allowing them to reason through new challenges and communicate their results in terms medical professionals understand. Compared to existing medical AI models, GMAI models have the potential to address a wider variety of tasks with fewer or no labels. Two of GMAI’s defining capabilities, supporting various combinations of data modalities and the ability to perform dynamically set tasks, allow GMAI models to interact with users in a variety of ways.
- GMAI models must explicitly represent medical domain knowledge and use it for sophisticated medical reasoning.
GMAI provides remarkable adaptability between jobs and situations by allowing users to interact with models through custom queries, making AI insights accessible to a broader range of consumers. To generate queries like “Explain the mass on this head MRI,” users can use a custom query. Is it more likely to be a tumor or an abscess?
Two crucial features, dynamic task specification and multimodal inputs and outputs will be possible through user-defined queries.
- Dynamic task specification: AI models can be retrained on the fly using custom queries to learn how to address new challenges. When asked, “Given this ultrasound, what is the thickness of the gallbladder wall in millimeters?” GMAI can provide an answer that has never been seen before. The GMAI can be trained in a new notion with just a few examples, thanks to learning in context.
- Multimodal inputs and outputs: Personalized consultations make possible the ability to arbitrarily combine modalities in complex medical problems. When requesting a diagnosis, a doctor can attach various photos and lab reports to your inquiry. If the client requests both a textual response and an accompanying display, a GMAI model can easily accommodate both requests.
Some of the use cases of GMAI are mentioned below:
- Credible Radiological Findings: GMAI paves the way for a new class of flexible digital radiological assistants that can support radiologists at any stage of their processes and significantly reduce their workload. Radiology reports that include both aberrant and relevant normal results and that take patient history into account can be automatically produced by GMAI models. When combined with text reports, the interactive visualizations of these models can be of great help to clinicians, for example, by highlighting the area specified by each sentence.
- Improved Surgical Methods: With a GMAI model, surgical teams are expected to perform treatments more easily. GMAI models can perform visualization tasks, such as annotating live video feeds of a trade. When surgeons discover unusual anatomical events, they can also convey verbal information by sounding alarms or reading aloud from pertinent literature.
- Help make tough decisions right at bedside. More detailed explanations and recommendations for future care are made possible by GMAI-enabled bedside clinical decision support tools, which are based on existing AI-based early warning systems.
- Make Proteins from Text: GMAI synthesized protein amino acid sequences and three-dimensional structures from text input. This model could be conditional on the production of protein sequences with desirable functional characteristics, such as those found in existing generative models.
- Collaborative note taking. The GMAI models will automatically draft documents such as electronic notes and discharge reports; doctors will only have to review, update and approve them.
- Medical chatbots. New patient support apps could work with GMAI, enabling high-quality care to be delivered even outside of clinical settings.
review the Paper and Reference article. Don’t forget to join our 19k+ ML SubReddit, discord channel, and electronic newsletter, where we share the latest AI research news, exciting AI projects, and more. If you have any questions about the article above or if we missed anything, feel free to email us at [email protected]
🚀 Check out 100 AI tools at AI Tools Club
Tanushree Shenwai is a consulting intern at MarktechPost. She is currently pursuing her B.Tech at the Indian Institute of Technology (IIT), Bhubaneswar. She is a data science enthusiast and has a strong interest in the scope of application of artificial intelligence in various fields. She is passionate about exploring new advances in technology and its real life application.