• HOME
  • NEWS
  • EXPLORE
    • CAREER
      • Companies
      • Jobs
    • EVENTS
    • iGEM
      • News
      • Team
    • PHOTOS
    • VIDEO
    • WIKI
  • BLOG
  • COMMUNITY
    • FACEBOOK
    • INSTAGRAM
    • TWITTER
Wednesday, August 27, 2025
BIOENGINEER.ORG
No Result
View All Result
  • Login
  • HOME
  • NEWS
  • EXPLORE
    • CAREER
      • Companies
      • Jobs
        • Lecturer
        • PhD Studentship
        • Postdoc
        • Research Assistant
    • EVENTS
    • iGEM
      • News
      • Team
    • PHOTOS
    • VIDEO
    • WIKI
  • BLOG
  • COMMUNITY
    • FACEBOOK
    • INSTAGRAM
    • TWITTER
  • HOME
  • NEWS
  • EXPLORE
    • CAREER
      • Companies
      • Jobs
        • Lecturer
        • PhD Studentship
        • Postdoc
        • Research Assistant
    • EVENTS
    • iGEM
      • News
      • Team
    • PHOTOS
    • VIDEO
    • WIKI
  • BLOG
  • COMMUNITY
    • FACEBOOK
    • INSTAGRAM
    • TWITTER
No Result
View All Result
Bioengineer.org
No Result
View All Result
Home NEWS Science News Chemistry

Precious2GPT: Multiomics transformer and conditional diffusion for generation of multi-omics multi-species multi-tissue synthetic biological data

by
August 13, 2024
in Chemistry
Reading Time: 4 mins read
0
Share on FacebookShare on TwitterShare on LinkedinShare on RedditShare on Telegram

 

Schemetic representation of the Precious2GPT structure

Credit: Insilico Medicine

  • PreciousGPT series are pioneering architecture designed to understand the biological mechanisms and the aging process for life from birth to death
  • Precious2GPT diffusion-transformer architecture was published in Nature npj Aging
  • Precious2GPT integrates pretrained transformers with conditional diffusion models for generating multi-omics, multi-species, and multi-tissue data for drug discovery and aging research
  • Precious3GPT is in the process of community validation open source, and can be accessed on discord

 

Scientists at Insilico Medicine have introduced Precious2GPT, an innovative multimodal architecture that integrates the pretrained transformer and conditional diffusion for generating and predicting multi-omics, multi-species, and multi-tissue samples data. Published in the Nature npj aging, this pioneering study showcases Precious2GPT’s capability  to provide high-quality biological data that mimics the real world conditions to support biological mechanisms and the aging process researches, enhancing the understanding of fundamental life biology from birth to death.

Synthetic data generation in omics is a vital tool for training and evaluating genomic analysis tools, controlling differential expression, and exploring data architecture. Traditional methods often fall short due to the complexity and variability inherent in biological data. Precious2GPT addresses these challenges by integrating Conditional Diffusion (CDiffusion) and decoder-only Multi-omics Pretrained Transformer (MoPT) models, trained on gene expression and DNA methylation data. This novel approach not only outperforms existing models like Conditional Generative Adversarial Networks (CGANs) but also excels in generating representative synthetic data that captures tissue- and age-specific information.

The AI work was performed by Insilico’s teams under Insilico Medicine Canada in Montreal and  Insilico Medicine Middle East in Abu Dhabi and validation of the synthetic data generation and other capabilities of the model was performed by multiple teams around the world. 

“Precious2GPT represents a major advancement in synthetic data generation for multi-omics research,” says Frank Pun, PhD, co-author of the study. “The model generates accurate omics data, offering great potential for advancing our understanding of complex biological phenomena and developing new therapeutic strategies.”

The research team at Insilico employed a hybrid approach to construct Precious2GPT. The process began with the CDiffusion model generating an initial dataset that simulates gene expression levels based on a gene expression network. This network ensures biologically plausible gene expression patterns by incorporating dependencies between genes. The MoPT model then evaluates the quality of each gene’s generation, calculating a quality score that reflects the similarity between the synthetic data and real-world profiles. By combining these models using Feature Weighted Linear Stacking (FWLS), the team achieved a balanced and high-quality synthetic data generation.

The validation study results are promising. Precious2GPT demonstrated superior performance in age prediction accuracy using the generated data, even generating data beyond 120 years of age. This capability is particularly valuable for aging research, where longitudinal biological data is often scarce. Additionally, the model’s ability to generate tissue-specific data was validated through UMAP dimensionality reduction, showing high concordance with real labels.

In a colorectal cancer case study, Precious2GPT showcased its potential in identifying gene signatures and therapeutic targets. By generating control samples for colorectal cancer cell lines, the model enabled a meta-analysis that revealed significant gene expression signatures, closely aligning with known colorectal cancer pathology. This highlights the model’s utility in bioinformatic analyses and target discovery.

Insilico has been at the forefront of both generative AI and aging research, and began publishing studies on biomarkers of aging using advanced bioinformatics in 2014. Later, the company trained deep neural networks (DNNs) on human “multi-omics” longitudinal data and retrained them on diseases to develop its end-to-end Pharma.AI platform for target discovery, drug design, and clinical trial prediction. 

The concept of multimodal transformers for aging research was first proposed by Alex Zhavoronkov, founder and CEO of Insilico Medicine during the Gordon Research Conference (GRC) on Systems Aging in May, 2022. Subsequently, in order to explore the potential of multi modal transformers and diffusion models in learning longitudinal multi-Omics and development of the body world models, Insilico started working on the PreciousGPT series. Prior to Precious2GPT, Insilico released Precious1GPT in June 2023, a dual-transformer model using methylation and transcriptomic data for aging biomarker development and target discovery. 

“We are combining transformer and diffusion models and using other machine learning techniques to build models that understand fundamental biological changes in time and at the same time, understand how to affect this biology using different small molecule approaches, biologics, food and many other modifications that modulate the different biological pathways at different levels of organization. ”says Alex Zhavoronkov, PhD, founder and CEO of Insilico Medicine and corresponding author of the study.  “We open-source the PreciousGPT series and expect to unite researchers around the world to work in peace to extend healthy, productive and sustainable life for everyone on the planet.”

The implications of Precious2GPT extend beyond aging research. The model’s ability to generate synthetic data with high accuracy and specificity opens new avenues for studying various biological processes and diseases. Insilico scientists plan to further expand the application of Precious2GPT to other bioinformatics tasks including survival analysis, cross-modality prediction, and disease-specific omics generation. 

 

About Insilico Medicine

Insilico Medicine, a clinical stage end-to-end generative artificial intelligence (AI)-driven drug discovery company, is connecting biology, chemistry, and clinical trials analysis using next-generation AI systems. The company has developed AI platforms that utilize deep generative models, reinforcement learning, transformers, and other modern machine learning techniques for novel target discovery and the generation of novel molecular structures with desired properties. Insilico Medicine is developing breakthrough solutions to discover and develop innovative drugs for cancer, fibrosis, immunity, central nervous system diseases, infectious diseases, autoimmune diseases, and aging-related diseases. 

Website: www.insilico.com



Share12Tweet8Share2ShareShareShare2

Related Posts

Wayne State Study Advances Quality of Life for Individuals with Type 1 Diabetes

Wayne State Study Advances Quality of Life for Individuals with Type 1 Diabetes

August 27, 2025
Wayne State Researchers Pioneer Advances to Enhance Quality of Life for Individuals with Type 1 Diabetes

Wayne State Researchers Pioneer Advances to Enhance Quality of Life for Individuals with Type 1 Diabetes

August 27, 2025

Electrostatic Map Reveals Non-Covalent Metal–Organic Frameworks

August 27, 2025

Widespread Metal, Extraordinary Potential Unveiled

August 27, 2025

POPULAR NEWS

  • blank

    Breakthrough in Computer Hardware Advances Solves Complex Optimization Challenges

    149 shares
    Share 60 Tweet 37
  • Molecules in Focus: Capturing the Timeless Dance of Particles

    142 shares
    Share 57 Tweet 36
  • New Drug Formulation Transforms Intravenous Treatments into Rapid Injections

    115 shares
    Share 46 Tweet 29
  • Neuropsychiatric Risks Linked to COVID-19 Revealed

    82 shares
    Share 33 Tweet 21

About

We bring you the latest biotechnology news from best research centers and universities around the world. Check our website.

Follow us

Recent News

Impact of Low Blood Pressure Dipping on Pediatric CKD

Optimal Flow Rate for Thoracoabdominal Aortic Surgery

High SERPINE2 Levels Signal Kidney Issues in Diabetes

  • Contact Us

Bioengineer.org © Copyright 2023 All Rights Reserved.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Homepages
    • Home Page 1
    • Home Page 2
  • News
  • National
  • Business
  • Health
  • Lifestyle
  • Science

Bioengineer.org © Copyright 2023 All Rights Reserved.