• HOME
  • NEWS
    • BIOENGINEERING
    • SCIENCE NEWS
  • EXPLORE
    • CAREER
      • Companies
      • Jobs
    • EVENTS
    • iGEM
      • News
      • Team
    • PHOTOS
    • VIDEO
    • WIKI
  • BLOG
  • COMMUNITY
    • FACEBOOK
    • FORUM
    • INSTAGRAM
    • TWITTER
  • CONTACT US
Wednesday, May 25, 2022
BIOENGINEER.ORG
No Result
View All Result
  • Login
  • HOME
  • NEWS
    • BIOENGINEERING
    • SCIENCE NEWS
  • EXPLORE
    • CAREER
      • Companies
      • Jobs
        • Lecturer
        • PhD Studentship
        • Postdoc
        • Research Assistant
    • EVENTS
    • iGEM
      • News
      • Team
    • PHOTOS
    • VIDEO
    • WIKI
  • BLOG
  • COMMUNITY
    • FACEBOOK
    • FORUM
    • INSTAGRAM
    • TWITTER
  • CONTACT US
  • HOME
  • NEWS
    • BIOENGINEERING
    • SCIENCE NEWS
  • EXPLORE
    • CAREER
      • Companies
      • Jobs
        • Lecturer
        • PhD Studentship
        • Postdoc
        • Research Assistant
    • EVENTS
    • iGEM
      • News
      • Team
    • PHOTOS
    • VIDEO
    • WIKI
  • BLOG
  • COMMUNITY
    • FACEBOOK
    • FORUM
    • INSTAGRAM
    • TWITTER
  • CONTACT US
No Result
View All Result
Bioengineer.org
No Result
View All Result
Home NEWS Science News

New Machine Learning maps the potentials of proteins

Bioengineer by Bioengineer
May 4, 2022
in Science News
0
Share on FacebookShare on TwitterShare on LinkedinShare on RedditShare on Telegram

The biotech industry is constantly searching for the perfect mutation, where properties from different proteins are synthetically combined to achieve a desired effect. It may be necessary to develop new medicaments or enzymes that prolong the shelf-life of yogurt, break down plastics in the wild, or make washing powder effective at low water temperature.

protein_geodesic_map

Credit: Credit: W. Boomsma, N. S. Detlefsen, S. Hauberg.

The biotech industry is constantly searching for the perfect mutation, where properties from different proteins are synthetically combined to achieve a desired effect. It may be necessary to develop new medicaments or enzymes that prolong the shelf-life of yogurt, break down plastics in the wild, or make washing powder effective at low water temperature.

New research from DTU Compute and the Department of Computer Science at the University of Copenhagen (DIKU) can in the long term help the industry to accelerate the process. In the journal Nature Communications, the researchers explain how a new way of using Machine Learning (ML) draws a map of proteins, that makes it possible to appoint a candidate list of the proteins that you need to examine more closely.

In recent years, we have started to use Machine Learning to form a picture of permitted mutations in proteins. The problem is, however, that you get different images depending on what method you use, and even if you train the same model several times, it can provide different answers about how the biology is related.

“In our work, we are looking at how to make this process more robust, and we are showing that you can extract significantly more biological information than you have previously been able to. This is an important step forward in order to be able to explore the mutation landscape in the hunt for proteins with special properties,” says Postdoc Nicki Skafte Detlefsen from the Cognitive Systems section at DTU Compute.

The map of the proteins
A protein is a chain of amino acids, and a mutation occurs when just one of these amino acids in the chain is replaced with another. As there are 20 natural amino acids, this means that the number of mutations increases so quickly that it is completely impossible to study them all. There are more possible mutations than there are atoms in the universe, even if you look at simple proteins. It is not possible to test everything in an experimental manner, so you must be selective about which proteins you want to try to produce synthetically.

The researchers from DIKU and DTU Compute have used their ML model to generate a picture of how the proteins are linked. By presenting the model for many examples of protein sequences, it learns to draw a card with a dot for each protein so that closely related proteins are placed close to each other while distantly related proteins are placed far from each other.

The ML model is based on mathematics and geometry developed to draw maps. Imagine that you must make a map of the globe. If you zoom in on Denmark, you can easily draw a map on a piece of paper that preserves the geography. But if you must draw the earth, mistakes will occur because you stretch the globe, so that the Arctic becomes a long country instead of a pole. So, on the map, the earth is distorted. For this reason, research in map-making has developed a lot of mathematics that describe the distortions and compensate for the distortions on the map.

This is exactly the theory that DIKU and DTU Compute have been able to expand to cover their Machine Learning model (deep learning) for proteins. Because they have mastered the distortion on the map, they can also compensate for it.

“It enables us to talk about what a sensible distance target is between proteins that are closely related, and then we can suddenly measure it. In this way, we can draw a path through the map of the proteins that tells us which way we expect a protein to develop from to another – i.e. mutated, since they are all related to evolution. In this way, the ML model can measure a distance between the proteins and draw optimal paths between promising proteins,” says Wouter Boomsma, Associate Professor in the section for Machine Learning at DIKU.

The researchers have tested the model on data from numerous proteins that are found in nature, where their structure is known, and they can see that the distance between proteins starts to correspond to the evolutionary development of the proteins, so that proteins that are close to each other evolutionally are placed close to each other.

“We are now able to put two proteins on the map and draw the curve between them. On the path between the two proteins are possible proteins, which have closely related properties. This is no guarantee, but it provides an opportunity to have a hypothesis about which proteins it could be that the biotech industry ought to test when new proteins are designed,” says Søren Hauberg, professor in the Cognitive Systems section at DTU Compute.

The unique collaboration between DTU Compute and DIKU was established through a new centre for Machine Learning in Life Sciences (MLLS), which started last year with the support of the Novo Nordisk Foundation. In the centre, researchers in artificial intelligence from both universities are working together to solve the fundamental problems in Machine Learning driven by important issues within the field of biology.

The developed protein maps are part of a large-scale project that spans from basic research to industrial applications, e.g. in collaboration with Novozymes and Novo Nordisk.

FACT BOX: Artificial intelligence, machine learning and deep learning

When computer programs are able to do something ‘smart’, it is called artificial intelligence – or just AI. Artificial intelligence is thus a unified concept that covers several methods.
One of the methods is Machine Learning, and the latest and most advanced use of Machine Learning is called Deep Learning.

Deep Learning is based on neural networks, which is a mathematical model, where the model itself from a given dataset and without direct programming can learn to find patterns in data. Because you use data, it is called a data-driven model.

In unsupervised learning, the goal is to train a neural network to discover the underlying patterns in the data. This is typically done by attempting to compress data, because it thereby rejects the trends in data that is least frequent, while the most important data takes up more information, so you can see the underlying patterns.

By means of many repetitions, the network learns which patterns in data that can be used to compress data.

Once the model has been trained, it is tested on unknown data, which then also can be compressed into a compact representation that can be interpreted to form scientific hypotheses or form the foundation for other Machine Learning models.



Journal

Nature Communications

DOI

10.1038/s41467-022-29443-w

Article Title

Learning meaningful representations of protein sequences

Article Publication Date

8-Apr-2022

COI Statement

The authors declare no competing interests.

Share12Tweet8Share2ShareShareShare2

Related Posts

Researchers discover the mechanism responsible for information transfer between different regions of the brain

Researchers discover the mechanism responsible for information transfer between different regions of the brain

May 25, 2022
2022 Microsoft Imagine Cup

Microsoft Imagine Cup: Jacobs University students win World Championship

May 25, 2022

Why COVID vaccines are deemed non-essential for UK young children

May 25, 2022

The Cinderella Project: The right to see yourself in the mirror and like what you see

May 25, 2022

POPULAR NEWS

  • Masks

    Hidden benefit: Facemasks may reduce severity of COVID-19 and pressure on health systems, researchers find

    44 shares
    Share 18 Tweet 11
  • Breakthrough in estimating fossil fuel CO2 emissions

    46 shares
    Share 18 Tweet 12
  • Discovery of the one-way superconductor, thought to be impossible

    43 shares
    Share 17 Tweet 11
  • Sweet discovery could drive down inflammation, cancers and viruses

    43 shares
    Share 17 Tweet 11

About

We bring you the latest biotechnology news from best research centers and universities around the world. Check our website.

Follow us

Tags

VaccineVehiclesWeather/StormsUniversity of WashingtonUrogenital SystemZoology/Veterinary ScienceVirologyWeaponryVirusVaccinesViolence/CriminalsUrbanization

Recent Posts

  • Researchers discover the mechanism responsible for information transfer between different regions of the brain
  • Microsoft Imagine Cup: Jacobs University students win World Championship
  • Why COVID vaccines are deemed non-essential for UK young children
  • The Cinderella Project: The right to see yourself in the mirror and like what you see
  • Contact Us

© 2019 Bioengineer.org - Biotechnology news by Science Magazine - Scienmag.

No Result
View All Result
  • Homepages
    • Home Page 1
    • Home Page 2
  • News
  • National
  • Business
  • Health
  • Lifestyle
  • Science

© 2019 Bioengineer.org - Biotechnology news by Science Magazine - Scienmag.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Posting....