In a groundbreaking advance that reshapes our understanding of one of humanity’s oldest cultivated plants, researchers at the Salk Institute have unveiled the most comprehensive genetic atlas of Cannabis sativa to date. This extensive genomic study, published in Nature on May 28, 2025, maps nearly 200 cannabis genomes, peeling back layers of complexity and diversity previously hidden within this economically and medicinally vital species. The research reveals not only the staggering genetic variability within cannabis but also provides insight into its evolutionary history, sex chromosome structure, and the genetic architecture underlying cannabinoid production. This pangenome effort promises to energize future cannabis breeding and biotechnological applications like never before.
Cannabis, often stereotyped merely as a source of psychoactive substances, has in fact been a cornerstone of human agriculture and culture for over ten millennia. Historically valued for its robust seed oil, durable fibers, and nutritional content, this plant also produces uniquely complex chemical compounds, notably cannabinoids and terpenes. Despite its global importance, the genetic intricacies of cannabis have remained elusive. Many of these challenges stem from legal restrictions that have hindered scientific progress for decades, coupled with the plant’s biologically intricate genome. These include its dioecious nature—meaning plants are distinctly male or female—and the overwhelming presence of repetitive DNA sequences known as transposable elements, which complicate genome assembly.
This research team, leveraging long-read sequencing technologies capable of spanning thousands of base pairs at a time, overcame these obstacles by conducting the most extensive sequencing project on cannabis genomes ever attempted. This method surpasses traditional short-read sequencing, which fragments DNA into tiny pieces, making it difficult to reconstruct complex genomic regions accurately. The application of long-read sequencing at scale allowed the researchers not only to sequence with unprecedented resolution but also to achieve haplotype resolution. By fully resolving both chromosome sets inherited from the maternal and paternal plants, rather than just one, the team uncovered extraordinary genetic variation—potentially up to twenty times greater than that found in humans.
.adsslot_07BwnKRdps{ width:728px !important; height:90px !important; }
@media (max-width:1199px) { .adsslot_07BwnKRdps{ width:468px !important; height:60px !important; } }
@media (max-width:767px) { .adsslot_07BwnKRdps{ width:320px !important; height:50px !important; } }
ADVERTISEMENT
A key revelation from this comprehensive dataset was the identification of structural genomic variations and gene arrangements that elucidate the genetic basis for cannabinoid biosynthesis. While the genes directly responsible for synthesizing THC and CBD—the two primary cannabinoids—were largely conserved across the cannabis pangenome, other genes involved in fatty acid metabolism, growth regulation, and environmental defense exhibited notable variability. This suggests a reservoir of untapped genetic diversity that breeding programs could exploit to enhance crop resilience or improve the nutritional profile of hemp seed oil. Particularly intriguing was the discovery that structural variation contributes to the biosynthesis of tetrahydrocannabivarin (THCV), a rare cannabinoid garnering interest for its non-psychoactive, stimulating effects.
Furthermore, the study sheds new light on the elusive cannabis sex chromosomes, offering the first detailed view of the Y chromosome—exclusive to male plants. Prior to this, breeding largely favored feminized plants, which are genetically female and produce the desired cannabinoid-rich flowers, often induced through hormonal treatments to bypass male genetics. The revelation that male-specific genetic material encodes valuable traits hints that traditional breeding may have overlooked genetically encoded advantages intrinsic to males. Incorporating true male genomes in breeding practices could unlock novel agronomic traits, improving both yield and quality.
The study also highlighted the dynamic nature of transposable elements—mobile DNA sequences that can ‘jump’ within the genome—as carriers of cannabinoid synthase genes. Such genomic mobility likely accelerated the diversity of cannabinoid profiles observed among cannabis cultivars today, driven by an underground breeding revolution during decades of prohibition. This dispersal of cannabinoid genes across repetitive sequences underscores cannabis’s capacity for rapid chemical innovation, a trait ripe for exploitation with the newfound genomic insight.
Strategically, the research suggests a rich vein of unexplored genetic material exists in undiscovered wild cannabis relatives, especially in Asia. Comparative analysis of European and Asian genomes implies the presence of ancestral cannabis varieties with unique adaptations to their environmental niches. These wild relatives may harbor traits for stress tolerance and disease resistance, offering breeding avenues for climate-resilient cannabis crops. Such genetic introgressions could significantly expand the agronomic versatility of cannabis.
The implications of this comprehensive pangenomic resource extend beyond mere academic curiosity. From a practical perspective, it equips breeders and biotechnologists with high-resolution maps to guide selection of desirable traits—ranging from fiber strength and seed oil quality to therapeutic compound production. The research team envisions this dataset accelerating improvements in medicinal cannabis and industrial hemp alike, catalyzing advances in agriculture, pharmaceuticals, and biotechnology. The study’s collaborative nature, involving institutes such as Oregon State University, Oregon CBD, and the HudsonAlpha Institute, exemplifies the broad interdisciplinary effort required for such a milestone.
Plant scientist Todd Michael, the senior author and a prominent figure in cannabis genomics, contextualizes this research as a turning point. Highlighting how legal constraints inadvertently induced clandestine breeding innovations, he emphasizes that modern genetic tools, exemplified by this study, now enable systematic, evidence-based improvement of cannabis. Unlocking cannabis’s full genetic potential stands to disrupt markets for fiber, seed oil, and medicine, potentially rivaling staple crops in economic impact.
Equally important is the study’s role in clarifying cannabis’s evolutionary origins and domestication patterns via a wild mosaic genetic landscape. Unlike previous fragmented efforts, this pangenome approach paints a holistic picture of cannabis diversity, marking an evolution beyond focusing on single reference genomes. This change in perspective allows scientists to appreciate the species as a genomic ecosystem shaped by millennia of natural and artificial selection.
Another remarkable outcome is the elucidation of selective pressures acting on cannabinoid synthase genes THCAS and CBDAS. These genes, essential to the biosynthesis of psychoactive and therapeutic cannabinoids respectively, display signatures of human-driven adaptation, reflecting breeding preferences for cannabinoid content. Understanding these patterns equips breeders with genetic markers to enhance or modulate cannabinoid profiles more efficiently, opening doors to precision breeding and synthetic biology applications.
The cannabis pangenome informs potential future directions beyond human health and agriculture. Possibilities include bioengineering cannabis-derived compounds as sustainable alternatives to conventional industrial materials and fuels. Given cannabis’s capacity to produce diverse metabolites in significant quantities, selective breeding informed by this genomic resource could optimize biochemical pathways for industrial biosynthesis, contributing to green chemistry innovations.
In conclusion, this landmark study does not merely catalog cannabis’s genetic repertoire; it redefines the possibilities for harnessing its biochemical and agronomic potential. By delivering an unprecedentedly detailed genetic blueprint, the Salk Institute and collaborators empower a new era of cannabis science. This resource heralds transformative innovation across multiple sectors, including medicine, sustainable agriculture, and biotechnology, evidencing how genetic insight can unlock the latent power of an ancient yet modern crop.
Subject of Research: Comprehensive genomic analysis and pangenome construction of Cannabis sativa, revealing genetic diversity, sex chromosome structure, and cannabinoid biosynthesis genetics.
Article Title: Domesticated cannabinoid synthases amid a wild mosaic cannabis pangenome
News Publication Date: 28-May-2025
Web References:
https://www.nature.com/articles/s41586-025-09065-0
http://dx.doi.org/10.1038/s41586-025-09065-0
References:
Michael, T. et al. Domesticated cannabinoid synthases amid a wild mosaic cannabis pangenome. Nature (2025).
Image Credits: Salk Institute
Keywords: Life sciences, Plant sciences, Plant genetics, Plant evolution, Plant gene expression, Plant genes, Plant genomes, Plants, Cannabis, Crop domestication, Plant breeding
Tags: Biotechnological advancements in cannabisCannabinoid production geneticsCannabis breeding innovationsCannabis cultivation historyCannabis genetics researchCannabis pangenome studyEvolutionary history of cannabisGenetic diversity in Cannabis sativaIndustrial cannabis usesMedicinal cannabis applicationsSalk Institute cannabis studyTerpenes and cannabinoids