In the rapidly evolving landscape of genomics, the ability to decipher vast datasets from genome-wide association studies (GWAS) remains both a critical priority and a formidable challenge. Despite the surge in GWAS data generation, integrating complex analytical approaches such as Mendelian randomization (MR), polygenic risk score (PRS) calculations, and functional enrichment analyses like Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) has often posed significant technical barriers. A breakthrough in this domain now emerges through the development of MPGK, a sophisticated yet user-friendly command-line tool designed to unify these analyses within a streamlined and reproducible pipeline.
The program MPGK is the brainchild of Professor Yong Cui and his research team at China-Japan Friendship Hospital’s Department of Dermatology. Published in the Chinese Medical Journal in March 2026, this innovative software bridges the divide between powerful bioinformatics methods and accessibility for researchers at all levels. By consolidating MR, PRS, GO, and KEGG analyses into a repeatable workflow, MPGK alleviates the burden of managing multiple tools and scripting languages, thereby accelerating genomic data interpretation.
At its core, MPGK harnesses the renowned TwoSampleMR package to conduct MR analyses, automatically performing essential validations such as heterogeneity testing and pleiotropy assessment. This functionality allows researchers to infer causal relationships between genetic variants and phenotypes with improved accuracy and less manual effort. Simultaneously, its PRS module leverages PLINK and PRSice for the robust processing of genotype data alongside the computation of individualized polygenic risk scores. Notably, this module delivers detailed numerical outputs complemented by probability distribution plots, greatly enhancing interpretability.
The operational architecture of MPGK extends to seamlessly incorporate GO and KEGG pathway enrichment analyses through the clusterProfiler package, generating publication-ready visualizations including dot plots, bar plots, directed acyclic graphs, and network diagrams. This comprehensive integration enables the elucidation of biological functions and pathways implicated in GWAS findings without requiring researchers to manually handle diverse software environments or individual visualization tools.
What sets MPGK apart is its flexible command-line interface, featuring flag-based commands that facilitate either isolated execution of individual modules or the orchestration of entire workflows in one automated run. This design not only ensures reproducible research practices but also optimizes computational efficiency across Linux and macOS platforms. Importantly, the tool is engineered to run effectively on standard computing resources, democratizing access to advanced genomic analyses for smaller labs and clinical investigators who may not have access to high-performance computing clusters.
Demonstrating its practical utility, the MPGK team applied the software to both publicly available GWAS datasets for diabetes and psoriasis, as well as proprietary institutional sequencing data. Through MR analysis, MPGK robustly identified a significant causal linkage between diabetes and psoriasis, validating previous epidemiological and genetic studies with an inverse variance weighted P-value of 3.75 × 10⁻²⁵. The PRS functionality further enabled the computation and visualization of individual-level polygenic risk scores, revealing characteristic distribution patterns among case and control groups.
Functional enrichment analyses conducted with MPGK shed light on the molecular underpinnings of psoriasis by identifying notable immune pathways. Among these were pathways involved in antigen processing and presentation, Epstein–Barr virus infection, and T helper cell differentiation, all of which are well-documented contributors to psoriasis pathogenesis. These findings underscore MPGK’s capacity to translate GWAS results into meaningful biological narratives, reinforcing its value for both hypothesis generation and validation.
Crucially, MPGK’s emphasis on accessibility and reproducibility addresses a longstanding issue in bioinformatics workflows—the tendency toward fragmented, non-standardized approaches that hinder collaboration and result in duplicative manual efforts. By unifying multiple complex analyses within a single framework and standardizing input-output processes, MPGK reduces potential errors and facilitates consistent, transparent research outputs, advancing the field toward more open and reproducible genomic science.
Looking ahead, the creators of MPGK acknowledge its current focus primarily on genomics but express a clear vision for expansion into multi-omics data integration and advanced analytical techniques. Future iterations are poised to incorporate machine learning frameworks, potentially leveraging Python-based libraries to enhance modeling capabilities and interpretability, thereby positioning MPGK at the forefront of precision medicine analytics.
As genomic research accelerates, tools like MPGK represent a vital evolution by making powerful, multifaceted analyses more approachable to a wider range of scientists. By simplifying complex bioinformatics processes, MPGK empowers researchers—from novices to experts—to unlock actionable insights from expansive genetic data, accelerating discoveries that could transform understanding and treatment of complex diseases.
In this era of data deluge, the democratization of bioinformatics through integrative platforms like MPGK becomes imperative. Its blend of advanced analytics, automation, and user-centered design marks a significant stride toward bridging the gap between data availability and biomedical insight. Such innovations hold promise not only for advancing research but also for facilitating clinical translation that ultimately benefits patient care worldwide.
Subject of Research: Not applicable
Article Title: MPGK: A user-friendly tool for MR, PRS, GO, and KEGG analysis
News Publication Date: 9-Mar-2026
Web References: https://journals.lww.com/cmj/fulltext/9900/mpgk__a_user_friendly_tool_for_mr,_prs,_go,_and.1951.aspx
References: DOI: 10.1097/CM9.0000000000003980
Image Credits: Professor Yong Cui from China-Japan Friendship Hospital, China
Keywords: Mendelian randomization, polygenic risk score, Gene Ontology, KEGG, bioinformatics, GWAS, functional enrichment, genetic epidemiology, pathway analysis, reproducible research, computational genomics
Tags: advanced post-GWAS analysis pipelineChina-Japan Friendship Hospital genomics researchfunctional enrichment analysis integrationGene Ontology and KEGG analysisgenomic data analysis for beginnersMendelian randomization automationMPGK bioinformatics toolpolygenic risk score calculation softwarereproducible genomics workflowsstreamlining complex genetic dataTwoSampleMR package applicationuser-friendly GWAS data interpretation



