• HOME
  • NEWS
  • EXPLORE
    • CAREER
      • Companies
      • Jobs
    • EVENTS
    • iGEM
      • News
      • Team
    • PHOTOS
    • VIDEO
    • WIKI
  • BLOG
  • COMMUNITY
    • FACEBOOK
    • INSTAGRAM
    • TWITTER
Saturday, July 26, 2025
BIOENGINEER.ORG
No Result
View All Result
  • Login
  • HOME
  • NEWS
  • EXPLORE
    • CAREER
      • Companies
      • Jobs
        • Lecturer
        • PhD Studentship
        • Postdoc
        • Research Assistant
    • EVENTS
    • iGEM
      • News
      • Team
    • PHOTOS
    • VIDEO
    • WIKI
  • BLOG
  • COMMUNITY
    • FACEBOOK
    • INSTAGRAM
    • TWITTER
  • HOME
  • NEWS
  • EXPLORE
    • CAREER
      • Companies
      • Jobs
        • Lecturer
        • PhD Studentship
        • Postdoc
        • Research Assistant
    • EVENTS
    • iGEM
      • News
      • Team
    • PHOTOS
    • VIDEO
    • WIKI
  • BLOG
  • COMMUNITY
    • FACEBOOK
    • INSTAGRAM
    • TWITTER
No Result
View All Result
Bioengineer.org
No Result
View All Result
Home NEWS Science News Technology

Ancient Texts Decoded by Neural Networks

Bioengineer by Bioengineer
July 26, 2025
in Technology
Reading Time: 5 mins read
0
ADVERTISEMENT
Share on FacebookShare on TwitterShare on LinkedinShare on RedditShare on Telegram

blank

In a groundbreaking fusion of ancient history and cutting-edge artificial intelligence, researchers have unveiled Aeneas, a sophisticated machine learning model designed to revolutionize the study of Latin inscriptions. This technological marvel aims to assist historians by restoring fragmented texts, dating undocumented carvings, and pinpointing their geographic origins within the vast expanse of the Roman Empire. The advent of Aeneas heralds a new era for digital humanities, where deep learning meets millennia-old stone, unearthing insights previously buried under the patina of time.

The impetus for Aeneas stems from recent strides in machine learning, particularly the proliferation of transformer architectures, which have excelled in natural language processing. Unlike conventional approaches that often rely on rigid rule-based systems or manual curation, Aeneas integrates both textual data and visual imagery, harnessing a multimodal framework that taps into the physicality of inscriptions. This allows the system to engage with nuances such as the carving style, material wear, and layout—factors that are crucial in epigraphic interpretation.

Central to Aeneas’s success is the Latin Epigraphic Dataset (LED), meticulously compiled from leading epigraphic repositories including the Epigraphic Database Roma (EDR), Epigraphic Database Heidelberg (EDH), and the Electronic Archive of Greek and Latin Epigraphy (EDCS_ETL). This formidable dataset encompasses over 16 million characters sourced from inscriptions spanning an extensive timeline, approximately 800 BCE to 800 CE, and covering diverse Roman provinces. The careful curation involved normalizing metadata, standardizing ancient provincial nomenclature, and retaining the original epigraphic conventions, such as editors’ restoration brackets and placeholders for missing characters, to preserve historical authenticity and challenge the model’s learning capacity.

.adsslot_yIuvk8Ddgq{width:728px !important;height:90px !important;}
@media(max-width:1199px){ .adsslot_yIuvk8Ddgq{width:468px !important;height:60px !important;}
}
@media(max-width:767px){ .adsslot_yIuvk8Ddgq{width:320px !important;height:50px !important;}
}

ADVERTISEMENT

Importantly, the dataset also includes a limited but representative selection of inscription images. These visual data underwent rigorous preprocessing, including the exclusion of non-photographic artefacts, the removal of blurry frames using Laplacian variance analysis, and conversion to greyscale, ensuring uniformity suitable for deep learning. However, only about 5% of textual inscriptions are accompanied by images, presenting limitations that highlight the scarcity of multimodal epigraphic data, a bottleneck that Aeneas’s creators aim to alleviate with future dataset expansions.

Aeneas is architected to undertake four principal epigraphic tasks: fixed-length text restoration, restoration of missing sequences with unknown length, geographical attribution, and chronological dating. To do this, it processes sequences up to 768 characters long, marked with placeholder symbols denoting absent or uncertain text portions, and pairs these with corresponding inscription images. The model’s transformer-based design enables it to capture both linguistic context and subtle visual cues, such as inscriptive style or stone type, thereby enhancing its predictive accuracy beyond traditional text-only analyses.

Notably, the system confronts the challenge of editorial circularity, a phenomenon in which previous human restorations—especially the expansion of abbreviations and speculative text restorations—might inadvertently bias model training. While expansions were removed from the dataset to avoid outright replication of editorial interpolations, conjectured restorations marked by square brackets were retained to maximize the quantity of learning material. Trials showed that excluding these conjectures diminished model performance, indicating that including carefully curbed editorial input strikes a balance between dataset richness and bias mitigation.

Beyond raw algorithmic prowess, Aeneas boasts a contextualization mechanism that retrieves historically and linguistically relevant parallel inscriptions, serving as a research assistant that extends beyond mere pattern matching. An extensive historian–AI evaluation involving 23 epigraphers validated this feature: participants found that Aeneas’s suggested parallels dramatically reduced the time taken to identify contextual analogues from days to mere minutes, facilitating more profound and nuanced historical interpretations. The experts reported significant increases in confidence—up to nearly 70%—when equipped with Aeneas’s parallels and predictive suggestions, underscoring its potential as a collaborative tool complementing expert judgment.

However, Aeneas is not without constraints. Its effectiveness correlates strongly with the data density of specific regions and periods. For example, the model excels in attributing inscriptions from central Roman provinces such as Roma and Africa Proconsularis due to abundant high-quality data. Conversely, regions like Sicily and Sardinia, where inscriptions are sparse and data coverage is patchy, exhibit notably decreased attribution accuracy. Furthermore, the model’s ability to date inscriptions peaks around 200 CE, aligning with a historically prolific phase of Latin epigraphy and rich dataset representation, while earlier periods with scarcer and less precisely dated texts yield more ambiguous results.

The model’s multimodal approach also ushers in novel investigative avenues beyond textual analysis. Image saliency maps reveal that Aeneas leverages architectural and iconographic elements, such as altar shapes or carving techniques, to substantiate its geographical predictions. This capacity to incorporate visual context mimics expert epigraphers’ archaeological acumen, enabling the model to identify subtler markers of provenance that escape pure linguistic scrutiny. Such integrations hint at the transformative potential of combining AI with traditional epigraphic methodologies.

One illustrative case study involves an inscribed limestone altar dedicated to the Deae Aufaniae in Germania superior, dating to 211 CE. Aeneas accurately restores missing text fragments, assigns precise chronological dates within tight brackets, and situates the inscription geographically with remarkable precision. Moreover, through its parallel retrieval function, the model identifies related altars and formulaic expressions spanning contiguous Roman provinces, emulating and extending human scholarly insight. This exemplifies how Aeneas can uncover epigraphic networks and shared cultural practices previously obscured by fragmented or dispersed evidence.

Beyond research, Aeneas is paving the way for educational innovation. Collaborations with educational institutions in Belgium have produced curricula integrating Aeneas as a pedagogical tool, enabling high school and university students to engage directly with ancient Latin inscriptions mediated by AI. This approach cultivates digital literacy alongside classical scholarship, aligned with contemporary frameworks such as the European Digital Competence Framework and UNESCO’s AI competency standards, fostering critical engagement with both AI technologies and historical sources.

Looking ahead, the team behind Aeneas envisions enriching its capabilities via integration with large-scale dialogue models. Such enhancements could facilitate interactive research workflows, allowing historians to query the system dynamically and receive transparent, explainable insights. The researchers also highlight the need for improved methods to handle chronological uncertainty inherent in ancient data, advocating for evaluation metrics that accommodate variable date ranges rather than rigid point estimates.

Fundamental to the project’s ethos is an ethical commitment. Recognizing the risk of AI reinforcing biases or misclassifying sensitive historical contexts, the developers stress the indispensability of human oversight and caution against blind reliance on automated interpretations. Their interdisciplinary collaboration espouses responsible AI deployment, championing tools that augment rather than replace the nuanced judgment of humanities scholars, and emphasize inclusivity by highlighting epigraphy’s potential to illuminate marginalized voices in the ancient world.

In sum, Aeneas stands as a landmark achievement at the intersection of artificial intelligence and historical inquiry. Through sophisticated multimodal learning, expansive datasets, and human-centric evaluations, it charts a transformative path for epigraphy and ancient studies. By marrying the computational rigor of machine learning with the interpretive subtleties of scholarly expertise, Aeneas exemplifies how AI can deepen our dialogue with the past, unlocking the silent stories etched in stone through the ages.

Subject of Research:
Contextualizing and analyzing Latin inscriptions from the Roman Empire using multimodal machine learning to restore text, and attribute geographical and chronological provenance.

Article Title:
Contextualizing Ancient Texts with Generative Neural Networks

Article References:
Assael, Y., Sommerschield, T., Cooley, A. et al. Contextualizing ancient texts with generative neural networks.
Nature (2025). https://doi.org/10.1038/s41586-025-09292-5

Image Credits:
AI Generated

Tags: ancient texts and AIartificial intelligence and archaeologydigital humanities advancementsepigraphic interpretation techniquesgeographic origins of Roman inscriptionshistorical text analysis toolsLatin Epigraphic DatasetLatin inscriptions restorationmachine learning in epigraphymultimodal frameworks in AItechnology in historical researchtransformer architectures in history

Share12Tweet8Share2ShareShareShare2

Related Posts

blank

Durable, Flexible Electrochemical Transistors via Electropolymerized PEDOT

July 26, 2025
Challenges and Opportunities in High-Filled Polymer Manufacturing

Challenges and Opportunities in High-Filled Polymer Manufacturing

July 26, 2025

Biodegradable, Mass-Produced Tungsten-PBAT Conductive Fiber

July 26, 2025

Hippocampal Maps Shift Despite Stable Senses

July 26, 2025

POPULAR NEWS

  • Blind to the Burn

    Overlooked Dangers: Debunking Common Myths About Skin Cancer Risk in the U.S.

    48 shares
    Share 19 Tweet 12
  • USF Research Unveils AI Technology for Detecting Early PTSD Indicators in Youth Through Facial Analysis

    42 shares
    Share 17 Tweet 11
  • Dr. Miriam Merad Honored with French Knighthood for Groundbreaking Contributions to Science and Medicine

    45 shares
    Share 18 Tweet 11
  • New Measurements Elevate Hubble Tension to a Critical Crisis

    43 shares
    Share 17 Tweet 11

About

We bring you the latest biotechnology news from best research centers and universities around the world. Check our website.

Follow us

Recent News

Durable, Flexible Electrochemical Transistors via Electropolymerized PEDOT

Challenges and Opportunities in High-Filled Polymer Manufacturing

Epicardial Fat: Protector or Threat to Heart Health?

  • Contact Us

Bioengineer.org © Copyright 2023 All Rights Reserved.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Homepages
    • Home Page 1
    • Home Page 2
  • News
  • National
  • Business
  • Health
  • Lifestyle
  • Science

Bioengineer.org © Copyright 2023 All Rights Reserved.