Meet Aeneas, Google DeepMind's AI For Decoding Ancient Texts

Share this article
Share this article
Prioritise Us on Google
Aeneas was designed to help historians 'fill in the blanks' of incomplete or indecipherable ancient texts | Credit: Google DeepMind
Google DeepMind's Aeneas AI model helps historians decode damaged Latin inscriptions, outperforming experts in dating accuracy and predictive text

Google DeepMind has developed an AI model that can predict where ancient Latin texts originate, estimate their age and restore missing sections.

The model is called Aeneas and it represents a significant advancement in the field of epigraphy — the study of ancient inscriptions.

Developed by a team of researchers from universities in the UK and Greece, as well as a team from DeepMind itself, Aeneas addresses one of the most longstanding challenges in the fields of history, archaeology and anthropology.

"Aeneas can retrieve relevant parallels from across our entire data set instantly" because each text has a unique identifier in the database, says Yannis Assael, a research scientist at Google DeepMind.

Yannis Assael, a research scientist at Google DeepMind

Training on vast historical datasets

Aeneas was trained on inscriptions from three of the world's largest databases of Latin epigraphy.

The combined dataset contained text from 176,861 inscriptions — plus images of 5% of them — with dates ranging across a millennium from the 7th century BC to the 8th century AD.

The model comprises three neural networks, each designed for different tasks: restoring missing text, predicting geographical origin and estimating age.

Along with results, Aeneas provides a ranked list of similar inscriptions from the dataset to support its conclusions.

A fragment of a military diploma from Sardinia which Aeneas is able to complete using predictive AI | Credit: Nature

Outperforming human experts in accuracy tests

The team tested Aeneas by asking 23 epigraphers to restore text removed from inscriptions and to date and identify origins both independently and with the model's assistance.

Working alone, experts dated inscriptions to within around 31 years of the correct answer.

Dates predicted by Aeneas were accurate to within 13 years — significantly outperforming human specialists.

When specialists had access to the model's predictions and similar inscription lists, they achieved greater accuracy in geographical identification and text restoration than either working alone or relying solely on the model.

Aeneas could unlock pieces of history though lost forever | Credit: Google DeepMind

Real-world applications

Testing on the well-known Res gestae divi Augusti, which details Roman emperor Augustus's life, showed the model's predictions aligned with historians' assessments.

The tool correctly identified spelling variations and features historians use to determine age and origin, without being misled by dates mentioned within the text.

When examining a Latin altar inscription, Aeneas notably included another altar from the same region in its similar inscriptions list, despite not being programmed with geographical or temporal connections.

The Temple of Augustus and Rome with the Res Gestae Divi Augusti ('Deeds of the Divine Augustus') inscribed on the walls of the cella in Ankara, Turkey | Credit: Following Hadrian

Addressing limitations of manual research

Anne Rogerson, who studies Latin texts at the University of Sydney, highlights the model's capacity to analyse vast amounts of data beyond any individual's capability.

"It's giving a hypothesis based on the evidence base that it's working from, so it's a rational guess rather than a wild stab in the dark," Anne says regarding the model's reliability when compared with more general AI tools.

Youtube Placeholder

The tool can help historians locate similar inscriptions — a process that typically requires weeks or months of manual research.

Thea Sommerschield, an epigrapher at the University of Nottingham, explains that finding comparable inscriptions is "incredibly time consuming" using traditional methods.

Thea Sommerschield, an epigrapher at the University of Nottingham

Current constraints and future development

The team acknowledges Aeneas faces limitations due to its smaller training database compared to models like ChatGPT and Microsoft's Copilot, potentially affecting performance on unusual inscriptions.

Anne suggests the model might prove less useful for unique inscriptions or those from periods with fewer available artefacts.

Despite these constraints, the research demonstrates how specialised AI applications can augment rather than replace human expertise in historical scholarship.

The development represents Google DeepMind's continued expansion into academic research applications, following previous work on ancient Greek text decipherment.

Company portals