The HIMANIS project
Indexing the Trésor des Chartes registers
Tell me more


Manuscripts are among the most important witnesses to our European shared cultural heritage. Despite a large digitization, the wealth of their content remains largely inaccessible : current handwritten text recognition technology is not accurate enough to allow full text search. This situation raises the need of searching approaches specifically designed for text images.

HIMANIS aims at developing cost-effective solutions for querying large sets of handwritten document images.

Transcription and Indexing

a new indexing/searching technology for historical manuscripts

Full text search

a new paradigm to study our historical heritage, as conveyed by manuscripts, by using full text search technology.

History of Europe

a new vision of the raise of nation states in Europe via a new study of the corpus under this paradigm.

The Chancery Corpus

The “Tresor des Chartes” is the most valuable and iconic archive of the French kings and has been built as a collection as early as in the 13th c. Within the “Tresor”, the collection of medieval registers (aka “Chancery corpus”) stand out. They keep record of the charters, grants and privileges given by the king of France and were produced directly by the French royal chancery. Dating from 1302 to 1483, they contain ca. 68,000 charters and documents. This large and iconic collection bears witness to the rationalization of late medieval administration and is a key source to our understanding of medieval Europe and the rise of centralized nation state on the continent as a consequence of the long lasting wars between France and England.


Institut de recherche et d’histoire des textes

Paris, France

CNRS institute devoted to fundamental research on medieval manuscripts and early printed books.


Paris, France

The global leader in handwriting recognition, text extraction and document classification software.

Pattern Recognition and Human Language Technology

Valencia, Spain

Universitat Politècnica de València research center dedicated to Multimodal Interaction, Pattern Recognition, Image Processing and Language Processing

Artificial Intelligence and Cognitive Engineering

Groningen, The Netherlands

Research center covering Cognitive Modeling, Multi-Agent Systems, Autonomous Perceptive Systems, Sensory Cognition.




Machine Learning and Deep Learning Agency specialized on Data Science and Data Viz

Archives Nationales


The French national Archive public service

Bibliothèque Nationale de France


Collect, preserve, enrich and make available the French national documentary heritage


On HIMANIS corpus

  • Preparatory KWS Experiments for Large-Scale Indexing of a Vast Medieval Manuscript Collection in the HIMANIS Project., DOI: 10.1109/ICDAR.2017.59, 2017

On other corpora