A digital studio creating effective process automation.
Who we are.
Oliver Buxton DunnCo-founder and manager
Stephanie BrownPalaeography consultant
Ruth MurphyModern Transcription Lead
Yiannos StathopoulosCo-founder and tech advisor
Chloe AshleyAdmin lead
Alexis LitvineCo-founder and manager
Stan HintonSoftware Developer
Image and data acquisition (including image pre-processing treatment)
We create digital images from any historical documents, including archival work and photography. We also provide professional photographic equipment and can provide on-site or remote archival photography training.
You tell us what you want from your records, and we do the work - extracting all the useful data for your research and to help others make the most of your records.
Everything you need to know about your data, and the accuracy of each stage of the extraction process.
Data enrichment and visualisation
If you need an impactful presentation of your historical data, we offer public-oriented and interactive tools to increase the reach of your project.
Public participation in data gathering and correction
We manage crowd-sourcing campaigns for you or help you create the tools required for your project.
Data acquisition strategy consultancy
We provide flexible advice and support while you carry out your work, and to help you identify the best solutions for your research.
Grant writing and management consultancy
We have a wealth of experience in grant/funding application writing and management. As of 2023 we have a near 100% success rate in grants to which we have contributed.
Cust-3 (TNA) is an astonishing dataset that comprehensively captures England's global trade from 1697 to 1780. The list also includes government taxation, which is an important resource for understanding the rise of the British Empire amongst and other significant events.
On the right we see our models identifying and linking tabulated data. Jamaican imports included 'Elephants Teeth’, Tortoise Shells, and Snuff.
The NIC asked us to create new statistics that would show the development and timing of the UK's communications infrastructure.
Our data, shown on the right, shows telegrams transmitted since the beginning, encompassing a period when the world was first connected by electric cables to quickly transmit messages across cities and continents in seconds, rather than days, weeks, or months.
Charles III University Madrid commissioned us to conduct advanced keyword searches across a quarter of a century of printed material and to deliver results in a comprehensible format. One difficulty was to pick up thousands of line breaks accurately. Many OCR tools do not detect column breakers and so the text jumps across columns.
We built segmentation models to capture the layout of these documents and matched the output of the OCR’ed text to exisiting data captured by genealogists from the Genwiki project.
Such as handwritten Vatican reports of discussions about important theological issues dating from the fifteenth century. We transformed idiosyncratic Latin into digestible, fluid prose.
On challenge was to eliminate background watermarks to enhance machine reading accuracy. Identifying archaic abbreviations in the text was another.
Get in touch with us ...
Whether you need to extract data at scale, want to integrate geospatial data and historical records or need data consultancy, we'd love to hear from you.