Case Studies
Digitization for a National Library
The Client
A National Library, an information treasure trove of the country's knowledge, history and culture, with millions of books, manuscripts and maps covering every subject
Business Needs & Challenges
Improve document management and retrieval. The project required the digitization of books on History and Culture. The service included data capture and creation of searchable PDFs (image plus hidden text). We processed around 20,000 pages, meeting the major challenge of maintaining the pixel measurements and the resolution as well as the height and width of the original document layout.
Datamatics Solution
The services provided by Datamatics on the project included:
- Data capture from TIFFs
- Creation of searchable PDFs and HTML
The data capture was done from the TIFFs. The quality check and the validation were done using internally developed tools to keep accuracy above 99.99% as required by customer. Then, the searchable PDFs and HTML documents were created. The Datamatics team also met the challenge of capturing the Unicode-UTF 8 characters present in the input documents.
Benefits
- The improved document management system has reduced time and efforts on retrieval
- The digitization of the books on History and Culture has enabled conservation of important historical and cultural information and prevents damage to or loss of hard copies
- The documents can also be identified and retrieved easily and accurately because of the improved naming conventions used for each book and storage of the same
To know more about content management solutions from Datamatics, please write to business@datamatics.com.
