
The Giles Ecosystem – Storage, Text Extraction, and OCR of Documents
References
-
ABBYY
ABBYY FineReader [computer software]
2017
https://www.abbyy.com/en-us/finereader/ -
Adobe Systems
Adobe Acrobat Reader [computer software]
2017
https://get.adobe.com/reader/ -
Apache Software Foundation
Apache Kafka [computer software]
2017
https://kafka.apache.org/ -
Apache Software Foundation
Apache Zookeeper [computer software]
2017
https://zookeeper.apache.org/ -
Casties
R
Raspe
M
digilib [computer software]
2017
http://digilib.sourceforge.net/ -
Glyph & Cog, LLC
pdf to text; distributed as part of Xpdf [computer software]
2014
http://www.foolabs.com/xpdf/home.html -
Hockey
S M
Electronic texts in the Humanities 2000 Oxford University Press 11 23 10.1093/acprof:oso/9780198711940.003.0002 Chapter 2, Creating and Acquiring Electronic Texts -
Huculak
J M
Justice
B
Report for the University of Victoria Libraries on Fedora Commons-Based DAMS: Building Collaborative Scholarship Environments, A Test Case
Available from:
http://hdl.handle.net/1828/7212 [Accessed 16th December 2016] -
Oracle Corporation
MySQL [computer software]
2017
https://www.mysql.com/ -
Peirson
E B
Tutorial: Text Extraction and OCR with Tesseract and ImageMagick
2015
Available from:
https://diging.atlassian.net/wiki/display/DCH/Tutorial%3A+Text+Extraction+and+OCR+with+Tesseract+and+ImageMagick [Accessed 15th December 2016] -
Princeton University Library
Plum: A Hydra head to support digitization workflows
Available from:
https://github.com/pulibrary/plum [Accessed 16th December 2016] -
Schmidt
B
Tutorial: Command-line OCR on a Mac
Available from:
http://benschmidt.org/dighist13/?page_id=129 [Accessed 15th December 2016] -
Smith
R
Tesseract OCR [computer software]
2017
https://github.com/tesseract-ocr/tesseract
DOI: https://doi.org/10.5334/jors.164 | Journal eISSN: 2049-9647
Language: English
Submitted on: Feb 13, 2017
Accepted on: Sep 15, 2017
Published on: Sep 28, 2017
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year
Keywords:
© 2017 Julia Damerow, B. R. Erick Peirson, Manfred D. Laubichler, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.