
PicAxe: Extracting Figures from Structurally and Syntactically Heterogeneous Corpora of PDF Files
Authors
Krishna Kamath
Master’s Program in Computer Science, University of Chicago, Chicago, Illinois
Qilin Zhou
Master’s Program in Computer Science, University of Chicago, Chicago, Illinois
Bruno Felalaga
Master’s Program in Computer Science, University of Chicago, Chicago, Illinois
Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois
DOI: https://doi.org/10.5334/jors.574 | Journal eISSN: 2049-9647
Language: English
Submitted on: Apr 28, 2025
Accepted on: Dec 1, 2025
Published on: Dec 16, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year
Keywords:
© 2025 Anna C. Guerrero, Krishna Kamath, Qilin Zhou, Bruno Felalaga, Julia Damerow, Aaron R. Dinner, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.