PDFdigest: an adaptable layout-aware PDF-to-XML textual content extractor for scientific articles
Loading...
Date
Document Type
Document Version
Author
Citation
Ferrés D, Saggion H, Ronzano F, Bravo À. PDFdigest: an adaptable layout-aware PDF-to-XML textual content extractor for scientific articles. In: Calzolari N, Choukri K, Cieri C, Declerck T, Goggi S, Hasida K, Isahara H, Maegaard B, Mariani J, Mazo H, Moreno A, Odijk J, Piperidis S, Tokunaga T, editors. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018); 2018 May 7-12; Miyazaki, Japan. L18-1298.







