PUBLICATIONS
VICTEUR Project Publications
2026
- S. Guan, M. Lin, C. Xu, J. Zhao, and D. Greene, “Teaching VLMs to Admit Uncertainty in OCR from Lossy Visual Inputs,” in Proc. 14th International Conference on Learning Representations (ICLR 2026), 2026
- S. Datta, D. Roy, D. Greene, G. Meaney, K. Wade, and P. Mayr, “Cultural Analytics for Good: Building Inclusive Evaluation Frameworks for Historical IR,” in Advances in Information Retrieval (ECIR 2026), Lecture Notes in Computer Science, vol. 16485, Springer, Cham, 2026. [Link]
2025
- K. Wade. Mudie’s Select Library and the Shelf Life of the Nineteenth–Century Novel. Cambridge University Press, 2025. [Link]
- C. O’Neill, “Weder Fisch noch Fleisch: Tracing the Joycean in Terézia Mora’s Alle Tage,” Angermion, vol. 18, no. 1, pp. 39–58, 2025.
- S. Guan, M. Lin, C. Xu, X. Liu, J. Zhao, J. Fan, Q. Xu, and D. Greene, “PreP-OCR: A Complete Pipeline for Document Image Restoration and Enhanced OCR Accuracy,” in Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), 2025. [Link]
- C. O’Neill, “Familial Trauma and Queer Sexual Catharsis in Neel Mukherjee’s A Life Apart,” in Queer Trauma Across Borders, A. Sathi, Ed., Palgrave Studies in Queer Literary, Visual and Material Cultures, vol. 1, Palgrave Macmillan, Cham, 2025.
- S. Saha, S. Datta, D. Roy, M. Mitra, and D. Greene, “Combining Query Performance Predictors: A Reproducibility Study,” in Advances in Information Retrieval (ECIR 2025), Lecture Notes in Computer Science, vol. 15575, Springer, Cham, 2025. [Link]
- S. Datta, D. Roy, D. Greene, and G. Meaney, “Tales and Truths: Exploring the Linguistic Journey of 19th Century Literature and Non-Fiction,” in Proceedings of the European Conference on Information Retrieval (ECIR’25), 2025. [Link]
2024
- M. Kelleher and K. Wade, “Irish Literary Feminism and Its Digital Archive(s),” in Technology in Irish Literature and Culture, J. O’Sullivan and M. Kelleher, Eds. Cambridge University Press, 2023, pp. 235–252. [Link]
- K. Mishler (Ed.), A Visit from the Banshee: Irish Ghost Stories and Supernatural Tales, MoLI Editions, 2024. [Link]
- S. Guan, C. Xu, M. Lin, and D. Greene, “Effective Synthetic Data and Test-Time Adaptation for OCR Correction,” in Proc. 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP’24), 2024. [Link]
- K. Wade, L. Cassidy, and D. Greene, “Mudie’s Select Library Catalogues, 1848–1907,” in Harvard Dataverse, 2024. [Link]
- S. Datta, D. Roy, D. Greene, and G. Meaney, “Unveiling Temporal Trends in 19th Century Literature: An Information Retrieval Approach,” in ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL’24), 2024 [Link]
- S. Guan and D. Greene, “Synthetically Augmented Self-Supervised Fine-Tuning for Diverse Text OCR Correction,” in Proc. 50th European Conference on Artificial Intelligence (ECAI’24), 2024 [Link]
- S. Guan and D. Greene, “Advancing post-OCR correction: A comparative study of synthetic data,” in Findings of the Association for Computational Linguistics: ACL 2024, 2024 [Link]
2023
- B. Wickes, “Sound in Place: Italian Migrant Street Music in the Nineteenth-Century Novel,” in The Palgrave Handbook of European Migration in Literature and Culture, C. Stan and C. Sussman, Eds. Palgrave Macmillan, 2023, pp. 415–433. [Link]
- S. Sawant, S. Thakare, D. Greene, G. Meaney, and A. Smeaton, “Handwriting Analysis on the Diaries of Rosamond Jacob,” in Proc. 20th International Conference on Content-based Multimedia Indexing, 2023 [Link]
DATA
New data for new research
Research and podcasts
Text required
