Keyword Searching and Digital Archives on Web: Challenges and Innovations in GLAM
DOI:
https://doi.org/10.56294/la2025155Keywords:
keyword searching, digital archives, GLAM, semantic search, AI in heritage, TF-IDF, BERT, metadataAbstract
Introduction; In the evolving digital landscape, keyword searching plays a pivotal role in facilitating access to information stored in cultural heritage archives.
Objective; This paper explores the current challenges and recent innovations in keyword search technologies within the Galleries, Libraries, Archives, and Museums (GLAM) sector, emphasizing web-based retrieval systems. With the growth of digital archives such as Europeana and the Digital Public Library of America (DPLA), institutions face complexities in semantic search, multilingual access, and metadata standardization.
Method; We evaluate traditional keyword models like TF-IDF against advanced AI-based approaches such as BERT, focusing on their effectiveness in web contexts.
Result; Through case studies and performance evaluations, we identify promising methodologies that improve semantic relevance and user accessibility.
Conclusion; The findings reveal that BERT-based models significantly outperform legacy methods, particularly in multilingual and semantically ambiguous search environments. The paper concludes with strategic recommendations for implementing AI-driven keyword search frameworks in GLAM archives.
References
1. Terras M. Digitization and digital resources in the. Digit Humanit Pract. 2012;47.
2. Azam A, Haque A, Rai SR. Predicting Housing Sale Prices Using Machine Learning with Various Data Split Ratios. Data Metadata [Internet]. 2024 Dec 15;3. Available from: https://dm.ageditor.ar/index.php/dm/article/view/231
3. Almrezeq N, Haque MA, Haque S, El-Aziz AAA. Device Access Control and Key Exchange (DACK) Protocol for Internet of Things. Int J Cloud Appl Comput [Internet]. 2022 Jan;12(1):1–14. Available from: https://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJCAC.297103
4. Wevers M, Smits T. The visual digital turn: Using neural networks to study historical images. Digit Scholarsh Humanit. 2020;35(1):194–207.
5. Singh PJYNJ. Contemporary Era of Artificial Intelligence based Digital Archiving in Libraries: A Virtual Approach. Emerg Technol Libr Trends Dev. :143.
6. Stanković R, Krstev C, Vitas D, Vulović N, Kitanović O. Keyword-based search on bilingual digital libraries. In: Semantic Keyword-Based Search on Structured Data Sources: COST Action IC1302 Second International KEYSTONE Conference, IKC 2016, Cluj-Napoca, Romania, September 8–9, 2016, Revised Selected Papers 2. Springer; 2017. p. 112–23.
7. de Mooij J, Kurtan C, Baas J, Dastani M. A Computational Framework for Organizing and Querying Cultural Heritage Archives. J Comput Cult Herit. 2022;15(3):1–25.
8. RENN J. EUROPEAN CULTURAL HERITAGE ONLINE.
9. Devlin J, Chang MW, Lee K, Toutanova K. Bert: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers). 2019. p. 4171–86.
10. Beel J, Gipp B, Langer S, Breitinger C. Paper recommender systems: a literature survey. Int J Digit Libr. 2016;17(4):305–38.
11. Kenny E. Europeana: Cultural Heritage in the Digital Age. In: Migrating Heritage. Routledge; 2016. p. 85–94.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Anil Kumar Sinha, Ankit Kumar, Khusboo Kumari, B. K. Mishra (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.
The article is distributed under the Creative Commons Attribution 4.0 License. Unless otherwise stated, associated published material is distributed under the same licence.