Keyword Searching and Digital Archives on Web: Challenges and Innovations in GLAM

Authors

  • Anil Kumar Sinha Department of Computer Science, V.K.S.U. Ara Author
  • Ankit Kumar Department of Computer Science, V.K.S.U. Ara Author
  • Khusboo Kumari P.G. Department of Physics, V.K.S.U. Ara Author
  • B. K. Mishra P.G. Department of Physics, V.K.S.U. Ara Author

DOI:

https://doi.org/10.56294/la2025155

Keywords:

keyword searching, digital archives, GLAM, semantic search, AI in heritage, TF-IDF, BERT, metadata

Abstract

Introduction; In the evolving digital landscape, keyword searching plays a pivotal role in facilitating access to information stored in cultural heritage archives. 
Objective; This paper explores the current challenges and recent innovations in keyword search technologies within the Galleries, Libraries, Archives, and Museums (GLAM) sector, emphasizing web-based retrieval systems. With the growth of digital archives such as Europeana and the Digital Public Library of America (DPLA), institutions face complexities in semantic search, multilingual access, and metadata standardization. 
Method; We evaluate traditional keyword models like TF-IDF against advanced AI-based approaches such as BERT, focusing on their effectiveness in web contexts. 
Result; Through case studies and performance evaluations, we identify promising methodologies that improve semantic relevance and user accessibility. 
Conclusion; The findings reveal that BERT-based models significantly outperform legacy methods, particularly in multilingual and semantically ambiguous search environments. The paper concludes with strategic recommendations for implementing AI-driven keyword search frameworks in GLAM archives.

References

1. Terras M. Digitization and digital resources in the. Digit Humanit Pract. 2012;47.

2. Azam A, Haque A, Rai SR. Predicting Housing Sale Prices Using Machine Learning with Various Data Split Ratios. Data Metadata [Internet]. 2024 Dec 15;3. Available from: https://dm.ageditor.ar/index.php/dm/article/view/231

3. Almrezeq N, Haque MA, Haque S, El-Aziz AAA. Device Access Control and Key Exchange (DACK) Protocol for Internet of Things. Int J Cloud Appl Comput [Internet]. 2022 Jan;12(1):1–14. Available from: https://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJCAC.297103

4. Wevers M, Smits T. The visual digital turn: Using neural networks to study historical images. Digit Scholarsh Humanit. 2020;35(1):194–207.

5. Singh PJYNJ. Contemporary Era of Artificial Intelligence based Digital Archiving in Libraries: A Virtual Approach. Emerg Technol Libr Trends Dev. :143.

6. Stanković R, Krstev C, Vitas D, Vulović N, Kitanović O. Keyword-based search on bilingual digital libraries. In: Semantic Keyword-Based Search on Structured Data Sources: COST Action IC1302 Second International KEYSTONE Conference, IKC 2016, Cluj-Napoca, Romania, September 8–9, 2016, Revised Selected Papers 2. Springer; 2017. p. 112–23.

7. de Mooij J, Kurtan C, Baas J, Dastani M. A Computational Framework for Organizing and Querying Cultural Heritage Archives. J Comput Cult Herit. 2022;15(3):1–25.

8. RENN J. EUROPEAN CULTURAL HERITAGE ONLINE.

9. Devlin J, Chang MW, Lee K, Toutanova K. Bert: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers). 2019. p. 4171–86.

10. Beel J, Gipp B, Langer S, Breitinger C. Paper recommender systems: a literature survey. Int J Digit Libr. 2016;17(4):305–38.

11. Kenny E. Europeana: Cultural Heritage in the Digital Age. In: Migrating Heritage. Routledge; 2016. p. 85–94.

Downloads

Published

2025-05-04

How to Cite

1.
Kumar Sinha A, Kumar A, Kumari K, K. Mishra B. Keyword Searching and Digital Archives on Web: Challenges and Innovations in GLAM. Land and Architecture [Internet]. 2025 May 4 [cited 2025 Aug. 29];4:155. Available from: https://la.ageditor.ar/index.php/la/article/view/155