Document Analysis and Retrieval Tasks in Scientific Digital Libraries

Page view(s)
0
Checked on
Document Analysis and Retrieval Tasks in Scientific Digital Libraries
Title:
Document Analysis and Retrieval Tasks in Scientific Digital Libraries
Journal Title:
Russian Summer School in Information Retrieval
OA Status:
Publication Date:
10 December 2015
Citation:
Gollapalli S.D., Caragea C., Li X., Giles C.L. (2015) Document Analysis and Retrieval Tasks in Scientific Digital Libraries. In: Braslavski P., Karpov N., Worring M., Volkovich Y., Ignatov D. (eds) Information Retrieval. RuSSIR 2014. Communications in Computer and Information Science, vol 505. Springer, Cham
Abstract:
Machine Learning (ML) algorithms have opened up new possibilities for the acquisition and processing of documents in Information Retrieval (IR) systems. Indeed, it is now possible to automate several labor-intensive tasks related to documents such as categorization and entity extraction. Consequently, the application of machine learning techniques for various large-scale IR tasks has gathered significant research interest in both the ML and IR communities. This tutorial provides a reference summary of our research in applying machine learning techniques to diverse tasks in Digital Libraries (DL). Digital library portals are specialized IR systems that work on collections of documents related to particular domains. We focus on open-access, scientific digital libraries such as CiteSeer\(^x\), which involve several crawling, ranking, content analysis, and metadata extraction tasks. We elaborate on the challenges involved in these tasks and highlight how machine learning methods can successfully address these challenges.
License type:
PublisherCopyrights
Funding Info:
Description:
ISBN:
978-3-319-25484-5
978-3-319-25485-2
Files uploaded:
File Size Format Action
There are no attached files.