Information Retrieval Models Foundations and Relationships

Information Retrieval (IR) models are a core component of IR research and IR systems. The past decade brought a consolidation of the family of IR models, which by 2000 consisted of relatively isolated views on TF-IDF (Term-Frequency times Inverse-Document-Frequency) as the weighting scheme in the ve...

Full description

Saved in:
Bibliographic Details
Main Author: Roelleke, Thomas (Author)
Format: Book
Language:English
Published: San Rafael, Calif. Morgan & Claypool 2013
Series:Synthesis lectures on information concepts, retrieval, and services
Subjects:
Online Access:Click Here to View Status and Holdings.
Tags: Add Tag
No Tags, Be the first to tag this record!

MARC

LEADER 00000aam a2200000#i 4501
001 wils-941439
005 2019451714
006 a af 1 c
007 zz
008 200123t 2013XXU ag| |#001 #cENG#D
020 # # |a 9781627050784  |q paperback 
040 # # |a UiTM  |b eng  |c UiTM  |d UiTM  |e rda 
041 0 # |a eng 
090 0 0 |a ZA3075  |b .R645 2013 
100 1 # |a Roelleke, Thomas  |e author 
245 1 0 |a Information Retrieval Models  |b Foundations and Relationships  |c Thomas Roelleke 
264 # 1 |a San Rafael, Calif.  |b Morgan & Claypool  |c 2013 
264 # 4 |c ©2013 
300 # # |a xxi, 141 pages  |b illustrations  |c 27 cm 
336 # # |a text  |2 rdacontent 
337 # # |a unmediated  |2 rdamedia 
338 # # |a volume  |2 rdacarrier 
490 0 # |a Synthesis Lectures On Information Concepts, Retrieval, And Services 
504 # # |a Includes bibliographical references (pages 127-134) and indexes 
520 # # |a Information Retrieval (IR) models are a core component of IR research and IR systems. The past decade brought a consolidation of the family of IR models, which by 2000 consisted of relatively isolated views on TF-IDF (Term-Frequency times Inverse-Document-Frequency) as the weighting scheme in the vector-space model (VSM), the probabilistic relevance framework (PRF), the binary independence retrieval (BIR) model, BM25 (Best-Match Version 25, the main instantiation of the PRF/BIR), and language modelling (LM). Also, the early 2000s saw the arrival of divergence from randomness (DFR). Regarding intuition and simplicity, though LM is clear from a probabilistic point of view, several people stated: "It is easy to understand TF-IDF and BM25. For LM, however, we understand the math, but we do not fully understand why it works." This book takes a horizontal approach gathering the foundations of TF-IDF, PRF, BIR, Poisson, BM25, LM, probabilistic inference networks (PIN's), and divergence-based models. The aim is to create a consolidated and balanced view on the main models. A particular focus of this book is on the "relationships between models." This includes an overview over the main frameworks (PRF, logical IR, VSM, generalized VSM) and a pairing of TF-IDF with other models. It becomes evident that TF-IDF and LM measure the same, namely the dependence (overlap) between document and query. The Poisson probability helps to establish probabilistic, non-heuristic roots for TF-IDF, and the Poisson parameter, average term frequency, is a binding link between several retrieval models and model parameters. 
650 # 0 |a Information retrieval  |x Mathematical models 
830 # 0 |a Synthesis lectures on information concepts, retrieval, and services 
856 4 0 |z Click Here to View Status and Holdings.  |u https://opac.uitm.edu.my/opac/detailsPage/detailsHome.jsp?tid=941439