Information Retrieval Models Foundations and Relationships
Information Retrieval (IR) models are a core component of IR research and IR systems. The past decade brought a consolidation of the family of IR models, which by 2000 consisted of relatively isolated views on TF-IDF (Term-Frequency times Inverse-Document-Frequency) as the weighting scheme in the ve...
Saved in:
Main Author: | |
---|---|
Format: | Book |
Language: | English |
Published: |
San Rafael, Calif.
Morgan & Claypool
2013
|
Series: | Synthesis lectures on information concepts, retrieval, and services
|
Subjects: | |
Online Access: | Click Here to View Status and Holdings. |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
MARC
LEADER | 00000aam a2200000#i 4501 | ||
---|---|---|---|
001 | wils-941439 | ||
005 | 2019451714 | ||
006 | a af 1 c | ||
007 | zz | ||
008 | 200123t 2013XXU ag| |#001 #cENG#D | ||
020 | # | # | |a 9781627050784 |q paperback |
040 | # | # | |a UiTM |b eng |c UiTM |d UiTM |e rda |
041 | 0 | # | |a eng |
090 | 0 | 0 | |a ZA3075 |b .R645 2013 |
100 | 1 | # | |a Roelleke, Thomas |e author |
245 | 1 | 0 | |a Information Retrieval Models |b Foundations and Relationships |c Thomas Roelleke |
264 | # | 1 | |a San Rafael, Calif. |b Morgan & Claypool |c 2013 |
264 | # | 4 | |c ©2013 |
300 | # | # | |a xxi, 141 pages |b illustrations |c 27 cm |
336 | # | # | |a text |2 rdacontent |
337 | # | # | |a unmediated |2 rdamedia |
338 | # | # | |a volume |2 rdacarrier |
490 | 0 | # | |a Synthesis Lectures On Information Concepts, Retrieval, And Services |
504 | # | # | |a Includes bibliographical references (pages 127-134) and indexes |
520 | # | # | |a Information Retrieval (IR) models are a core component of IR research and IR systems. The past decade brought a consolidation of the family of IR models, which by 2000 consisted of relatively isolated views on TF-IDF (Term-Frequency times Inverse-Document-Frequency) as the weighting scheme in the vector-space model (VSM), the probabilistic relevance framework (PRF), the binary independence retrieval (BIR) model, BM25 (Best-Match Version 25, the main instantiation of the PRF/BIR), and language modelling (LM). Also, the early 2000s saw the arrival of divergence from randomness (DFR). Regarding intuition and simplicity, though LM is clear from a probabilistic point of view, several people stated: "It is easy to understand TF-IDF and BM25. For LM, however, we understand the math, but we do not fully understand why it works." This book takes a horizontal approach gathering the foundations of TF-IDF, PRF, BIR, Poisson, BM25, LM, probabilistic inference networks (PIN's), and divergence-based models. The aim is to create a consolidated and balanced view on the main models. A particular focus of this book is on the "relationships between models." This includes an overview over the main frameworks (PRF, logical IR, VSM, generalized VSM) and a pairing of TF-IDF with other models. It becomes evident that TF-IDF and LM measure the same, namely the dependence (overlap) between document and query. The Poisson probability helps to establish probabilistic, non-heuristic roots for TF-IDF, and the Poisson parameter, average term frequency, is a binding link between several retrieval models and model parameters. |
650 | # | 0 | |a Information retrieval |x Mathematical models |
830 | # | 0 | |a Synthesis lectures on information concepts, retrieval, and services |
856 | 4 | 0 | |z Click Here to View Status and Holdings. |u https://opac.uitm.edu.my/opac/detailsPage/detailsHome.jsp?tid=941439 |