On the Update of Term Weights in Dynamic Information Retrieval Systems.
Charles L. Viles, James C. French:
On the Update of Term Weights in Dynamic Information Retrieval Systems.
CIKM 1995: 167-174@inproceedings{DBLP:conf/cikm/VilesF95,
author = {Charles L. Viles and
James C. French},
title = {On the Update of Term Weights in Dynamic Information Retrieval
Systems},
booktitle = {CIKM '95, Proceedings of the 1995 International Conference on
Information and Knowledge Management, November 28 - December
2, 1995, Baltimore, Maryland, USA},
publisher = {ACM},
year = {1995},
pages = {167-174},
ee = {db/conf/cikm/VilesF95.html, http://doi.acm.org/10.1145/221270.221561},
crossref = {DBLP:conf/cikm/95},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX
Abstract
Using the vector space information retrieval model, we show that the update of term weights under document insertions is computationally expensive for weighting schemes that use collection statistics and normalization by document vector lengths.
In the dynamic setting, we argue that strict adherence to such schemes is impractical and unnecessary as long as retrieval effectiveness commensurate with strict adherence is attained.
Experiments using standard test collections as a source of document insertions support this argument.
These experiments indicate that term weights may drift from their mathematically defined values without a serious loss of retrieval effectiveness.
The only problematic setting is when new terms are present in newly inserted documents.
Ignoring these terms can cause an effectiveness degradation.
Copyright © 1995 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
CDROM Version: Load the CDROM "Volume 2 Issue 4, CIKM, DOLAP, GIS, SIGFIDET, ..." and ...
DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...
BibTeX
Printed Edition
CIKM '95, Proceedings of the 1995 International Conference on Information and Knowledge Management, November 28 - December 2, 1995, Baltimore, Maryland, USA.
ACM 1995
Contents BibTeX
Online Edition
Citation Page
BibTeX
References
- [1]
- IJsbrand Jan Aalbersberg:
Posting Compression in Dynamic Retrieval Environments.
SIGIR 1991: 72-81 BibTeX
- [2]
- Peter G. Anick, Rex A. Flynn:
Integrating a Dynamic Lexicon with a Dynamic Full-Text Retrieval System.
SIGIR 1993: 136-145 BibTeX
- [3]
- Eric W. Brown, James P. Callan, W. Bruce Croft:
Fast Incremental Indexing for Full-Text Information Retrieval.
VLDB 1994: 192-202 BibTeX
- [4]
- Douglas R. Cutting, Jan O. Pedersen:
Optimizations for Dynamic Inverted Index Maintenance.
SIGIR 1990: 405-411 BibTeX
- [5]
- Donna Harman:
Overview of the Third Text REtrieval Conference (TREC-3).
TREC 1994: 0- BibTeX
- [6]
- Shoshana Loeb, Douglas B. Terry:
Information Filtering - Preface to the Secial Section.
Commun. ACM 35(12): 26-28(1992) BibTeX
- [7]
- Michael Persin:
Document Filtering for Fast Ranking.
SIGIR 1994: 339-348 BibTeX
- [8]
- Gerard Salton:
Dynamic Document Processing.
Commun. ACM 15(7): 658-668(1972) BibTeX
- [9]
- Gerard Salton, Chris Buckley:
Term-Weighting Approaches in Automatic Text Retrieval.
Inf. Process. Manage. 24(5): 513-523(1988) BibTeX
- [10]
- Gerard Salton, Michael McGill:
Introduction to Modern Information Retrieval.
McGraw-Hill Book Company 1984, ISBN 0-07-054484-0
BibTeX
- [11]
- Peter Schäuble:
SPIDER: A Multiuser Information Retrieval System for Semistructured and Dynamic Data.
SIGIR 1993: 318-327 BibTeX
- [12]
- Anthony Tomasic, Hector Garcia-Molina, Kurt A. Shoens:
Incremental Updates of Inverted Lists for Text Document Retrieval.
SIGMOD Conference 1994: 289-300 BibTeX
- [13]
- Charles L. Viles, James C. French:
Dissemination of Collection Wide Information in a Distributed Information Retrieval System.
SIGIR 1995: 12-20 BibTeX
- [14]
- ...
- [15]
- Tak W. Yan, Hector Garcia-Molina:
Index Structures for Selective Dissemination of Information Under the Boolean Model.
ACM Trans. Database Syst. 19(2): 332-364(1994) BibTeX
- [16]
- Justin Zobel, Alistair Moffat, Ron Sacks-Davis:
An Efficient Indexing Technique for Full Text Databases.
VLDB 1992: 352-362 BibTeX
BibTeX
ACM SIGMOD Anthology - DBLP:
[Home | Search: Author, Title | Conferences | Journals]
CIKM 1995 Proceedings, ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:01:48 2009