ACM SIGMOD Anthology ACM SIGMOD dblp.uni-trier.de

On the Update of Term Weights in Dynamic Information Retrieval Systems.

Charles L. Viles, James C. French: On the Update of Term Weights in Dynamic Information Retrieval Systems. CIKM 1995: 167-174
@inproceedings{DBLP:conf/cikm/VilesF95,
  author    = {Charles L. Viles and
               James C. French},
  title     = {On the Update of Term Weights in Dynamic Information Retrieval
               Systems},
  booktitle = {CIKM '95, Proceedings of the 1995 International Conference on
               Information and Knowledge Management, November 28 - December
               2, 1995, Baltimore, Maryland, USA},
  publisher = {ACM},
  year      = {1995},
  pages     = {167-174},
  ee        = {db/conf/cikm/VilesF95.html, http://doi.acm.org/10.1145/221270.221561},
  crossref  = {DBLP:conf/cikm/95},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX

Abstract

Using the vector space information retrieval model, we show that the update of term weights under document insertions is computationally expensive for weighting schemes that use collection statistics and normalization by document vector lengths. In the dynamic setting, we argue that strict adherence to such schemes is impractical and unnecessary as long as retrieval effectiveness commensurate with strict adherence is attained. Experiments using standard test collections as a source of document insertions support this argument. These experiments indicate that term weights may drift from their mathematically defined values without a serious loss of retrieval effectiveness. The only problematic setting is when new terms are present in newly inserted documents. Ignoring these terms can cause an effectiveness degradation.

Copyright © 1995 by the ACM, Inc., used by permission. Permission to make digital or hard copies is granted provided that copies are not made or distributed for profit or direct commercial advantage, and that copies show this notice on the first page or initial screen of a display along with the full citation.


ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 2 Issue 4, CIKM, DOLAP, GIS, SIGFIDET, ..." and ... DVD Version: Load ACM SIGMOD Anthology DVD 1" and ... BibTeX

Printed Edition

CIKM '95, Proceedings of the 1995 International Conference on Information and Knowledge Management, November 28 - December 2, 1995, Baltimore, Maryland, USA. ACM 1995
Contents BibTeX

Online Edition

Citation Page BibTeX

References

[1]
IJsbrand Jan Aalbersberg: Posting Compression in Dynamic Retrieval Environments. SIGIR 1991: 72-81 BibTeX
[2]
Peter G. Anick, Rex A. Flynn: Integrating a Dynamic Lexicon with a Dynamic Full-Text Retrieval System. SIGIR 1993: 136-145 BibTeX
[3]
Eric W. Brown, James P. Callan, W. Bruce Croft: Fast Incremental Indexing for Full-Text Information Retrieval. VLDB 1994: 192-202 BibTeX
[4]
Douglas R. Cutting, Jan O. Pedersen: Optimizations for Dynamic Inverted Index Maintenance. SIGIR 1990: 405-411 BibTeX
[5]
Donna Harman: Overview of the Third Text REtrieval Conference (TREC-3). TREC 1994: 0- BibTeX
[6]
Shoshana Loeb, Douglas B. Terry: Information Filtering - Preface to the Secial Section. Commun. ACM 35(12): 26-28(1992) BibTeX
[7]
Michael Persin: Document Filtering for Fast Ranking. SIGIR 1994: 339-348 BibTeX
[8]
Gerard Salton: Dynamic Document Processing. Commun. ACM 15(7): 658-668(1972) BibTeX
[9]
Gerard Salton, Chris Buckley: Term-Weighting Approaches in Automatic Text Retrieval. Inf. Process. Manage. 24(5): 513-523(1988) BibTeX
[10]
Gerard Salton, Michael McGill: Introduction to Modern Information Retrieval. McGraw-Hill Book Company 1984, ISBN 0-07-054484-0
BibTeX
[11]
Peter Schäuble: SPIDER: A Multiuser Information Retrieval System for Semistructured and Dynamic Data. SIGIR 1993: 318-327 BibTeX
[12]
Anthony Tomasic, Hector Garcia-Molina, Kurt A. Shoens: Incremental Updates of Inverted Lists for Text Document Retrieval. SIGMOD Conference 1994: 289-300 BibTeX
[13]
Charles L. Viles, James C. French: Dissemination of Collection Wide Information in a Distributed Information Retrieval System. SIGIR 1995: 12-20 BibTeX
[14]
...
[15]
Tak W. Yan, Hector Garcia-Molina: Index Structures for Selective Dissemination of Information Under the Boolean Model. ACM Trans. Database Syst. 19(2): 332-364(1994) BibTeX
[16]
Justin Zobel, Alistair Moffat, Ron Sacks-Davis: An Efficient Indexing Technique for Full Text Databases. VLDB 1992: 352-362 BibTeX
BibTeX
ACM SIGMOD Anthology - DBLP: [Home | Search: Author, Title | Conferences | Journals]
CIKM 1995 Proceedings, ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Sat May 16 23:01:48 2009