Computer Laboratory

Technical reports

Wearing proper combinations

Karen Spärck Jones

November 2005, 27 pages

Abstract

This paper discusses the proper treatment of multiple indexing fields, representations, or streams, in document retrieval. Previous experiments by Robertson and his colleagues have shown that, with a widely used type of term weighting and fields that share keys, document scores should be computed using term frequencies over fields rather than by combining field scores. Here I examine a wide range of document and query indexing situations, and consider their implications for this approach to document scoring.

Full text

PDF (0.3 MB)

BibTeX record

@TechReport{UCAM-CL-TR-655,
  author =	 {Sp{\"a}rck Jones, Karen},
  title = 	 {{Wearing proper combinations}},
  year = 	 2005,
  month = 	 nov,
  url = 	 {http://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-655.pdf},
  institution =  {University of Cambridge, Computer Laboratory},
  number = 	 {UCAM-CL-TR-655}
}