Department of Computer Science and Technology

Technical reports

Wearing proper combinations

Karen Spärck Jones

November 2005, 27 pages

DOI: 10.48456/tr-655

Abstract

This paper discusses the proper treatment of multiple indexing fields, representations, or streams, in document retrieval. Previous experiments by Robertson and his colleagues have shown that, with a widely used type of term weighting and fields that share keys, document scores should be computed using term frequencies over fields rather than by combining field scores. Here I examine a wide range of document and query indexing situations, and consider their implications for this approach to document scoring.

Full text

PDF (0.3 MB)

BibTeX record

@TechReport{UCAM-CL-TR-655,
  author =	 {Sp{\"a}rck Jones, Karen},
  title = 	 {{Wearing proper combinations}},
  year = 	 2005,
  month = 	 nov,
  url = 	 {https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-655.pdf},
  institution =  {University of Cambridge, Computer Laboratory},
  doi = 	 {10.48456/tr-655},
  number = 	 {UCAM-CL-TR-655}
}