Papers for Relevance Assessment by Enrique Amigó

Research Question: How can metrics and similarities from models (manual summaries) be combined without considering metric scales?

Paper ID (Link to PDF)

Title

Author(s)

M93-1007

MUC-5 Evaluation Metrics

Nancy Chinchor; Beth Sundheim

18_Paper

Multiple Similarity Measures and Source-Pair Information in Story Link Detection

Francine Chen; Ayman Farahat; Thorsten Brants

P01-1066

Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems

Marilyn A. Walker; Rebecca Passonneau; Julie E. Boland

W97-0601

Evaluating Interactive Dialogue Systems: Extending Component Evaluation to Integrated System Evaluation

Marilyn A. Walker; Diane J. Litman; Candace A. Kamm; Alicia Abella

Lin

ROUGE: A Package for Automatic Evaluation of Summaries

Chin-Yew Lin

W05-1203

Measuring the Semantic Similarity of Texts

Courtney Corley; Rada Mihalcea

215_pdf_2-col

Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics

Chin-Yew Lin; Franz Josef Och

W02-0716

Towards a Speech-to-Speech Machine Translation Quality Metric

Kurt Godden

Hori

Evaluation Measures Considering Sentence Concatenation for Automatic Summarization by Sentence or Word Extraction

Chiori Hori; Tsutomu Hirao; Hideki Isozaki

W98-1214

Choosing a Distance Metric for Automatic Word Categorization

Emin Erkan Korkmaz; Gokturk Ucoluk

W02-0406

Manual and Automatic Evaluation of Summaries

Chin-Yew Lin; Eduard Hovy

W03-w7_eacl03pastra.local

Colouring Summaries BLEU

Katerina Pastra

W00-1401

Evaluation Metrics for Generation

Srinivas Bangalore; Owen Rambow; Steve Whittaker

W03-1101

Improving Summarization Performance by Sentence Compression: A Pilot Study

Chin-Yew Lin

W05-0907

Evaluating DUC 2004 Tasks with the QARLA Framework

Enrique Amigó; Julio Gonzalo; Anselmo Peñas; Felisa Verdejo