Anna Korhonen

Anna Korhonen - Royal Society University Research Fellow

University of Cambridge Computer Laboratory
William Gates Building, 15 JJ Thomson Avenue
Cambridge CB3 0FD, UK
Office: GS-12
Phone: (+44) 1223 763 672

Department of Theoretical and Applied Linguistics (DTAL)
Faculty of English Building, 9 West Road
Cambridge CB3 9DB, UK
Office: TR-12
Phone: (+44) 1223 767 389

Email:
anna.korhonen @ cl.cam.ac.uk

News

Current students: Please see my project proposals for 2013-2014.

Prospective PhD students: I supervise PhD students at the Computer Laboratory and DTAL. Please take a look at my current research interests and ongoing projects, and read the departmental pages on postgraduate opportunities before contacting me. If you are interested in pursuing a PhD, please contact me well in advance.

Research

My research has principally been in the area of Natural Language Processing and Computational Linguistics. Some current areas of interest include:

  • automatic lexical acquisition
  • computational lexical semantics
  • computational models of discourse
  • lexical and domain adaptation
  • unsupervised and lightly supervised approaches to NLP
  • scientific text processing and text mining
  • NLP for biomedicine
  • NLP for real-world applications
  • computational models of human language learning
  • computational neuro-linguistics

Biography

I am a Royal Society University Research Fellow at the University of Cambridge where I have a joint affiliation between the Computer Laboratory and the Department of Theoretical and Applied Linguistics (DTAL).

People

Current PhD students:

Current postdocs:

Past PhD students and postdocs:

Projects

Current projects

Recent projects

  • PANACEA - Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies.
    Funded by EU FP7 (2010-2012).
    Working with Laura Rimell, and project partners UPF (Spain), CNR-ILC (Italy), ILSP (Greece), Linguatec (Germany), DCU (Ireland)

  • Lexical Acquisition for the Biomedical Domain
    Funded by EPSRC (2009-2012).
    Working with Lin Sun, Diarmuid Ó Séaghdha, and Tom Lippincott.

  • Developing Multilingual Technologies for Automatic Lexical Acquisition.
    Funded by Isaac Newton Trust (2010-2012).
    Working with Tin Van de Cruys and Thierry Poibeau.

  • COMPLEX - Computational Natural Language Processing and the Neuro-Cognition of Language.
    Co-funded by EPSRC, ESRC and MRC (2008-2011).
    Working with with Lorraine K. Tyler, William Marslen-Wilson, and Paula Buttery.

  • Developing Multilingual Technologies for Automatic Lexical Acquisition.
    Funded by British Council (2008-2009).
    Working with Thierry Poibeau.

  • ACLEX - Accurate and Comprehensive Lexical Classification for Natural Language Processing Applications.
    Funded by EPSRC (2005-2008).
    Working with Ted Briscoe and Judita Preiss.

  • Using Automatic Verb Classification to Aid Event Extraction.
    JSPS Postdoctoral Fellowship.
    Funded by the Japan Society for the Promotion of Science (2004-2005)

  • FLYSLIP - Integrating Literature, Experiments and Curation in Drosophila Genomics Research.
    Funded by BBSRC (2004-2007).
    With Ted Briscoe, Simone Teufel, and Rachel Drysdale.

Teaching

I have (co-)taught the following courses in Cambridge

Computer Laboratory:

RCEAL and DTAL:

  • Computational Linguistics
  • Computational Corpus Linguistics
  • Computational Lexical Semantics
  • Computational Language Learning

Activities

Current activities:

Recent activities:

Media


Mining the Language of Science. Research Horizons. November 18, 2011.

Computer System Developed to Analyse the Cancer Risk of a Chemical. CNN News. November 21, 2011.

Publications

2014

Diarmuid Ó Séaghdha and Anna Korhonen. 2014. Probabilistic distributional semantics with latent variable models. Computational Linguistics. Accepted for publication.

Colin Kelly, Barry Devereux and Anna Korhonen. 2014. Automatic extraction of property norm-like data from large text corpora. Cognitive Science. Accepted for publication.

2013

Felix Hill, Anna Korhonen and Christian Bentz. 2013. A quantitative empirical analysis of the abstract/concrete distinction. Cognitive Science.
LINK

Ekaterina Shutova, Barry Devereux and Anna Korhonen. 2013. Conceptual Metaphor Theory Meets the Data: A Corpus-based Human Annotation Study. Language Resources and Evaluation.
LINK

Ekaterina Shutova, Jakub Kaplan, Simone Teufel and Anna Korhonen. 2013. A Computational Model of Logical Metonymy. ACM Transactions on Speech and Language Processing. 10(3). 11.
LINK

Jeroen Geertzen, Theodora Alexopoulou and Anna Korhonen. 2013. Automatic linguistic annotation of large scale L2 databases: The EF-Cambridge Open Language Database (EFCAMDAT). Accepted for publication in Proceedings of the 31st Second Language Research Forum (SLRF), Carnegie Mellon, Cascadilla Press.
LINK

Roi Reichart and Anna Korhonen. 2013. Improved Lexical Acquisition through DPP-based Verb Clustering. In Proceedings of ACL 2013, Sofia, Bulgaria.
LINK

Lin Sun, Diana McCarthy and Anna Korhonen. 2013. Diathesis alternation approximation for verb clustering. In Proceedings of ACL 2013, Sofia, Bulgaria.
LINK

Felix Hill, Douwe Kiela and Anna Korhonen. 2013. Concreteness and corpora: A theoretical and practical analysis. In Proceedings of the ACL 2013 Workshop on Cognitive Modelling and Computational Linguistics, Sofia, Bulgaria.
LINK

Felix Hill, Christian Bentz and Anna Korhonen. 2013. Large-scale empirical analyses of concreteness. In Proceedings of the Annual Meeting of the Cognitive Science Society, Berlin, Germany.
LINK

Colin Kelly, Barry Devereux and Anna Korhonen. 2013. Minimally Supervised Learning for Unconstrained Conceptual Property Extraction. In Proceedings of the Annual Meeting of the Cognitive Science Society, Berlin, Germany.
LINK

Yufan Guo, Roi Reichart and Anna Korhonen. 2013. Improved Information Structure Analysis of Scientific Documents Through Discourse and Lexical Constraints. In Proceedings of the NAACL-HLT 2013, Atlanta, US.
LINK

Tim Van de Cruys, Thierry Poibeau and Anna Korhonen. 2013. A Tensor-based Factorization Model of Semantic Compositionality. In Proceedings of the NAACL-HLT 2013, Atlanta, US.
LINK

Yufan Guo, Ilona Silins, Ulla Stenius and Anna Korhonen. 2013. Active learning-based information structure analysis of full scientific articles and two applications for biomedical literature review. Bioinformatics (2013) 29 (11): 1440-1447.
LINK

Thomas Lippincott, Laura Rimell, Karin Verspoor and Anna Korhonen. 2013. Approaches to verb subcategorization for biomedicine . Journal of Biomedical Informatics. Volume 46, Issue 2. Pages 212-227.
LINK

Thomas Lippincott, Laura Rimell, Helen L. Johnson, Karin Verspoor and Anna Korhonen. 2013. Acquisition and evaluation of verb subcategorization resources for biomedicine. Journal of Biomedical Informatics. Volume 46, Issue 2. Pages 228-237.
LINK

Aline Villavicencio, Thierry Poibeau, Anna Korhonen and Afra Alishahi. 2013. Cognitive Aspects of Computational Language Acquisition. Springer.
LINK

Thierry Poibeau, Aline Villavicencio, Anna Korhonen and Afra Alishahi. 2013. Computational Modeling as a Methodology for Studying Human Language Learning . In Cognitive Aspects of Computational Language Acquisition. Springer.
LINK

Anna Korhonen. 2013. Tools and Procedures for the Acquisition of Morphological and Syntactical Information from Corpora. In the International Handbook of Dictionaries. Mouton de Gruyter, Berlin.

2012

Roi Reichart and Anna Korhonen. 2012. Document and Corpus Level Inference For Unsupervised Learning of Information Structure of Scientific Documents. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Tim Van de Cruys, Laura Rimell, Thierry Poibeau, and Anna Korhonen. 2012. Multi-way Tensor Factorization for Unsupervised Lexical Acquisition. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Ekaterina Shutova, Tim van de Cruys and Anna Korhonen. 2012. Unsupervised Metaphor Paraphrasing Using a Vector Space Model. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Danish Contractor, Yufan Guo and Anna Korhonen. 2012. Using Argumentative Zones for Extractive Summarization of Scientific Articles. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Yufan Guo, Ilona Silins, Roi Reichart and Anna Korhonen. 2012. CRAB Reader: A Tool for Analysis and Visualization of Argumentative Zones in Scientific Literature. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Ekaterina Shutova, Simone Teufel and Anna Korhonen. 2012. Statistical Metaphor Processing. Computational Linguistics, 39(2).
LINK

Tom Lippincott, Diarmuid Ó Séaghdha and Anna Korhonen. 2012. Learning syntactic verb frames using graphical models. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012). Jeju, Korea.
LINK

Sandeep Kadekar, Ilona Silins, Anna Korhonen, Kristian Dreij, Lauy Al-Anati, Johan Hogberg and Ulla Stenius. 2012. Exocrine pancreatic carcinogenesis and autotaxin expression. PLoS ONE 7(8): e43209.
LINK

Anna Korhonen, Diarmuid Ó Séaghdha, Ilona Silins, Lin Sun, Johan Hogberg and Ulla Stenius. 2012. Text mining for literature review and knowledge discovery in cancer risk assessment and research. PLoS ONE 7(4):e33427.
LINK

Diarmuid Ó Séaghdha and Anna Korhonen. 2012. Modelling selectional preferences in a lexical hierarchy. In Proceedings of the 1st Joint Conference on Lexical and Computational Semantics (*SEM 2012). Montreal, QC.
LINK

Laura Rimell, Thierry Poibeau, and Anna Korhonen. 2012. Merging Lexicons for Higher Precision Subcategorization Frame Acquisition. Proceedings of the LREC 2012 Workshop on Language Resource Merging, Istanbul, Turkey.
LINK

Ilona Silins, Anna Korhonen, Johan Hogberg and Ulla Stenius. 2012. Data and Literature Gathering in Chemical Cancer Risk Assessment. Integrated Environmental Assessment and Management. 2012, Jan 3.
LINK

Colin Kelly, Barry Devereux and Anna Korhonen. 2012. Semi-supervised learning for automatic conceptual property extraction. Proceedings of the NAACL 2012 Cognitive Modeling and Computational Linguistics Workshop.
LINK

Omri Abend, Chris Biemann, Anna Korhonen, Ari Rappoport, Roi Reichart and Anders Sogaard. 2012. Proceedings of the EACL Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP.
LINK

Robert Berwick, Anna Korhonen, Thierry Poibeau and Aline Villavicencio. 2012. Proceedings of the EACL Workshop on Computational Models of Language Acquisition and Loss.
LINK

2011

Yufan Guo, Anna Korhonen, Ilona Silins and Ulla Stenius. 2011. Weakly-supervised learning of information structure of scientific abstracts - is it accurate enough to benefit real-world tasks in biomedicine? Bioinformatics 2011; doi: 10.1093/bioinformatics/btr536.
LINK

Tom Lippincott, Diarmuid Ó Séaghdha and Anna Korhonen. 2011. Exploring subdomain variation in biomedical language. BMC Bioinformatics 12:212.
LINK

Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Johan Hogberg and Ulla Stenius. 2011. A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment. BMC Bioinformatics 2011, 12:69.
LINK

Lin Sun and Anna Korhonen. Hierarchical Verb Clustering Using Graph Factorization. 2011. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK

Yufan Guo, Anna Korhonen and Thierry Poibeau. 2011. A Weakly-supervised Approach to Argumentative Zoning of Scientific Documents. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK

Tim Van de Cruys, Thierry Poibeau and Anna Korhonen. 2011. Latent Vector Weighting for Word Meaning in Context Edinburgh. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK

Diarmuid Ó Séaghdha and Anna Korhonen. 2011. Probabilistic models of similarity in syntactic context. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK

Omri Abend, Anna Korhonen, Ari Rappoport and Roi Reichart. 2011. Proceedings of the EMNLP Workshop on Unsupervised Learning in NLP.
LINK

Barry Devereux, Anna Korhonen, Paula Buttery and Lorraine Tyler. 2011. The role of verb subcategorization frames and selectional preferences in sentence processing: an investigation using corpus-derived measures. Multidisciplinary Workshop on the mental representation of verbal argument structure. Paris, France.

Barry Devereux, Anna Korhonen and Lorraine Tyler. 2011. Parsing sentences are unlikely: corpus-based analyses of the neural processing of verbs. International Conference on Cognitive Neuroscience (ICON). Palma, Mallorca, Spain.
LINK

Jie Zhuang, Barry Devereux, Anna Korhonen and Lorraine Tyler. 2011. Lexical and syntactic competition effects in verb processing: evidence from corpus-based statistics. International Conference on Cognitive Neuroscience (ICON). Palma, Mallorca, Spain.
LINK

Colin Kelly, Barry Devereux and Anna Korhonen. 2011. Automatic extraction of property norm-like features from large text corpora with gold standard, human and semantic-similarity evaluations. AMLaP. Paris, France.
LINK

2010

Anna Korhonen. 2010. Automatic Lexical Classification - Bridging Research and Practice. In Philoshophical Transactions A of the Royal Society. 368: 3621-3632.
LINK

Barry Devereux, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen. 2010. Towards unrestricted, large-scale acquisition of feature-based conceptual representations from corpus data. Research on Language and Computation.
PDF

Lin Sun, Thierry Poibeau, Anna Korhonen and Cedric Messiant. 2010. Investigating the cross-linguistic potential of VerbNet -style classification. In Proceedings of Coling. Beijing, China.
PDF

Ekaterina Shutova, Lin Sun and Anna Korhonen. 2010. Metaphor Identification Using Verb and Noun Clustering. In Proceedings of Coling. Beijing, China.
PDF

Tom Lippincott, Diarmuid O Seaghdha, Lin Sun and Anna Korhonen. 2010. Exploring variation across biomedical subdomains. In Proceedings of Coling. Beijing, China.
PDF

Barry Devereux, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen, 2010. Large-Scale Acquisition of Feature-Based Conceptual Representations from Textual Corpora. In Proceedings of the Annual Meeting of the Cognitive Science Society.
PDF

Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Lin Sun and Ulla Stenius. 2010. Identifying the Information Structure of Scientific Abstracts: An Investigation of Three Different Schemes. In Proceedings of bio-NLP 2010. Uppsala, Sweden
PDF

Sandeep Kadekar, Ilona Silins, Anna Korhonen, Johan Hogberg, Kristian Dreij, and Ulla Stenius. 2010. Carcinogen-induced inflammation and pancreatic cancer. In Proceedings of the 101th Annual Meeting of the American Association for Cancer Research. Washington, D.C., USA.
PDF

Colin Kelly, Barry Devereux and Anna Korhonen. 2010. Acquiring Human-like Feature-Based Conceptual Representations from Corpora. In Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
PDF

Barry Devereux, Colin Kelly and Anna Korhonen. 2010. Using fMRI Activation to Conceptual Stimuli to Evaluate Methods for Extracting Conceptual Representations from Corpora. In Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
PDF

Brian Murphy, Kai-min Kevin Chang and Anna Korhonen. 2010. Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
LINK

Stuart Moore, Anna Korhonen and Sabine Buchholz. 2010. Annotating the Enron Email Corpus with Number Senses. In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10). Valletta, Malta.
PDF

Barry Devereux, Colin Kelly, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen, 2010. The Acquisition of Unconstrained Feature-Based Conceptual Representations from Corpora. The Rovereto Workshop on Concepts, Actions, and Objects: Functional and Neural Perspectives.
PDF

2009

Anna Korhonen. 2009. Automatic Lexical Classification - Balancing between Machine Learning and Linguistics. In Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation. Hong Kong.
PDF

Anna Korhonen, Lin Sun, Ilona Silins, and Ulla Stenius. 2009. The First Step in the Development of Text Mining Technology for Cancer Risk Assessment: Identifying and Organizing Scientific Evidence in Risk Assessment Literature. In BMC Bioinformatics 2009, 10:303.
PDF

Stuart Moore, Anna Korhonen and Sabine Buchholz. 2009. Number Sense Disambiguation. In Proceedings of the 12th Conference of the Pacific Association for Computational Linguistics. Sapporo, Japan.
PDF

Lin Sun and Anna Korhonen. 2009. Improving Verb Clustering with Automatically Acquired Selectional Preferences. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore.
PDF

Lin Sun, Anna Korhonen, Ilona Silins, and Ulla Stenius. 2009. User-Driven Development of Text Mining Resources for Cancer Risk Assessment. In Proceedings of BioNLP. Boulder, Colorado.
PDF

Karin Kipper-Schuler, Anna Korhonen, and Susan Brown. 2009. Proceedings of the NAACL 2009 Tutorial on VerbNet and Its Applications. North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009 Boulder, Colorado.
PDF

Ilona Silins, Anna Korhonen, Johan Hogberg, Lin Sun, and Ulla Stenius. 2009. Improved Cancer Risk Assessment Using Text Mining. In Proceedings of the 100th Annual Meeting of the American Association for Cancer Research. Denver, Colorado.
PDF

Andreas Vlachos, Anna Korhonen, and Zoubin Ghahramani. 2009. Unsupervised and Constrained Dirichlet Process Mixture Models for Verb Clustering. In Proceedings of the EACL workshop on GEometrical Models of Natural Language Semantics. Athens, Greece.
PDF

2008

Anna Korhonen, Yuval Krymolowski and Nigel Collier. 2008. The Choice of Features for Classification of Verbs in Biomedical Texts. In Proceedings of Coling 2008. Manchester, UK.
PDF

Ian Lewin, Ilona Silins, Anna Korhonen, Johan Hogberg, and Ulla Stenius. 2008. A New Challenge for Text Mining: Cancer Risk Assessment. In Proceedings of the ISMB BioLINK Special Interest Group on Text Data Mining. Toronto, Canada.
PDF

Andreas Vlachos, Zoubin Ghahramani and Anna Korhonen. 2008. Dirichlet Process Mixture Models for Verb Clustering. In Proceedings of the ICML Workshop on Prior Knowledge for Text and Language. Helsinki, Finland.
PDF

Cedric Messiant, Anna Korhonen and Thierry Poibeau. 2008. LexSchem: A Large Subcategorization Lexicon for French Verbs. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC). Marrakech, Morocco.
PDF

Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2008. A Large-Scale Classification of English Verbs. In the Journal of Language Resources and Evaluation. 42(1). 21-40.
LINK

Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008. Verb Class Discovery from Rich Syntactic Data. In Proceedings of the 9th International Conference on Intelligent Text Processing and Computational Linguistics. Haifa, Israel.
PDF

Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008. Automatic Classification of English Verbs Using Rich Syntactic Features. In Proceedings of the 3rd International Joint Conference on Natural Language Processing. Hyderabad, India.
PDF

2007

Judita Preiss, Ted Briscoe and Anna Korhonen. 2007. A System for Large-scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. Prague, Czech Republic.
PDF

Paula Buttery and Anna Korhonen. 2007. I will shoot your shopping down and you can shoot all my tins - Automatic Lexical Acquisition from the CHILDES Database. In Proceedings of ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition. Prague, Czech Republic.
PDF

Paula Buttery, Aline Villavicencio and Anna Korhonen. 2007. The proceedings of the ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition. Prague, Czech Republic.
PDF

2006

Anna Korhonen, Yuval Krymolowski, and Nigel Collier. 2006. Automatic Classification of Verbs in Biomedical Texts. In Proceedings of ACL-COLING 2006. Sydney, Australia.
PDF

Yoko Mizuta, Anna Korhonen, Tony Mullen and Nigel Collier. 2006. Zone Analysis in Biology Articles as a Basis for Information Extraction. In the International Journal of Medical Informatics on Natural Language Processing in Biomedicine and Its Applications. 75(6). 468-87.
PDF

Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006. A Large-Scale Extension of VerbNet with Novel Verb Classes. In Proceedings of EURALEX. Turin, Italy.
DOC

Anna Korhonen, Yuval Krymolowski, and Ted Briscoe. 2006. A Large Subcategorization Lexicon for Natural Language Processing Applications. In Proceedings of the 5th international conference on Language Resources and Evaluation. Genova, Italy.
PDF

Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006. Extending VerbNet with Novel Verb Classes. In Proceedings of 5th international conference on Language Resources and Evaluation. Genova, Italy.
PDF

2005

Aline Villavicencio, Francis Bond, Anna Korhonen, and Diana McCarthy. 2005. Introduction to the Special Issue on Multiword Expressions: Having a Crack at a Hard Nut. In Computer Speech and Language. 19(4). 365-377.
LINK

Jeremy Yallop, Anna Korhonen and Ted Briscoe. 2005. Automatic Acquisition of Adjectival Subcategorization from Corpora. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics. Ann Arbor, Michigan.
PDF

Timothy Baldwin, Anna Korhonen and Aline Villavicencio. 2005. Proceedings of the ACL-SIGLEX 2005 Workshop on Deep Lexical Acquisition. Ann Arbor, Michigan.
PDF

Paula Buttery and Anna Korhonen. 2005. Large-scale Analysis of Verb Subcategorization Differences between Child Directed Speech and Adult Speech. In Proceedings of the Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes. Saarbrucken, Germany.
PDF

2004

Judita Preiss and Anna Korhonen. 2004. WSD for Subcategorization Acquisition Task Description. In Proceedings of the ACL SENSEVAL-3 Workshop. Barcelona, Spain.
PDF

Takaaki Tanaka, Aline Villavicencio, Francis Bond and Anna Korhonen. 2004. Proceedings of the ACL-SIGLEX 2004 Workshop on Multiword Expressions: Integrating Processing. Barcelona, Spain.
PDF

Anna Korhonen and Ted Briscoe. 2004. Extended Lexical-Semantic Classification of English Verbs. In Proceedings of the HLT/NAACL Workshop on Computational Lexical Semantics. Boston, MA.
PDF

2003

Anna Korhonen, Yuval Krymolowski and Zvika Marx. 2003. Clustering Polysemic Subcategorization Frame Distributions Semantically. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Sapporo, Japan. 64-71.
PDF

Anna Korhonen and Judita Preiss. 2003. Improving Subcategorization Acquisition using Word Sense Disambiguation. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Sapporo, Japan. 48-55.
PDF

Francis Bond, Diana McCarthy, Anna Korhonen and Aline Villavicencio. 2003. Proceedings of the ACL-SIGLEX 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment. Sapporo, Japan.
PDF

2002

Anna Korhonen. 2002. Assigning Verbs to Semantic Classes via WordNet. In Proceedings of the COLING Workshop on Building and Using Semantic Networks. Taipei, Taiwan.
PDF

Anna Korhonen and Yuval Krymolowski. 2002. On the Robustness of Entropy-Based Similarity Measures in Evaluation of Subcategorization Acquisition Systems. In Proceedings of the Sixth Conference on Natural Language Learning. Taipei, Taiwan. 91-97.
PDF

Anna Korhonen. 2002. Semantically Motivated Subcategorization Acquisition. In Proceedings of the ACL Workshop on Unsupervised Lexical Acquisition. Philadelphia, USA. 51-58.
PS

Judita Preiss and Anna Korhonen. 2002. Improving Subcategorization Acquisition with WSD. In Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions. Philadelphia, USA. 102-108.
PDF

Judita Preiss, Anna Korhonen and Ted Briscoe. 2002. Subcategorization Acquisition as an Evaluation Method for WSD. In Proceedings of LREC. Canary Islands, Spain. 1551-1556.
PDF

Anna Korhonen. 2002. Subcategorization Acquisition. PhD thesis published as Technical Report UCAM-CL-TR-530. Computer Laboratory, University of Cambridge.
PDF

2000

Anna Korhonen. 2000. Using Semantically Motivated Estimates to Help Subcategorization Acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. 216-223.
PDF

Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000. Statistical Filtering and Subcategorization Frame Acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. 199-205.
PDF

Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000. Is Hypothesis Testing Useful for Subcategorization Acquisition? Technical Report UCAM-CL-TR-491. Computer Laboratory, University of Cambridge.
PDF

1999

Melanie Baljko and Anna Korhonen. 1999. Proceedings of the ACL 1999 Student Session. University of Maryland, Maryland.
PDF

1998

Anna Korhonen. 1998. Automatic Extraction of Subcategorization Frames from Corpora - Improving Filtering with Diathesis Alternations. In Proceedings of the ESSLLI 98 Workshop on Automated Acquisition of Syntax and Parsing. Saarbrucken, Germany. 49-56.
PDF

Diana McCarthy and Anna Korhonen. 1998. Detecting Verbal Participation in Diathesis Alternations. In Proceedings of the ALC-COLING 98. Montreal, Canada. 1493-1495.
PDF

1997

Ted Briscoe, John Carroll and Anna Korhonen. 1997. Automatic Extraction of Subcategorization Frames from Corpora - a Framework and 3 Experiments. '97 Sparkle WP5 Deliverable.
PDF

Anna Korhonen. 1997. Acquiring Subcategorization from Textual Corpora. MPhil dissertation. Department of Engineering, University of Cambridge.
PS

  • © Anna Korhonen. Last updated: October 2013.