Anna Korhonen

Anna Korhonen - Reader in Computational Linguistics

Department of Theoretical and Applied Linguistics (DTAL)
Faculty of English Building, 9 West Road
Cambridge CB3 9DB, UK
Office: TR-12
Phone: (+44) 1223 767 389

University of Cambridge Computer Laboratory
William Gates Building, 15 JJ Thomson Avenue
Cambridge CB3 0FD, UK

Email:
anna.korhonen @ cl.cam.ac.uk

News

New dataset: My PhD student Felix Hill has released SimLex-999 - a gold standard resource for the evaluation of models that learn the meaning of words and concepts. This novel resource provides a way of measuring how well models capture similarity, rather than relatedness or association, and is freely available for downloading.

Current MPhil students:

Prospective PhD students:

  • PhD studentship: We are now advertising a Cambridge Cancer Centre (CCC) Non-Clinical PhD Studentship to commence in October 2015. The studentship will focus on Literature-Based Discovery for Cancer Biology. Please see the CCC website for details and how to apply. The deadline for applications is 30 November 2014.

  • I supervise PhD students at DTAL and the Computer Laboratory. Please take a look at my current research interests and ongoing projects, and read the departmental pages on postgraduate opportunities before contacting me. If you are interested in pursuing a PhD, please contact me well in advance.

Research

My research has principally been in the area of Natural Language Processing and Computational Linguistics. Some current areas of interest include:

  • lexical acquisition
  • computational semantics
  • computational models of discourse
  • lexical and domain adaptation
  • statistical and machine learning approaches for NLP
  • text mining
  • multilingual NLP
  • NLP for biomedicine
  • NLP for real-world applications
  • computational models of human language learning
  • computational neuro-linguistics

Biography

I am a Reader in Computational Linguistics at the University of Cambridge. I am based at the Department of Theoretical and Applied Linguistics (DTAL) and am also affiliated with the Computer Laboratory.

People

Current PhD students:

  • Simon Baker. Adaptive semantic text classification for biomedicine.
  • Felix Hill. Abstract/concrete distinction.
  • Yan Huang. Natural Language Processing for analysis of learner language.
  • Stuart Moore. Number sense disambiguation.

Current postdocs:

Past PhD students and postdocs:

Projects

Current and recent projects

Teaching

In 2014-15, I am teaching the following courses

Computer Laboratory:

DTAL:

Activities

Current activities:

Recent activities:

Media


Mining the Language of Science. Research Horizons. November 18, 2011.

Computer System Developed to Analyse the Cancer Risk of a Chemical. CNN News. November 21, 2011.

Publications

2014

Felix Hill, Roi Reichart and Anna Korhonen. 2014. SimLex-999: Evaluating Semantic Models with (Genuine) Similarity Estimation. arxiv preprint arxiv:1408:3456
LINK
Accompanying dataset

Felix Hill, Roi Reichart and Anna Korhonen. 2014. Multi-Modal Models for Concrete and Abstract Concept Meaning. To appear in Transactions of ACL (TACL).
LINK

Felix Hill and Anna Korhonen. 2014. Learning Abstract Concepts from Multi-Modal Data: Since You Probably Can't See What I Mean. In Proceedings of EMNLP 2014. Doha, Qatar.
LINK

Diarmuid Ó Séaghdha and Anna Korhonen. 2014. Probabilistic distributional semantics with latent variable models. Computational Linguistics 40(3): 587-631.
LINK

Simon Baker, Roi Reichart and Anna Korhonen. 2014. An Unsupervised Model for Instance Level Subcategorization Acquisition. In Proceedings of EMNLP 2014, Doha, Qatar.
LINK

Yufan Guo, Diarmuid Ó Séaghdha, Ilona Silins, Lin Sun, Johan Hogberg, Ulla Stenius and Anna Korhonen. 2014. CRAB 2.0: A text mining tool for supporting literature review in chemical cancer risk assessment. In Proceedings of Coling 2014 (a demo paper), Dublin, Ireland.
LINK

Douwe Kiela, Felix Hill, Anna Korhonen and Stephen Clark. 2014. Improving multi-modal representations using image dispersion: Why less is sometimes more. In Proceedings of ACL 2014. Baltimore, USA.
LINK

Felix Hill and Anna Korhonen. 2014. Concreteness and subjectivity as dimensions of lexical meaning. In Proceedings of ACL 2014. Baltimore, USA.
LINK

Carolina Scarton, Lin Sun, Karin Kipper-Schuler, Magali Sanches Duran, Martha Palmer and Anna Korhonen. 2014. Verb Clustering for Brazilian Portuguese. 15th International Conference in Computational Linguistics and Intelligent Text Processing. In Lecture Notes in Computer Science. Vol. 8404. Springer. 25-40.
LINK

Xiao Jiang, Yufan Guo, Jeroen Geertzen, Theodora Alexopoulou, Lin Sun and Anna Korhonen. 2014. Native Language Identification Using Large, Longitudinal Data. In Proceedings of LREC. Reykjavik, Iceland.
LINK

Ilona Silins, Anna Korhonen and Ulla Stenius. 2014. Evaluation of carcinogenic modes of action for pesticides in fruit on the Swedish market using a text-mining tool. Front Pharmacol. 2014 Jun 23;5:145. doi: 10.3389/fphar.
LINK

Anna Korhonen, Yufan Guo, Meliha Yetisgen-Yildiz, Ulla Stenius, Masashi Narita and Pietro Lio. 2014. Improving Literature-Based Discovery with Text Mining. In Proceedings of CIBB 2014. Cambridge, UK.
LINK

Ilona Silins, Anna Korhonen, Yufan Guo, Ulla Stenius. 2014. A text mining approach for chemical risk assessment and cancer research. In Proceedings of Eurotox 2014. Edinburgh, UK.
LINK

Colin Kelly, Barry Devereux and Anna Korhonen. 2014. Automatic extraction of property norm-like data from large text corpora. Cognitive Science, 38: 638-682. doi: 10.1111/cogs.12091.
LINK

2013

Felix Hill, Anna Korhonen and Christian Bentz. 2013. A quantitative empirical analysis of the abstract/concrete distinction. Cognitive Science.
LINK

Ekaterina Shutova, Barry Devereux and Anna Korhonen. 2013. Conceptual Metaphor Theory Meets the Data: A Corpus-based Human Annotation Study. Language Resources and Evaluation.
LINK

Ekaterina Shutova, Jakub Kaplan, Simone Teufel and Anna Korhonen. 2013. A Computational Model of Logical Metonymy. ACM Transactions on Speech and Language Processing. 10(3). 11.
LINK

Jeroen Geertzen, Theodora Alexopoulou and Anna Korhonen. 2013. Automatic linguistic annotation of large scale L2 databases: The EF-Cambridge Open Language Database (EFCAMDAT). In Proceedings of the 31st Second Language Research Forum (SLRF), Carnegie Mellon, Cascadilla Press.
LINK

Roi Reichart and Anna Korhonen. 2013. Improved Lexical Acquisition through DPP-based Verb Clustering. In Proceedings of ACL 2013, Sofia, Bulgaria.
LINK

Lin Sun, Diana McCarthy and Anna Korhonen. 2013. Diathesis alternation approximation for verb clustering. In Proceedings of ACL 2013, Sofia, Bulgaria.
LINK

Felix Hill, Douwe Kiela and Anna Korhonen. 2013. Concreteness and corpora: A theoretical and practical analysis. In Proceedings of the ACL 2013 Workshop on Cognitive Modelling and Computational Linguistics, Sofia, Bulgaria.
LINK

Felix Hill, Christian Bentz and Anna Korhonen. 2013. Large-scale empirical analyses of concreteness. In Proceedings of the Annual Meeting of the Cognitive Science Society, Berlin, Germany.
LINK

Colin Kelly, Barry Devereux and Anna Korhonen. 2013. Minimally Supervised Learning for Unconstrained Conceptual Property Extraction. In Proceedings of the Annual Meeting of the Cognitive Science Society, Berlin, Germany.
LINK

Yufan Guo, Roi Reichart and Anna Korhonen. 2013. Improved Information Structure Analysis of Scientific Documents Through Discourse and Lexical Constraints. In Proceedings of the NAACL-HLT 2013, Atlanta, US.
LINK

Tim Van de Cruys, Thierry Poibeau and Anna Korhonen. 2013. A Tensor-based Factorization Model of Semantic Compositionality. In Proceedings of the NAACL-HLT 2013, Atlanta, US.
LINK

Yufan Guo, Ilona Silins, Ulla Stenius and Anna Korhonen. 2013. Active learning-based information structure analysis of full scientific articles and two applications for biomedical literature review. Bioinformatics (2013) 29 (11): 1440-1447.
LINK

Thomas Lippincott, Laura Rimell, Karin Verspoor and Anna Korhonen. 2013. Approaches to verb subcategorization for biomedicine . Journal of Biomedical Informatics. Volume 46, Issue 2. Pages 212-227.
LINK

Thomas Lippincott, Laura Rimell, Helen L. Johnson, Karin Verspoor and Anna Korhonen. 2013. Acquisition and evaluation of verb subcategorization resources for biomedicine. Journal of Biomedical Informatics. Volume 46, Issue 2. Pages 228-237.
LINK

Aline Villavicencio, Thierry Poibeau, Anna Korhonen and Afra Alishahi. 2013. Cognitive Aspects of Computational Language Acquisition. Springer.
LINK

Thierry Poibeau, Aline Villavicencio, Anna Korhonen and Afra Alishahi. 2013. Computational Modeling as a Methodology for Studying Human Language Learning . In Cognitive Aspects of Computational Language Acquisition. Springer.
LINK

Anna Korhonen. 2013. Tools and Procedures for the Acquisition of Morphological and Syntactical Information from Corpora. In the International Handbook of Dictionaries. Mouton de Gruyter, Berlin.

2012

Roi Reichart and Anna Korhonen. 2012. Document and Corpus Level Inference For Unsupervised Learning of Information Structure of Scientific Documents. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Tim Van de Cruys, Laura Rimell, Thierry Poibeau, and Anna Korhonen. 2012. Multi-way Tensor Factorization for Unsupervised Lexical Acquisition. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Ekaterina Shutova, Tim van de Cruys and Anna Korhonen. 2012. Unsupervised Metaphor Paraphrasing Using a Vector Space Model. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Danish Contractor, Yufan Guo and Anna Korhonen. 2012. Using Argumentative Zones for Extractive Summarization of Scientific Articles. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Yufan Guo, Ilona Silins, Roi Reichart and Anna Korhonen. 2012. CRAB Reader: A Tool for Analysis and Visualization of Argumentative Zones in Scientific Literature. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK

Ekaterina Shutova, Simone Teufel and Anna Korhonen. 2012. Statistical Metaphor Processing. Computational Linguistics, 39(2).
LINK

Tom Lippincott, Diarmuid Ó Séaghdha and Anna Korhonen. 2012. Learning syntactic verb frames using graphical models. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012). Jeju, Korea.
LINK

Sandeep Kadekar, Ilona Silins, Anna Korhonen, Kristian Dreij, Lauy Al-Anati, Johan Hogberg and Ulla Stenius. 2012. Exocrine pancreatic carcinogenesis and autotaxin expression. PLoS ONE 7(8): e43209.
LINK

Anna Korhonen, Diarmuid Ó Séaghdha, Ilona Silins, Lin Sun, Johan Hogberg and Ulla Stenius. 2012. Text mining for literature review and knowledge discovery in cancer risk assessment and research. PLoS ONE 7(4):e33427.
LINK

Diarmuid Ó Séaghdha and Anna Korhonen. 2012. Modelling selectional preferences in a lexical hierarchy. In Proceedings of the 1st Joint Conference on Lexical and Computational Semantics (*SEM 2012). Montreal, QC.
LINK

Laura Rimell, Thierry Poibeau, and Anna Korhonen. 2012. Merging Lexicons for Higher Precision Subcategorization Frame Acquisition. Proceedings of the LREC 2012 Workshop on Language Resource Merging, Istanbul, Turkey.
LINK

Ilona Silins, Anna Korhonen, Johan Hogberg and Ulla Stenius. 2012. Data and Literature Gathering in Chemical Cancer Risk Assessment. Integrated Environmental Assessment and Management. 2012, Jan 3.
LINK

Colin Kelly, Barry Devereux and Anna Korhonen. 2012. Semi-supervised learning for automatic conceptual property extraction. Proceedings of the NAACL 2012 Cognitive Modeling and Computational Linguistics Workshop.
LINK

Omri Abend, Chris Biemann, Anna Korhonen, Ari Rappoport, Roi Reichart and Anders Sogaard. 2012. Proceedings of the EACL Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP.
LINK

Robert Berwick, Anna Korhonen, Thierry Poibeau and Aline Villavicencio. 2012. Proceedings of the EACL Workshop on Computational Models of Language Acquisition and Loss.
LINK

2011

Yufan Guo, Anna Korhonen, Ilona Silins and Ulla Stenius. 2011. Weakly-supervised learning of information structure of scientific abstracts - is it accurate enough to benefit real-world tasks in biomedicine? Bioinformatics 2011; doi: 10.1093/bioinformatics/btr536.
LINK

Tom Lippincott, Diarmuid Ó Séaghdha and Anna Korhonen. 2011. Exploring subdomain variation in biomedical language. BMC Bioinformatics 12:212.
LINK

Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Johan Hogberg and Ulla Stenius. 2011. A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment. BMC Bioinformatics 2011, 12:69.
LINK

Lin Sun and Anna Korhonen. Hierarchical Verb Clustering Using Graph Factorization. 2011. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK

Yufan Guo, Anna Korhonen and Thierry Poibeau. 2011. A Weakly-supervised Approach to Argumentative Zoning of Scientific Documents. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK

Tim Van de Cruys, Thierry Poibeau and Anna Korhonen. 2011. Latent Vector Weighting for Word Meaning in Context Edinburgh. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK

Diarmuid Ó Séaghdha and Anna Korhonen. 2011. Probabilistic models of similarity in syntactic context. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK

Omri Abend, Anna Korhonen, Ari Rappoport and Roi Reichart. 2011. Proceedings of the EMNLP Workshop on Unsupervised Learning in NLP.
LINK

Barry Devereux, Anna Korhonen, Paula Buttery and Lorraine Tyler. 2011. The role of verb subcategorization frames and selectional preferences in sentence processing: an investigation using corpus-derived measures. Multidisciplinary Workshop on the mental representation of verbal argument structure. Paris, France.

Barry Devereux, Anna Korhonen and Lorraine Tyler. 2011. Parsing sentences are unlikely: corpus-based analyses of the neural processing of verbs. International Conference on Cognitive Neuroscience (ICON). Palma, Mallorca, Spain.
LINK

Jie Zhuang, Barry Devereux, Anna Korhonen and Lorraine Tyler. 2011. Lexical and syntactic competition effects in verb processing: evidence from corpus-based statistics. International Conference on Cognitive Neuroscience (ICON). Palma, Mallorca, Spain.
LINK

Colin Kelly, Barry Devereux and Anna Korhonen. 2011. Automatic extraction of property norm-like features from large text corpora with gold standard, human and semantic-similarity evaluations. AMLaP. Paris, France.
LINK

2010

Anna Korhonen. 2010. Automatic Lexical Classification - Bridging Research and Practice. In Philoshophical Transactions A of the Royal Society. 368: 3621-3632.
LINK

Barry Devereux, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen. 2010. Towards unrestricted, large-scale acquisition of feature-based conceptual representations from corpus data. Research on Language and Computation.
PDF

Lin Sun, Thierry Poibeau, Anna Korhonen and Cedric Messiant. 2010. Investigating the cross-linguistic potential of VerbNet -style classification. In Proceedings of Coling. Beijing, China.
PDF

Ekaterina Shutova, Lin Sun and Anna Korhonen. 2010. Metaphor Identification Using Verb and Noun Clustering. In Proceedings of Coling. Beijing, China.
PDF

Tom Lippincott, Diarmuid O Seaghdha, Lin Sun and Anna Korhonen. 2010. Exploring variation across biomedical subdomains. In Proceedings of Coling. Beijing, China.
PDF

Barry Devereux, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen, 2010. Large-Scale Acquisition of Feature-Based Conceptual Representations from Textual Corpora. In Proceedings of the Annual Meeting of the Cognitive Science Society.
PDF

Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Lin Sun and Ulla Stenius. 2010. Identifying the Information Structure of Scientific Abstracts: An Investigation of Three Different Schemes. In Proceedings of bio-NLP 2010. Uppsala, Sweden
PDF

Sandeep Kadekar, Ilona Silins, Anna Korhonen, Johan Hogberg, Kristian Dreij, and Ulla Stenius. 2010. Carcinogen-induced inflammation and pancreatic cancer. In Proceedings of the 101th Annual Meeting of the American Association for Cancer Research. Washington, D.C., USA.
PDF

Colin Kelly, Barry Devereux and Anna Korhonen. 2010. Acquiring Human-like Feature-Based Conceptual Representations from Corpora. In Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
PDF

Barry Devereux, Colin Kelly and Anna Korhonen. 2010. Using fMRI Activation to Conceptual Stimuli to Evaluate Methods for Extracting Conceptual Representations from Corpora. In Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
PDF

Brian Murphy, Kai-min Kevin Chang and Anna Korhonen. 2010. Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
LINK

Stuart Moore, Anna Korhonen and Sabine Buchholz. 2010. Annotating the Enron Email Corpus with Number Senses. In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10). Valletta, Malta.
PDF

Barry Devereux, Colin Kelly, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen, 2010. The Acquisition of Unconstrained Feature-Based Conceptual Representations from Corpora. The Rovereto Workshop on Concepts, Actions, and Objects: Functional and Neural Perspectives.
PDF

2009

Anna Korhonen. 2009. Automatic Lexical Classification - Balancing between Machine Learning and Linguistics. In Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation. Hong Kong.
PDF

Anna Korhonen, Lin Sun, Ilona Silins, and Ulla Stenius. 2009. The First Step in the Development of Text Mining Technology for Cancer Risk Assessment: Identifying and Organizing Scientific Evidence in Risk Assessment Literature. In BMC Bioinformatics 2009, 10:303.
PDF

Stuart Moore, Anna Korhonen and Sabine Buchholz. 2009. Number Sense Disambiguation. In Proceedings of the 12th Conference of the Pacific Association for Computational Linguistics. Sapporo, Japan.
PDF

Lin Sun and Anna Korhonen. 2009. Improving Verb Clustering with Automatically Acquired Selectional Preferences. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore.
PDF

Lin Sun, Anna Korhonen, Ilona Silins, and Ulla Stenius. 2009. User-Driven Development of Text Mining Resources for Cancer Risk Assessment. In Proceedings of BioNLP. Boulder, Colorado.
PDF

Karin Kipper-Schuler, Anna Korhonen, and Susan Brown. 2009. Proceedings of the NAACL 2009 Tutorial on VerbNet and Its Applications. North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009 Boulder, Colorado.
PDF

Ilona Silins, Anna Korhonen, Johan Hogberg, Lin Sun, and Ulla Stenius. 2009. Improved Cancer Risk Assessment Using Text Mining. In Proceedings of the 100th Annual Meeting of the American Association for Cancer Research. Denver, Colorado.
PDF

Andreas Vlachos, Anna Korhonen, and Zoubin Ghahramani. 2009. Unsupervised and Constrained Dirichlet Process Mixture Models for Verb Clustering. In Proceedings of the EACL workshop on GEometrical Models of Natural Language Semantics. Athens, Greece.
PDF

2008

Anna Korhonen, Yuval Krymolowski and Nigel Collier. 2008. The Choice of Features for Classification of Verbs in Biomedical Texts. In Proceedings of Coling 2008. Manchester, UK.
PDF

Ian Lewin, Ilona Silins, Anna Korhonen, Johan Hogberg, and Ulla Stenius. 2008. A New Challenge for Text Mining: Cancer Risk Assessment. In Proceedings of the ISMB BioLINK Special Interest Group on Text Data Mining. Toronto, Canada.
PDF

Andreas Vlachos, Zoubin Ghahramani and Anna Korhonen. 2008. Dirichlet Process Mixture Models for Verb Clustering. In Proceedings of the ICML Workshop on Prior Knowledge for Text and Language. Helsinki, Finland.
PDF

Cedric Messiant, Anna Korhonen and Thierry Poibeau. 2008. LexSchem: A Large Subcategorization Lexicon for French Verbs. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC). Marrakech, Morocco.
PDF

Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2008. A Large-Scale Classification of English Verbs. In the Journal of Language Resources and Evaluation. 42(1). 21-40.
LINK

Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008. Verb Class Discovery from Rich Syntactic Data. In Proceedings of the 9th International Conference on Intelligent Text Processing and Computational Linguistics. Haifa, Israel.
PDF

Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008. Automatic Classification of English Verbs Using Rich Syntactic Features. In Proceedings of the 3rd International Joint Conference on Natural Language Processing. Hyderabad, India.
PDF

2007

Judita Preiss, Ted Briscoe and Anna Korhonen. 2007. A System for Large-scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. Prague, Czech Republic.
PDF

Paula Buttery and Anna Korhonen. 2007. I will shoot your shopping down and you can shoot all my tins - Automatic Lexical Acquisition from the CHILDES Database. In Proceedings of ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition. Prague, Czech Republic.
PDF

Paula Buttery, Aline Villavicencio and Anna Korhonen. 2007. The proceedings of the ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition. Prague, Czech Republic.
PDF

2006

Anna Korhonen, Yuval Krymolowski, and Nigel Collier. 2006. Automatic Classification of Verbs in Biomedical Texts. In Proceedings of ACL-COLING 2006. Sydney, Australia.
PDF

Yoko Mizuta, Anna Korhonen, Tony Mullen and Nigel Collier. 2006. Zone Analysis in Biology Articles as a Basis for Information Extraction. In the International Journal of Medical Informatics on Natural Language Processing in Biomedicine and Its Applications. 75(6). 468-87.
PDF

Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006. A Large-Scale Extension of VerbNet with Novel Verb Classes. In Proceedings of EURALEX. Turin, Italy.
DOC

Anna Korhonen, Yuval Krymolowski, and Ted Briscoe. 2006. A Large Subcategorization Lexicon for Natural Language Processing Applications. In Proceedings of the 5th international conference on Language Resources and Evaluation. Genova, Italy.
PDF

Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006. Extending VerbNet with Novel Verb Classes. In Proceedings of 5th international conference on Language Resources and Evaluation. Genova, Italy.
PDF

2005

Aline Villavicencio, Francis Bond, Anna Korhonen, and Diana McCarthy. 2005. Introduction to the Special Issue on Multiword Expressions: Having a Crack at a Hard Nut. In Computer Speech and Language. 19(4). 365-377.
LINK

Jeremy Yallop, Anna Korhonen and Ted Briscoe. 2005. Automatic Acquisition of Adjectival Subcategorization from Corpora. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics. Ann Arbor, Michigan.
PDF

Timothy Baldwin, Anna Korhonen and Aline Villavicencio. 2005. Proceedings of the ACL-SIGLEX 2005 Workshop on Deep Lexical Acquisition. Ann Arbor, Michigan.
PDF

Paula Buttery and Anna Korhonen. 2005. Large-scale Analysis of Verb Subcategorization Differences between Child Directed Speech and Adult Speech. In Proceedings of the Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes. Saarbrucken, Germany.
PDF

2004

Judita Preiss and Anna Korhonen. 2004. WSD for Subcategorization Acquisition Task Description. In Proceedings of the ACL SENSEVAL-3 Workshop. Barcelona, Spain.
PDF

Takaaki Tanaka, Aline Villavicencio, Francis Bond and Anna Korhonen. 2004. Proceedings of the ACL-SIGLEX 2004 Workshop on Multiword Expressions: Integrating Processing. Barcelona, Spain.
PDF

Anna Korhonen and Ted Briscoe. 2004. Extended Lexical-Semantic Classification of English Verbs. In Proceedings of the HLT/NAACL Workshop on Computational Lexical Semantics. Boston, MA.
PDF

2003

Anna Korhonen, Yuval Krymolowski and Zvika Marx. 2003. Clustering Polysemic Subcategorization Frame Distributions Semantically. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Sapporo, Japan. 64-71.
PDF

Anna Korhonen and Judita Preiss. 2003. Improving Subcategorization Acquisition using Word Sense Disambiguation. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Sapporo, Japan. 48-55.
PDF

Francis Bond, Diana McCarthy, Anna Korhonen and Aline Villavicencio. 2003. Proceedings of the ACL-SIGLEX 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment. Sapporo, Japan.
PDF

2002

Anna Korhonen. 2002. Assigning Verbs to Semantic Classes via WordNet. In Proceedings of the COLING Workshop on Building and Using Semantic Networks. Taipei, Taiwan.
PDF

Anna Korhonen and Yuval Krymolowski. 2002. On the Robustness of Entropy-Based Similarity Measures in Evaluation of Subcategorization Acquisition Systems. In Proceedings of the Sixth Conference on Natural Language Learning. Taipei, Taiwan. 91-97.
PDF

Anna Korhonen. 2002. Semantically Motivated Subcategorization Acquisition. In Proceedings of the ACL Workshop on Unsupervised Lexical Acquisition. Philadelphia, USA. 51-58.
PS

Judita Preiss and Anna Korhonen. 2002. Improving Subcategorization Acquisition with WSD. In Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions. Philadelphia, USA. 102-108.
PDF

Judita Preiss, Anna Korhonen and Ted Briscoe. 2002. Subcategorization Acquisition as an Evaluation Method for WSD. In Proceedings of LREC. Canary Islands, Spain. 1551-1556.
PDF

Anna Korhonen. 2002. Subcategorization Acquisition. PhD thesis published as Technical Report UCAM-CL-TR-530. Computer Laboratory, University of Cambridge.
PDF

2000

Anna Korhonen. 2000. Using Semantically Motivated Estimates to Help Subcategorization Acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. 216-223.
PDF

Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000. Statistical Filtering and Subcategorization Frame Acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. 199-205.
PDF

Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000. Is Hypothesis Testing Useful for Subcategorization Acquisition? Technical Report UCAM-CL-TR-491. Computer Laboratory, University of Cambridge.
PDF

1999

Melanie Baljko and Anna Korhonen. 1999. Proceedings of the ACL 1999 Student Session. University of Maryland, Maryland.
PDF

1998

Anna Korhonen. 1998. Automatic Extraction of Subcategorization Frames from Corpora - Improving Filtering with Diathesis Alternations. In Proceedings of the ESSLLI 98 Workshop on Automated Acquisition of Syntax and Parsing. Saarbrucken, Germany. 49-56.
PDF

Diana McCarthy and Anna Korhonen. 1998. Detecting Verbal Participation in Diathesis Alternations. In Proceedings of the ALC-COLING 98. Montreal, Canada. 1493-1495.
PDF

1997

Ted Briscoe, John Carroll and Anna Korhonen. 1997. Automatic Extraction of Subcategorization Frames from Corpora - a Framework and 3 Experiments. '97 Sparkle WP5 Deliverable.
PDF

Anna Korhonen. 1997. Acquiring Subcategorization from Textual Corpora. MPhil dissertation. Department of Engineering, University of Cambridge.
PS

  • © Anna Korhonen. Last updated: October 2014.