Anna Korhonen

I am a Royal Society University Research Fellow. I work at the University of Cambridge, in the Computer Laboratory and in the Research Centre for English and Applied Linguistics (RCEAL).




Contact Information



Prospective PhD students: I supervise PhD students at both the Computer Laboratory and RCEAL. Please take a look at my current research interests and ongoing projects, and read the departmental pages on postgraduate opportunities before contacting me.


Research Interests

Computational Linguistics / Natural Language Processing:

automatic lexical acquisition, text classification, text mining, large-scale lexicon development, statistical NLP, evaluation of NLP systems, applications of NLP to real-world tasks (e.g. text mining from biomedical texts) and to research in related fields (e.g. cognitive sciences)

Linguistics:

syntax, syntax-semantics interface, lexical semantics


Short Biography

I currently hold a University Research Fellowship from the Royal Society at the University of Cambridge. From 2004 to 2005 I was a JSPS Postdoctoral Fellow in Japan, where I worked at the National Institute of Informatics in Tokyo. Earlier in 2004 I was a visiting researcher at the University of Pennsylvania, in the Department of Computer and Information Science. Before that (2001-2003) I was a postdoctoral researcher in Cambridge.

I received my PhD in Computer Science from the University of Cambridge (Computer Laboratory, Trinity Hall) in 2002 under the supervision of Ted Briscoe. Before my doctorate (1996-1997), I did an MPhil in Computer Speech and Language Processing at the Department of Engineering at the same university. I also hold an MA in Theoretical Linguistics from the University of Reading, School of Linguistics and Applied Language Studies (1995). I did my undergraduate studies in linguistics at the University of Helsinki in Finland (1990-1993).


Current and Recent Activities









Recent and Current Projects
  • 'Accurate and Comprehensive Lexical Classification for Natural Language Processing Applications' (ACLEX)
    with Ted Briscoe and Judita Preiss.
    08/2005-07/2008, funded by the EPSRC

  • 'Using Automatic Verb Classification to Aid Event Extraction'
    08/2004-07/2005, JSPS Postdoctoral Fellowship, funded by the Japan Society for the Promotion of Science (JSPS)

Resources




Publications

Always in need of updating!

2010

Anna Korhonen. 2010. Tools and Procedures for the Acquisition of Morphological and Syntactical Information from Corpora. To Appear in the International Handbook of Dictionaries. Mouton de Gruyter, Berlin.

2009

Anna Korhonen. 2009. Automatic Lexical Classification - Balancing between Machine Learning and Linguistics. To Appear in Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation. Hong Kong.

Anna Korhonen, Lin Sun, Ilona Silins, and Ulla Stenius. 2009. The First Step in the Development of Text Mining Technology for Cancer Risk Assessment: Identifying and Organizing Scientific Evidence in Risk Assessment Literature. In BMC Bioinformatics 2009, 10:303.
PDF

Stuart Moore, Anna Korhonen and Sabine Buchholz. 2009. Number Sense Disambiguation. In Proceedings of the 12th Conference of the Pacific Association for Computational Linguistics. Sapporo, Japan.
PDF

Lin Sun and Anna Korhonen. 2009. Improving Verb Clustering with Automatically Acquired Selectional Preferences. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore.
PDF

Lin Sun, Anna Korhonen, Ilona Silins, and Ulla Stenius. 2009. User-Driven Development of Text Mining Resources for Cancer Risk Assessment. In Proceedings of BioNLP. Boulder, Colorado.
PDF

Karin Kipper-Schuler, Anna Korhonen, and Susan Brown. 2009. Proceedings of the NAACL 2009 Tutorial on VerbNet and Its Applications. North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009. Boulder, Colorado.
PDF

Ilona Silins, Anna Korhonen, Johan Hogberg, Lin Sun, and Ulla Stenius. 2009. Improved Cancer Risk Assessment Using Text Mining. In Proceedings of the 100th Annual Meeting of the American Association for Cancer Research. Denver, Colorado.
PDF

Andreas Vlachos, Anna Korhonen, and Zoubin Ghahramani. 2009. Unsupervised and Constrained Dirichlet Process Mixture Models for Verb Clustering. In Proceedings of the EACL workshop on GEometrical Models of Natural Language Semantics. Athens, Greece.
PDF

2008

Anna Korhonen, Yuval Krymolowski and Nigel Collier. 2008. The Choice of Features for Classification of Verbs in Biomedical Texts. To Appear in Proceedings of Coling 2008. Manchester, UK.
PDF

Ian Lewin, Ilona Silins, Anna Korhonen, Johan Hogberg, and Ulla Stenius. 2008. A New Challenge for Text Mining: Cancer Risk Assessment. In Proceedings of the ISMB BioLINK Special Interest Group on Text Data Mining. Toronto, Canada.
PDF

Andreas Vlachos, Zoubin Ghahramani, and Anna Korhonen. 2008. Dirichlet Process Mixture Models for Verb Clustering. To Appear in Proceedings of the ICML Workshop on Prior Knowledge for Text and Language. Helsinki, Finland.
PDF

Cedric Messiant, Anna Korhonen and Thierry Poibeau. 2008. LexSchem: A Large Subcategorization Lexicon for French Verbs. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC). Marrakech, Morocco.

Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2008. A Large-Scale Classification of English Verbs. In the Journal of Language Resources and Evaluation. 42(1). 21-40.

Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008. Verb Class Discovery from Rich Syntactic Data. In Proceedings of the 9th International Conference on Intelligent Text Processing and Computational Linguistics. Haifa, Israel.
PDF

Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008. Automatic Classification of English Verbs Using Rich Syntactic Features. In Proceedings of the 3rd International Joint Conference on Natural Language Processing. Hyderabad, India.
PDF

2007

Judita Preiss, Ted Briscoe and Anna Korhonen. 2007. A System for Large-scale Acquisition of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. Prague, Czech Republic.
PDF

Paula Buttery and Anna Korhonen. 2007. I will shoot your shopping down and you can shoot all my tins - Automatic Lexical Acquisition from the CHILDES Database. In Proceedings of the ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition. Prague, Czech Republic.
PDF

Paula Buttery, Aline Villavicencio and Anna Korhonen (eds.). 2007. Proceedings of the ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition. Prague, Czech Republic.
PDF

2006

Anna Korhonen, Yuval Krymolowski, and Nigel Collier. 2006. Automatic Classification of Verbs in Biomedical Texts. In Proceedings of ACL-COLING 2006. Sydney, Australia.
PDF

Yoko Mizuta, Anna Korhonen, Tony Mullen and Nigel Collier. 2006. Zone Analysis in Biology Articles as a Basis for Information Extraction. In the International Journal of Medical Informatics on Natural Language Processing in Biomedicine and Its Applications. 75(6). 468-87.
PDF

Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006. A Large-Scale Extension of VerbNet with Novel Verb Classes. In Proceedings of EURALEX. Turin, Italy.
DOC

Anna Korhonen, Yuval Krymolowski, and Ted Briscoe. 2006. A Large Subcategorization Lexicon for Natural Language Processing Applications. In Proceedings of the 5th International Conference on Language Resources and Evaluation. Genova, Italy.
PDF

Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006. Extending VerbNet with Novel Verb Classes. In Proceedings of the 5th International Conference on Language Resources and Evaluation. Genova, Italy.
PDF

2005

Aline Villavicencio, Francis Bond, Anna Korhonen, and Diana McCarthy. 2005. Introduction to the Special Issue on Multiword Expressions: Having a Crack at a Hard Nut. In Computer Speech and Language. 19(4). 365-377.

Jeremy Yallop, Anna Korhonen and Ted Briscoe. 2005. Automatic Acquisition of Adjectival Subcategorization from Corpora. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics. Ann Arbor, Michigan.
PDF

Timothy Baldwin, Anna Korhonen and Aline Villavicencio (eds.). 2005. Proceedings of the ACL-SIGLEX 2005 Workshop on Deep Lexical Acquisition. Ann Arbor, Michigan.
PDF

Paula Buttery and Anna Korhonen. 2005. Large-scale Analysis of Verb Subcategorization Differences between Child Directed Speech and Adult Speech. In Proceedings of the Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes. Saarbrücken, Germany.
PDF

2004

Judita Preiss and Anna Korhonen. 2004. WSD for Subcategorization Acquisition Task Description. In Proceedings of the ACL SENSEVAL-3 Workshop. Barcelona, Spain.
PDF

Takaaki Tanaka, Aline Villavicencio, Francis Bond and Anna Korhonen (eds.). 2004. Proceedings of the ACL-SIGLEX 2004 Workshop on Multiword Expressions: Integrating Processing. Barcelona, Spain.

Anna Korhonen and Ted Briscoe. 2004. Extended Lexical-Semantic Classification of English Verbs. In Proceedings of the HLT/NAACL Workshop on Computational Lexical Semantics. Boston, MA.
PDF

2003

Anna Korhonen, Yuval Krymolowski and Zvika Marx. 2003. Clustering Polysemic Subcategorization Frame Distributions Semantically. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Sapporo, Japan. 64-71.
PDF / PS

Anna Korhonen and Judita Preiss. 2003. Improving Subcategorization Acquisition using Word Sense Disambiguation. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Sapporo, Japan. 48-55.
PDF / PS

Francis Bond, Diana McCarthy, Anna Korhonen and Aline Villavicencio (eds.). 2003. Proceedings of the ACL-SIGLEX 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment. Sapporo, Japan.
PDF

2002

Anna Korhonen. 2002. Assigning Verbs to Semantic Classes via WordNet. In Proceedings of the COLING Workshop on Building and Using Semantic Networks. Taipei, Taiwan.
PDF / PS

Anna Korhonen and Yuval Krymolowski. 2002. On the Robustness of Entropy-Based Similarity Measures in Evaluation of Subcategorization Acquisition Systems. In Proceedings of the Sixth Conference on Natural Language Learning. Taipei, Taiwan. 91-97.
PDF / PS

Anna Korhonen. 2002. Semantically Motivated Subcategorization Acquisition. In Proceedings of the ACL Workshop on Unsupervised Lexical Acquisition. Philadelphia, USA. 51-58.
PS

Judita Preiss and Anna Korhonen. 2002. Improving Subcategorization Acquisition with WSD. In Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions. Philadelphia, USA. 102-108.
PDF / PS

Judita Preiss, Anna Korhonen and Ted Briscoe. 2002. Subcategorization Acquisition as an Evaluation Method for WSD. In Proceedings of LREC. Canary Islands, Spain. 1551-1556.
PDF / PS

Anna Korhonen. 2002. Subcategorization Acquisition. PhD thesis published as Technical Report UCAM-CL-TR-530. Computer Laboratory, University of Cambridge.
PDF

2000

Anna Korhonen. 2000. Using Semantically Motivated Estimates to Help Subcategorization Acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. 216-223.
PDF / PS

Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000. Statistical Filtering and Subcategorization Frame Acquisition. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. 199-205.
PDF / PS

Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000. Is Hypothesis Testing Useful for Subcategorization Acquisition? Technical Report UCAM-CL-TR-491. Computer Laboratory, University of Cambridge.
PDF

1999

Melanie Baljko and Anna Korhonen (eds.). 1999. Proceedings of the ACL 1999 Student Session. University of Maryland, Maryland.
PDF

1998

Anna Korhonen. 1998. Automatic Extraction of Subcategorization Frames from Corpora - Improving Filtering with Diathesis Alternations. In Proceedings of the ESSLLI 98 Workshop on Automated Acquisition of Syntax and Parsing. Saarbrücken, Germany. 49-56.
PDF / PS

Diana McCarthy and Anna Korhonen. 1998. Detecting Verbal Participation in Diathesis Alternations. In Proceedings of ACL-COLING 98. Montreal, Canada. 1493-1495.
PDF / PS

1997

Ted Briscoe, John Carroll and Anna Korhonen. 1997. Automatic Extraction of Subcategorization Frames from Corpora - a Framework and 3 Experiments. '97 Sparkle WP5 Deliverable.
PDF / PS

Anna Korhonen. 1997. Acquiring Subcategorization from Textual Corpora. MPhil dissertation. Department of Engineering, University of Cambridge.
PS


Links
