|
 |
 |
|
Contact Information
|
Cambridge CB3 OFD United Kingdom
Phone:
(+44) 1223 763 672 / 763 500
Email:
Anna.Korhonen at cl.cam.ac.uk
|
Faculty of English Building
Cambridge CB3 9DB United Kingdom
Phone:
(+44) 1223 767 389 / 767 392
Email:
Anna.Korhonen at cl.cam.ac.uk
|
|
|
|
|
|
Prospective PhD students: I supervise PhD students both at the Computer Laboratory and RCEAL.
Please take a look at my current research interests and ongoing projects, and
read the departmental pages on postgraduate opportunities before contacting me.
|
|
|
|
Research Interests
|
Computational Linguistics / Natural Language Processing:
automatic lexical acquisition,
text classification,
text mining,
large-scale lexicon development,
statistical NLP,
evaluation of NLP systems,
applications of NLP to real-world tasks (e.g. text mining from biomedical texts)
and to research in related fields (e.g. cognitive sciences)
Linguistics:
syntax,
syntax-semantics interface,
lexical semantics
|
|
|
|
|
Short Biography
|
|
|
|
|
|
Current and Recent Activities
|
|
|
|
|
Recent and Current Projects
|
-
'Accurate and Comprehensive Lexical Classification for Natural Language Processing Applications'
(ACLEX)
with Ted Briscoe and
Judita Preiss.
08/2005-07/2008, funded by the EPSRC
-
'Using Automatic Verb Classification to Aid Event Extraction'
08/2004-07/2005, JSPS Postdoctoral Fellowship,
funded by the Japan Society for the Promotion of Science (JSPS)
|
|
|
|
Resources
|
|
|
|
|
Publications
|
Always in need of updating!
2010
Anna Korhonen. 2010. Tools and Procedures for the Acquisition of Morphological and
Syntactical Information from Corpora.
To Appear in the International Handbook of Dictionaries. Mouton de Gruyter, Berlin.
2009
Anna Korhonen. 2009. Automatic Lexical Classification - Balancing between Machine
Learning and Linguistics.
To Appear in Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation. Hong Kong.
Anna Korhonen, Lin Sun, Ilona Silins, and Ulla Stenius. 2009.
The First Step in the Development of Text Mining Technology for Cancer Risk Assessment:
Identifying and Organizing Scientific Evidence in Risk Assessment Literature.
In
BMC Bioinformatics 2009, 10:303.
PDF
Stuart Moore, Anna Korhonen and Sabine Buchholz. 2009.
Number Sense Disambiguation. In
Proceedings of the 12th Conference of the Pacific Association for Computational Linguistics. Sapporo, Japan.
PDF
Lin Sun and Anna Korhonen. 2009.
Improving Verb Clustering with Automatically Acquired Selectional Preferences.
In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore.
PDF
Lin Sun, Anna Korhonen, Ilona Silins, and Ulla Stenius. 2009.
User-Driven Development of Text Mining Resources for Cancer Risk Assessment.
In Proceedings of BioNLP. Boulder, Colorado.
PDF
Karin Kipper-Schuler, Anna Korhonen, and Susan Brown. 2009.
Proceedings of the NAACL 2009 Tutorial on VerbNet and Its Applications.
North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT) 2009 Boulder, Colorado.
PDF
Ilona Silins, Anna Korhonen, Johan Hogberg, Lin Sun, and Ulla Stenius. 2009.
Improved Cancer Risk Assessment Using Text Mining.
In Proceedings of the 100th Annual Meeting of the American Association for Cancer Research. Denver, Colorado.
PDF
Andreas Vlachos, Anna Korhonen, and Zoubin Ghahramani. 2009.
Unsupervised and Constrained Dirichlet Process Mixture Models for Verb Clustering.
In
Proceedings of the EACL workshop on GEometrical Models of Natural Language Semantics. Athens, Greece.
PDF
2008
Anna Korhonen, Yuval Krymolowski and Nigel Collier. 2008.
The Choice of
Features for Classification of Verbs in Biomedical Texts. To Appear in
Proceedings of Coling 2008. Manchester, UK.
PDF
Ian Lewin, Ilona Silins, Anna Korhonen, Johan Hogberg, and Ulla Stenius. 2008.
A New Challenge for Text Mining: Cancer Risk Assessment.
In
Proceedings of the ISMB BioLINK Special Interest Group on Text Data Mining. Toronto, Canada.
PDF
Andreas Vlachos, Zoubin Ghahramani, and Anna Korhonen. 2008.
Dirichlet Process Mixture Models for Verb Clustering.
To Appear in
Proceedings of the ICML Workshop on Prior Knowledge for Text and Language. Helsinki, Finland.
PDF
Cedric Messiant, Anna Korhonen and Thierry Poibeau. 2008.
LexSchem: A Large Subcategorization Lexicon for French Verbs.
In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC).
Marrakech, Morocco.
Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2008.
A Large-Scale Classification of English Verbs. In the
Journal of Language Resources and Evaluation. 42(1). 21-40.
Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008.
Verb Class Discovery from Rich Syntactic Data. In
Proceedings of the 9th International Conference on Intelligent Text Processing
and Computational Linguistics. Haifa, Israel.
PDF
Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008. Automatic Classification of English
Verbs Using Rich Syntactic Features. In Proceedings of the 3rd International Joint
Conference on Natural Language Processing. Hyderabad, India.
PDF
2007
Judita Preiss, Ted Briscoe and Anna Korhonen. 2007. A System for Large-scale Acquisition
of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora. In
Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. Prague, Czech Republic.
PDF
Paula Buttery and Anna Korhonen. 2007 I will shoot your shopping down and you can shoot all my tins -
Automatic Lexical Acquisition from the CHILDES Database. In
Proceedings of ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition. Prague, Czech Republic.
PDF
Paula Buttery, Aline Villavicencio and Anna Korhonen (eds.). 2007.
The proceedings of the ACL 2007 Workshop on Cognitive Aspects of
Computational Language Acquisition.
Prague, Czech Republic.
PDF
2006
Anna Korhonen, Yuval Krymolowski, and Nigel Collier. 2006.
Automatic Classification of Verbs in Biomedical Texts.
In Proceedings of ACL-COLING 2006. Sydney, Australia.
PDF
Yoko Mizuta, Anna Korhonen, Tony Mullen and Nigel Collier. 2006.
Zone Analysis in Biology Articles as a Basis for Information Extraction. In
the International Journal of Medical Informatics on Natural Language
Processing in Biomedicine and Its Applications. 75(6). 468-87.
PDF
Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006.
A Large-Scale Extension of VerbNet with Novel Verb Classes.
In Proceedings of EURALEX. Turin, Italy.
DOC
Anna Korhonen, Yuval Krymolowski, and Ted Briscoe. 2006.
A Large Subcategorization Lexicon for Natural Language Processing Applications.
In Proceedings of the 5th international conference on Language Resources and Evaluation. Genova, Italy.
PDF
Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006.
Extending VerbNet with Novel Verb Classes.
In Proceedings of 5th international conference on Language Resources and Evaluation. Genova, Italy.
PDF
2005
Aline Villavicencio, Francis Bond, Anna Korhonen, and Diana McCarthy. 2005.
Introduction to the Special Issue on Multiword Expressions: Having a Crack at a Hard Nut. In
Computer Speech and Language. 19(4). 365-377.
Jeremy Yallop, Anna Korhonen and Ted Briscoe. 2005.
Automatic Acquisition of Adjectival Subcategorization from Corpora.
In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics.
Ann Arbor, Michigan. PDF
Timothy Baldwin, Anna Korhonen and Aline Villavicencio (eds.). 2005.
Proceedings of the ACL-SIGLEX 2005 Workshop on Deep Lexical Acquisition.
Ann Arbor, Michigan. PDF
Paula Buttery and Anna Korhonen. 2005.
Large-scale Analysis of Verb Subcategorization Differences between Child Directed Speech and Adult Speech.
In Proceedings of the Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes.
Saarbrucken, Germany. PDF
2004
Judita Preiss and Anna Korhonen. 2004.
WSD for Subcategorization Acquisition Task Description. In
Proceedings of the ACL SENSEVAL-3 Workshop.
Barcelona, Spain. PDF
Takaaki Tanaka, Aline Villavicencio, Francis Bond and Anna Korhonen (eds.). 2004.
Proceedings of the ACL-SIGLEX 2004 Workshop on Multiword Expressions: Integrating Processing.
Barcelona, Spain.
Anna Korhonen and Ted Briscoe. 2004.
Extended Lexical-Semantic Classification of English Verbs. In
Proceedings of the HLT/NAACL Workshop on Computational Lexical Semantics.
Boston, MA. PDF
2003
Anna Korhonen, Yuval Krymolowski and Zvika Marx. 2003. Clustering
Polysemic Subcategorization Frame Distributions Semantically.
In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics.
Sapporo, Japan. 64-71. PDF /
PS
Anna Korhonen and Judita Preiss. 2003. Improving Subcategorization
Acquisition using Word Sense Disambiguation.
In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics.
Sapporo, Japan. 48-55. PDF / PS
Francis Bond, Diana McCarthy, Anna Korhonen and Aline Villavicencio (eds.). 2003.
Proceedings of the ACL-SIGLEX 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment.
Sapporo, Japan. PDF
2002
Anna Korhonen. 2002. Assigning Verbs to Semantic Classes via WordNet.
In Proceedings of the COLING Workshop on Building and Using Semantic Networks.
Taipei, Taiwan. PDF / PS
Anna Korhonen and Yuval Krymolowski. 2002. On the Robustness
of Entropy-Based Similarity Measures in Evaluation of Subcategorization Acquisition
Systems. In Proceedings of the Sixth Conference on Natural Language Learning.
Taipei, Taiwan. 91-97. PDF / PS
Anna Korhonen. 2002. Semantically Motivated Subcategorization Acquisition.
In Proceedings of the ACL Workshop on Unsupervised Lexical Acquisition.
Philadelphia, USA. 51-58. PS / PS
Judita Preiss and Anna Korhonen. 2002. Improving Subcategorization
Acquisition with WSD. In Proceedings of the ACL Workshop on Word Sense
Disambiguation: Recent Successes and Future Directions. Philadelphia, USA. 102-108.
PDF / PS
Judita Preiss, Anna Korhonen and Ted Briscoe. 2002.
Subcategorization Acquisition as an Evaluation Method for WSD.
In Proceedings of LREC. Canary Islands, Spain. 1551-1556.
PDF / PS
Anna Korhonen. 2002. Subcategorization Acquisition.
PhD thesis published as Technical Report UCAM-CL-TR-530. Computer Laboratory, University of
Cambridge. PDF
2000
Anna Korhonen. 2000. Using Semantically Motivated Estimates to Help
Subcategorization Acquisition.
In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing
and Very Large Corpora. Hong Kong. 216-223. PDF / PS
Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000. Statistical
Filtering and Subcategorization Frame Acquisition.
In Proceedings of the Joint SIGDAT Conference on
Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. 199-205.
PDF / PS
Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000.
Is Hypothesis Testing Useful for Subcategorization Acquisition?
Technical Report UCAM-CL-TR-491. Computer Laboratory, University of
Cambridge. PDF
1999
Melanie Baljko and Anna Korhonen (eds.). 1999.
Proceedings of the ACL 1999 Student Session.
University of Maryland, Maryland. PDF
1998
Anna Korhonen. 1998. Automatic Extraction of Subcategorization
Frames from Corpora - Improving Filtering with Diathesis Alternations.
In Proceedings of the ESSLLI 98 Workshop on Automated Acquisition of Syntax and Parsing.
Saarbrucken, Germany. 49-56. PDF / PS
Diana McCarthy and Anna Korhonen. 1998. Detecting Verbal Participation
in Diathesis Alternations. In Proceedings of the ALC-COLING 98. Montreal, Canada.
1493-1495. PDF / PS
1997
Ted Briscoe, John Carroll and Anna Korhonen. 1997. Automatic
Extraction of Subcategorization Frames from Corpora - a Framework and 3 Experiments.
'97 Sparkle WP5 Deliverable. PDF / PS
Anna Korhonen. 1997. Acquiring Subcategorization from Textual
Corpora. MPhil dissertation. Department of Engineering, University of Cambridge.
PS
|
|
|
|
|
Links
|
|
|
 |
|
 |
 |
|
|
|