Anna Korhonen - Royal Society University Research Fellow
University of Cambridge Computer Laboratory
William Gates Building, 15 JJ Thomson Avenue
Cambridge CB3 0FD, UK
Office: GS-12
Phone: (+44) 1223 763 672
Department of Theoretical and Applied Linguistics (DTAL)
Faculty of English Building, 9 West Road
Cambridge CB3 9DB, UK
Office: TR-12
Phone: (+44) 1223 767 389
Email:
anna.korhonen @ cl.cam.ac.uk
News
Prospective PhD students: I supervise PhD students at the Computer Laboratory and DTAL. Please take a look at my current research interests and ongoing projects, and read the departmental pages on postgraduate opportunities before contacting me. If you are interested in pursuing a PhD, please contact me well in advance.
Research
My research has principally been in the area of Natural Language Processing and Computational Linguistics. Some current areas of interest include:
- automatic lexical acquisition
- computational lexical semantics
- computational models of discourse
- lexical and domain adaptation
- unsupervised and lightly supervised approaches to NLP
- scientific text processing and text mining
- NLP for biomedicine
- NLP for real-world applications
- computational models of human language learning
- computational neuro-linguistics
Biography
I am a Royal Society University Research
Fellow at
the University of Cambridge
where I have a joint affiliation between
the Computer Laboratory
and the Department of Theoretical and Applied Linguistics (DTAL).
- JSPS Postdoctoral Fellow, National Institute of Informatics, Tokyo, Japan (2004-2005)
- Visiting researcher, University of Pennsylvania, Department of Computer and Information Science (2004)
- Post-doctoral researcher, University of Cambridge Computer Laboratory (2001-2003)
- PhD in Computer Science, University of Cambridge Computer Laboratory, Trinity Hall) (1998-2001)
- MPhil in Computer Speech and Language Processing, Department of Engineering, University of Cambridge (1996-1997)
- MA in Theoretical Linguistics, University of Reading, School of Linguistics and Applied Language Studies (1994-1995)
People
Current PhD students:
-
Yufan Guo. Discovering the information structure of documents.
- Felix Hill. Learning adjective semantics for natural language generation.
- Colin Kelly. Acquisition of conceptual representations from corpora.
- Tom Lippincott. Domain variation and adaptation in biomedicine.
- Stuart Moore. Number sense disambiguation.
- Lin Sun. Automatic verb classification.
- Quang Phu (Chris) Vo. Word sense induction.
Current postdocs:
Past PhD students and postdocs:
Projects
-
PANACEA -
Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies.
Funded by EU FP7 (2010-2012).
Working with Laura Rimell, and project partners UPF (Spain), CNR-ILC (Italy), ILSP (Greece), Linguatec (Germany), DCU (Ireland) - The Education First-Cambridge Learner Corpus of English -
a data driven approach to second language learning.
Funded by EF and Isaac Newton Trust (2011-2013).
Working with Dora Alexopoulou, Brechtje Post and Jeroen Geertzen. -
Lexical Acquisition for the Biomedical Domain
Funded by EPSRC (2009-2012).
Working with Lin Sun, Diarmuid Ó Séaghdha, and Tom Lippincott. -
CRAB - Using Text Mining to Aid Cancer Risk Assessment.
Funded by MRC, EU and FSA and FORMAS in Sweden (2007-2014)
Working with Ulla Stenius, Johan Hogberg, Ilona Silins, Lin Sun and Yufan Guo. - Developing Lexical Resources for Natural Language Processing Applications.
University Research Fellowship.
Funded by the Royal Society (2005-2013).
- Developing Multilingual Technologies for Automatic Lexical Acquisition.
Funded by Isaac Newton Trust (2010-2012).
Working with Tin Van de Cruys and Thierry Poibeau. - COMPLEX - Computational Natural Language Processing and the Neuro-Cognition of Language.
Co-funded by EPSRC, ESRC and MRC (2008-2011).
Working with with Lorraine K. Tyler, William Marslen-Wilson, and Paula Buttery. - Developing Multilingual Technologies for Automatic Lexical Acquisition.
Funded by British Council (2008-2009).
Working with Thierry Poibeau. - ACLEX - Accurate and Comprehensive Lexical Classification for Natural Language Processing Applications.
Funded by EPSRC (2005-2008).
Working with Ted Briscoe and Judita Preiss. - Using Automatic Verb Classification to Aid Event Extraction.
JSPS Postdoctoral Fellowship.
Funded by the Japan Society for the Promotion of Science (2004-2005) - FLYSLIP - Integrating Literature, Experiments and Curation in Drosophila Genomics Research.
Funded by BBSRC (2004-2007).
With Ted Briscoe, Simone Teufel, and Rachel Drysdale.
Teaching
I have (co-)taught the following courses in Cambridge
Computer Laboratory:
- Natural Language Processing, Computer Science Tripos
- Natural Language Processing modules, MPhil in Advanced Computer Science
- Biomedical Informatics, MPhil in Advanced Computer Science
RCEAL and DTAL:
- Computational Linguistics
- Computational Corpus Linguistics
- Computational Lexical Semantics
- Computational Language Learning
Activities
Current activities:
- Program Co-Chair for EMNLP 2013 with Tim Baldwin
- Publicity Co-Chair for ACL 2013
- Shared Task Co-Chair for *SEM 2013 with Malvina Nissim
- Editorial Board member for Computational Linguistics
(2011-2013)
- Board member for SIGLEX (2010-2013)
- A member of Association for Computational Linguistics
- A member of Cambridge Neuroscience
- A member of Cambridge Cancer Centre
Recent activities:
- Area Chair for EMNLP-CoNLL-2012
- Co-chair for the
EACL 2012 ROBUS-UNSUP Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP
with
Roi Reichart,
Omri Abend,
Ari Rappoport,
Anders Soegaard,
and
Chris Biemann
- Co-chair for the
EACL 2012 Workshop on Computational Models of Language Acquisition and Loss
with
Aline Villavicencio,
Thierry Poibeau,
and
Bob Berwick
-
Co-chair for the
The EMNLP 2011 Workshop on Unsupervised Learning in NLP
with
Roi Reichart,
Omri Abend
and
Ari Rappoport
-
Co-chair for the
The NIPS 2011 Workshop MLINI - Machine Learning and Interpretation in Neuroimaging
-
Co-chair for the
NAACL-HLT-2010 Workshop on Computational Neurolinguistics
with
Brian Murphy and
Kai-min Kevin Chang
- Co-chair for the Interdisciplinary Workshop on Verbs - The Identification and Representation of Verb Features, Scuola Normale Superiore, Pisa, November 4-5, 2010 with Sabine Schulte im Walde, Aline Villavicencio, Alessandro Lenci, Alissa Melinger, and Pier Marco Bertinetto
- Area Chair for EACL-2009
- Co-organizer for the Nordic Conference in Computational Linguistics 2009
-
Co-chair for the
ACL-2007 Workshop on Cognitive Aspects of Computational Language Acquisition
with
Paula Buttery and
Aline Villavicencio
- Co-organizer for the ESSLLI-2006 Course in Data-driven Methods for Acquiring Linguistic Information with Tim Baldwin, Aline Villavicencio and Valia Kordoni
- Co-organizer for the ACL-SIGLEX 2005 Workshop on Deep Lexical Acquisition with Tim Baldwin and Aline Villavicencio
- Co-organizer for the ACL-2004 Workshop on Multiword Expressions: Integrating Processing with Takaaki Tanaka, Aline Villavicencio and Francis Bond
- Co-organizer for SENSEVAL-3 task with Judita Preiss
- Co-editor for the Computer Speech and Language Special Issue on Multiword Expressions with Aline Villavicencio, Francis Bond and Diana McCarthy
- Co-organizer for the ACL-2003 workshop on Multiword Expressions: Analysis, Acquisition and Treatment with Francis Bond, Diana McCarthy and Aline Villavicencio
Media
Mining the Language of Science. Research Horizons. November 18, 2011.
Computer System Developed to Analyse the Cancer Risk of a Chemical. CNN News. November 21, 2011.
Publications
Always in need of updating!
2013
Yufan Guo, Roi Reichart and Anna Korhonen. 2013. Improved Information Structure Analysis of Scientific Documents Through Discourse and Lexical Constraints. To appear in Proceedings of the NAACL-HLT 2013, Atlanta, US.
Tim Van de Cruys, Thierry Poibeau and Anna Korhonen. 2013. A Tensor-based Factorization Model of Semantic Compositionality. To appear in Proceedings of the NAACL-HLT 2013, Atlanta, US.
Yufan Guo, Ilona Silins, Ulla Stenius and Anna Korhonen. 2013. Active learning-based information structure analysis of full scientific articles and two applications for biomedical literature review. To appear in Bioinformatics.
LINK
Thomas Lippincott, Laura Rimell, Karin Verspoor and Anna Korhonen. 2013. Approaches to verb subcategorization for biomedicine . Journal of Biomedical Informatics. Volume 46, Issue 2. Pages 212-227.
LINK
Thomas Lippincott, Laura Rimell, Helen L. Johnson, Karin Verspoor and Anna Korhonen. 2013. Acquisition and evaluation of verb subcategorization resources for biomedicine. Journal of Biomedical Informatics. Volume 46, Issue 2. Pages 228-237.
LINK
Aline Villavicencio, Thierry Poibeau, Anna Korhonen and Afra Alishahi. 2013.
Cognitive Aspects of
Computational Language Acquisition.
Springer.
LINK
Thierry Poibeau, Aline Villavicencio, Anna Korhonen and Afra Alishahi. 2013.
Computational Modeling as a Methodology for Studying Human Language Learning . In Cognitive Aspects of
Computational Language Acquisition. Springer.
LINK
Anna Korhonen. 2013. Tools and Procedures for the Acquisition of Morphological and Syntactical Information from Corpora. To Appear in the International Handbook of Dictionaries. Mouton de Gruyter, Berlin.
2012
Roi Reichart and Anna Korhonen. 2012. Document and Corpus Level Inference For Unsupervised Learning of Information Structure of Scientific Documents. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK
Tim Van de Cruys, Laura Rimell, Thierry Poibeau, and Anna Korhonen. 2012. Multi-way Tensor Factorization for Unsupervised Lexical Acquisition. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK
Ekaterina Shutova, Tim van de Cruys and Anna Korhonen. 2012. Unsupervised Metaphor Paraphrasing Using a Vector Space Model. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK
Danish Contractor, Yufan Guo and Anna Korhonen. 2012. Using Argumentative Zones for Extractive Summarization of Scientific Articles. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK
Yufan Guo, Ilona Silins, Roi Reichart and Anna Korhonen. 2012. CRAB Reader: A Tool for Analysis and Visualization of Argumentative Zones in Scientific Literature. In Proceedings of the 24th International Conference on Computational Linguistics (COLING), Mumbai, India.
LINK
Ekaterina Shutova, Simone Teufel and Anna Korhonen. 2012. Statistical Metaphor Processing. Computational Linguistics, 39(2).
LINK
Tom Lippincott, Diarmuid Ó Séaghdha and Anna Korhonen. 2012. Learning syntactic verb frames using graphical models. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012). Jeju, Korea.
LINK
Sandeep Kadekar, Ilona Silins, Anna Korhonen, Kristian Dreij, Lauy Al-Anati, Johan Hogberg and Ulla Stenius. 2012. Exocrine pancreatic carcinogenesis and autotaxin expression. PLoS ONE 7(8): e43209.
LINK
Anna Korhonen, Diarmuid Ó Séaghdha, Ilona Silins, Lin Sun, Johan Hogberg and Ulla Stenius. 2012. Text mining for literature review and knowledge discovery in cancer risk assessment and research. PLoS ONE 7(4):e33427.
LINK
Diarmuid Ó Séaghdha and Anna Korhonen. 2012. Modelling selectional preferences in a lexical hierarchy. In Proceedings of the 1st Joint Conference on Lexical and Computational Semantics (*SEM 2012). Montreal, QC.
LINK
Laura Rimell, Thierry Poibeau, and Anna Korhonen. 2012. Merging Lexicons for Higher Precision Subcategorization Frame Acquisition. Proceedings of the LREC 2012 Workshop on Language Resource Merging, Istanbul, Turkey.
LINK
Ilona Silins, Anna Korhonen, Johan Hogberg and Ulla Stenius. 2012. Data and Literature Gathering in Chemical Cancer Risk Assessment. Integrated Environmental Assessment and Management. 2012, Jan 3.
LINK
Colin Kelly, Barry Devereux and Anna Korhonen. 2012. Semi-supervised learning for automatic conceptual property extraction. Proceedings of the NAACL 2012 Cognitive
Modeling and Computational Linguistics Workshop.
LINK
Omri Abend, Chris Biemann, Anna Korhonen, Ari Rappoport, Roi Reichart and Anders Sogaard. 2012. Proceedings of the EACL Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP.
LINK
Robert Berwick, Anna Korhonen, Thierry Poibeau and Aline Villavicencio. 2012. Proceedings of the EACL Workshop on Computational Models of Language Acquisition and Loss.
LINK
2011
Yufan Guo, Anna Korhonen, Ilona Silins and Ulla Stenius. 2011. Weakly-supervised learning of information structure of scientific abstracts - is it accurate enough to benefit real-world tasks in biomedicine? Bioinformatics 2011; doi: 10.1093/bioinformatics/btr536.
LINK
Tom Lippincott, Diarmuid Ó Séaghdha and Anna Korhonen. 2011. Exploring subdomain variation in biomedical language. BMC Bioinformatics 12:212.
LINK
Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Johan Hogberg and Ulla Stenius. 2011. A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment. BMC Bioinformatics 2011, 12:69.
LINK
Lin Sun and Anna Korhonen. Hierarchical Verb Clustering Using Graph Factorization. 2011. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK
Yufan Guo, Anna Korhonen and Thierry Poibeau. 2011. A Weakly-supervised Approach to Argumentative Zoning of Scientific Documents. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK
Tim Van de Cruys, Thierry Poibeau and Anna Korhonen. 2011. Latent Vector Weighting for Word Meaning in Context Edinburgh. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK
Diarmuid Ó Séaghdha and Anna Korhonen. 2011. Probabilistic models of similarity in syntactic context. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP). Edinburgh, UK.
LINK
Omri Abend, Anna Korhonen, Ari Rappoport and Roi Reichart. 2011. Proceedings of the EMNLP Workshop on Unsupervised Learning in NLP.
LINK
Barry Devereux, Anna Korhonen, Paula Buttery and Lorraine Tyler. 2011. The role of verb subcategorization frames and selectional preferences in sentence processing: an investigation using corpus-derived measures. Multidisciplinary Workshop on the mental representation of verbal argument structure. Paris, France.
Barry Devereux, Anna Korhonen and Lorraine Tyler. 2011. Parsing sentences are unlikely: corpus-based analyses of the neural processing of verbs. International Conference on Cognitive Neuroscience (ICON). Palma, Mallorca, Spain.
LINK
Jie Zhuang, Barry Devereux, Anna Korhonen and Lorraine Tyler. 2011. Lexical and syntactic competition effects in verb processing: evidence from corpus-based statistics. International Conference on Cognitive Neuroscience (ICON). Palma, Mallorca, Spain.
LINK
Colin Kelly, Barry Devereux and Anna Korhonen. 2011. Automatic extraction of property norm-like features from large text corpora with gold standard, human and semantic-similarity evaluations. AMLaP. Paris, France.
LINK
2010
Anna Korhonen. 2010. Automatic Lexical Classification - Bridging
Research and Practice. In Philoshophical Transactions A of the Royal Society. 368: 3621-3632.
LINK
Barry Devereux, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen. 2010. Towards unrestricted, large-scale acquisition of feature-based conceptual representations from corpus data. Research on Language and Computation.
PDF
Lin Sun, Thierry Poibeau, Anna Korhonen and Cedric Messiant. 2010. Investigating the cross-linguistic potential of VerbNet -style classification. In Proceedings of Coling. Beijing, China.
PDF
Ekaterina Shutova, Lin Sun and Anna Korhonen. 2010. Metaphor Identification Using Verb and Noun Clustering. In Proceedings of Coling. Beijing, China.
PDF
Tom Lippincott, Diarmuid O Seaghdha, Lin Sun and Anna Korhonen. 2010. Exploring variation across biomedical subdomains. In Proceedings of Coling. Beijing, China.
PDF
Barry Devereux, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen, 2010. Large-Scale Acquisition of Feature-Based Conceptual Representations from Textual Corpora. In Proceedings of the Annual Meeting of the Cognitive Science Society.
PDF
Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Lin Sun and Ulla Stenius. 2010. Identifying the Information Structure of Scientific Abstracts: An Investigation of Three Different Schemes. In Proceedings of bio-NLP 2010. Uppsala, Sweden
PDF
Sandeep Kadekar, Ilona Silins, Anna Korhonen, Johan Hogberg, Kristian Dreij, and Ulla Stenius. 2010. Carcinogen-induced inflammation and pancreatic cancer. In Proceedings of the
101th Annual Meeting of the American Association for Cancer Research. Washington, D.C., USA.
PDF
Colin Kelly, Barry Devereux and Anna Korhonen. 2010. Acquiring Human-like Feature-Based Conceptual Representations from Corpora. In Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
PDF
Barry Devereux, Colin Kelly and Anna Korhonen. 2010. Using fMRI Activation to Conceptual Stimuli to Evaluate Methods for Extracting Conceptual Representations from Corpora. In Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
PDF
Brian Murphy, Kai-min Kevin Chang and Anna Korhonen. 2010. Proceedings of the NAACL-HLT Workshop on Computational Neurolinguistics. Los Angeles, CA, USA.
LINK
Stuart Moore, Anna Korhonen and Sabine Buchholz. 2010. Annotating the Enron Email Corpus with Number Senses. In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10). Valletta, Malta.
PDF
Barry Devereux, Colin Kelly, Nicholas Pilkington, Thierry Poibeau and Anna Korhonen, 2010. The Acquisition of Unconstrained Feature-Based Conceptual Representations from Corpora. The Rovereto Workshop on Concepts, Actions, and Objects: Functional and Neural Perspectives.
PDF
2009
Anna Korhonen. 2009. Automatic Lexical Classification - Balancing between Machine Learning and Linguistics.
In Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation. Hong Kong.
PDF
Anna Korhonen, Lin Sun, Ilona Silins, and Ulla Stenius. 2009.
The First Step in the Development of Text Mining Technology for Cancer Risk Assessment:
Identifying and Organizing Scientific Evidence in Risk Assessment Literature. In BMC Bioinformatics 2009, 10:303.
PDF
Stuart Moore, Anna Korhonen and Sabine Buchholz. 2009.
Number Sense Disambiguation. In
Proceedings of the 12th Conference of the Pacific Association for Computational Linguistics. Sapporo, Japan.
PDF
Lin Sun and Anna Korhonen. 2009.
Improving Verb Clustering with Automatically Acquired Selectional Preferences.
In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore.
PDF
Lin Sun, Anna Korhonen, Ilona Silins, and Ulla Stenius. 2009.
User-Driven Development of Text Mining Resources for Cancer Risk Assessment.
In Proceedings of BioNLP. Boulder, Colorado.
PDF
Karin Kipper-Schuler, Anna Korhonen, and Susan Brown. 2009.
Proceedings of the NAACL 2009 Tutorial on VerbNet and Its Applications. North
American Chapter of the Association for Computational Linguistics -
Human Language Technologies (NAACL HLT) 2009 Boulder, Colorado.
PDF
Ilona Silins, Anna Korhonen, Johan Hogberg, Lin Sun, and Ulla Stenius. 2009.
Improved Cancer Risk Assessment Using Text Mining.
In Proceedings of the 100th Annual Meeting of the American Association for Cancer Research. Denver, Colorado.
PDF
Andreas Vlachos, Anna Korhonen, and Zoubin Ghahramani. 2009.
Unsupervised and Constrained Dirichlet Process Mixture Models for Verb Clustering.
In Proceedings of the EACL workshop on GEometrical Models of Natural Language Semantics. Athens, Greece.
PDF
2008
Anna Korhonen, Yuval Krymolowski and Nigel Collier. 2008.
The Choice of
Features for Classification of Verbs in Biomedical Texts. In Proceedings of Coling 2008. Manchester, UK.
PDF
Ian Lewin, Ilona Silins, Anna Korhonen, Johan Hogberg, and Ulla Stenius. 2008.
A New Challenge for Text Mining: Cancer Risk Assessment.
In Proceedings of the ISMB BioLINK Special Interest Group on Text Data Mining. Toronto, Canada.
PDF
Andreas Vlachos, Zoubin Ghahramani and Anna Korhonen. 2008.
Dirichlet Process Mixture Models for Verb Clustering.
In Proceedings of the ICML Workshop on Prior Knowledge for Text and Language. Helsinki, Finland.
PDF
Cedric Messiant, Anna Korhonen and Thierry Poibeau. 2008.
LexSchem: A Large Subcategorization Lexicon for French Verbs.
In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC).
Marrakech, Morocco.
PDF
Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2008.
A Large-Scale Classification of English Verbs. In the
Journal of Language Resources and Evaluation. 42(1). 21-40.
LINK
Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008.
Verb Class Discovery from Rich Syntactic Data. In
Proceedings of the 9th International Conference on Intelligent Text Processing
and Computational Linguistics. Haifa, Israel.
PDF
Lin Sun, Anna Korhonen, and Yuval Krymolowski. 2008. Automatic Classification of English
Verbs Using Rich Syntactic Features. In Proceedings of the 3rd International Joint
Conference on Natural Language Processing. Hyderabad, India.
PDF
2007
Judita Preiss, Ted Briscoe and Anna Korhonen. 2007. A System for Large-scale Acquisition
of Verbal, Nominal and Adjectival Subcategorization Frames from Corpora. In
Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. Prague, Czech Republic.
PDF
Paula Buttery and Anna Korhonen. 2007. I will shoot your shopping down and you can shoot all my tins -
Automatic Lexical Acquisition from the CHILDES Database. In
Proceedings of ACL 2007 Workshop on Cognitive Aspects of Computational Language Acquisition. Prague, Czech Republic.
PDF
Paula Buttery, Aline Villavicencio and Anna Korhonen. 2007.
The proceedings of the ACL 2007 Workshop on Cognitive Aspects of
Computational Language Acquisition.
Prague, Czech Republic.
PDF
2006
Anna Korhonen, Yuval Krymolowski, and Nigel Collier. 2006.
Automatic Classification of Verbs in Biomedical Texts.
In Proceedings of ACL-COLING 2006. Sydney, Australia.
PDF
Yoko Mizuta, Anna Korhonen, Tony Mullen and Nigel Collier. 2006.
Zone Analysis in Biology Articles as a Basis for Information Extraction. In
the International Journal of Medical Informatics on Natural Language
Processing in Biomedicine and Its Applications. 75(6). 468-87.
PDF
Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006. A Large-Scale Extension of VerbNet with Novel Verb Classes.
In Proceedings of EURALEX. Turin, Italy.
DOC
Anna Korhonen, Yuval Krymolowski, and Ted Briscoe. 2006.
A Large Subcategorization Lexicon for Natural Language Processing Applications.
In Proceedings of the 5th international conference on Language Resources and Evaluation. Genova, Italy.
PDF
Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. 2006.
Extending VerbNet with Novel Verb Classes.
In Proceedings of 5th international conference on Language Resources and Evaluation. Genova, Italy.
PDF
2005
Aline Villavicencio, Francis Bond, Anna Korhonen, and Diana McCarthy. 2005.
Introduction to the Special Issue on Multiword Expressions: Having a Crack at a Hard Nut. In Computer Speech and Language. 19(4). 365-377.
LINK
Jeremy Yallop, Anna Korhonen and Ted Briscoe. 2005.
Automatic Acquisition of Adjectival Subcategorization from Corpora.
In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics.
Ann Arbor, Michigan.
PDF
Timothy Baldwin, Anna Korhonen and Aline Villavicencio. 2005.
Proceedings of the ACL-SIGLEX 2005 Workshop on Deep Lexical Acquisition.
Ann Arbor, Michigan.
PDF
Paula Buttery and Anna Korhonen. 2005.
Large-scale Analysis of Verb Subcategorization Differences between Child Directed Speech and Adult Speech.
In Proceedings of the Interdisciplinary Workshop on the Identification and Representation of Verb Features and Verb Classes.
Saarbrucken, Germany.
PDF
2004
Judita Preiss and Anna Korhonen. 2004.
WSD for Subcategorization Acquisition Task Description. In
Proceedings of the ACL SENSEVAL-3 Workshop.
Barcelona, Spain.
PDF
Takaaki Tanaka, Aline Villavicencio, Francis Bond and Anna Korhonen. 2004.
Proceedings of the ACL-SIGLEX 2004 Workshop on Multiword Expressions: Integrating Processing.
Barcelona, Spain.
PDF
Anna Korhonen and Ted Briscoe. 2004.
Extended Lexical-Semantic Classification of English Verbs. In
Proceedings of the HLT/NAACL Workshop on Computational Lexical Semantics. Boston, MA.
PDF
2003
Anna Korhonen, Yuval Krymolowski and Zvika Marx. 2003. Clustering
Polysemic Subcategorization Frame Distributions Semantically.
In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics.
Sapporo, Japan. 64-71.
PDF
Anna Korhonen and Judita Preiss. 2003. Improving Subcategorization
Acquisition using Word Sense Disambiguation.
In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics.
Sapporo, Japan. 48-55.
PDF
Francis Bond, Diana McCarthy, Anna Korhonen and Aline Villavicencio. 2003.
Proceedings of the ACL-SIGLEX 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment.
Sapporo, Japan.
PDF
2002
Anna Korhonen. 2002. Assigning Verbs to Semantic Classes via WordNet.
In Proceedings of the COLING Workshop on Building and Using Semantic Networks.
Taipei, Taiwan.
PDF
Anna Korhonen and Yuval Krymolowski. 2002. On the Robustness
of Entropy-Based Similarity Measures in Evaluation of Subcategorization Acquisition
Systems. In Proceedings of the Sixth Conference on Natural Language Learning.
Taipei, Taiwan. 91-97.
PDF
Anna Korhonen. 2002. Semantically Motivated Subcategorization Acquisition.
In Proceedings of the ACL Workshop on Unsupervised Lexical Acquisition.
Philadelphia, USA. 51-58.
PS
Judita Preiss and Anna Korhonen. 2002. Improving Subcategorization
Acquisition with WSD. In Proceedings of the ACL Workshop on Word Sense
Disambiguation: Recent Successes and Future Directions. Philadelphia, USA. 102-108.
PDF
Judita Preiss, Anna Korhonen and Ted Briscoe. 2002.
Subcategorization Acquisition as an Evaluation Method for WSD.
In Proceedings of LREC. Canary Islands, Spain. 1551-1556.
PDF
Anna Korhonen. 2002. Subcategorization Acquisition.
PhD thesis published as Technical Report UCAM-CL-TR-530. Computer Laboratory, University of
Cambridge.
PDF
2000
Anna Korhonen. 2000. Using Semantically Motivated Estimates to Help
Subcategorization Acquisition.
In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing
and Very Large Corpora. Hong Kong. 216-223.
PDF
Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000. Statistical
Filtering and Subcategorization Frame Acquisition.
In Proceedings of the Joint SIGDAT Conference on
Empirical Methods in Natural Language Processing and Very Large Corpora. Hong Kong. 199-205.
PDF
Anna Korhonen, Genevieve Gorrell and Diana McCarthy. 2000.
Is Hypothesis Testing Useful for Subcategorization Acquisition?
Technical Report UCAM-CL-TR-491. Computer Laboratory, University of
Cambridge.
PDF
1999
Melanie Baljko and Anna Korhonen. 1999.
Proceedings of the ACL 1999 Student Session.
University of Maryland, Maryland.
PDF
1998
Anna Korhonen. 1998. Automatic Extraction of Subcategorization
Frames from Corpora - Improving Filtering with Diathesis Alternations.
In Proceedings of the ESSLLI 98 Workshop on Automated Acquisition of Syntax and Parsing.
Saarbrucken, Germany. 49-56.
PDF
Diana McCarthy and Anna Korhonen. 1998. Detecting Verbal Participation
in Diathesis Alternations. In Proceedings of the ALC-COLING 98. Montreal, Canada.
1493-1495.
PDF
1997
Ted Briscoe, John Carroll and Anna Korhonen. 1997. Automatic
Extraction of Subcategorization Frames from Corpora - a Framework and 3 Experiments.
'97 Sparkle WP5 Deliverable.
PDF
Anna Korhonen. 1997. Acquiring Subcategorization from Textual
Corpora. MPhil dissertation. Department of Engineering, University of Cambridge.
PS
- © Anna Korhonen. Last updated: November 2012.
