Natural Language and Information Processing Research Group
Current or recent NLIP projects
- Distributional Compositional Semantics for Text Processing (DisCoTex) (see Stephen Clark's page)
- A Unified Model of Compositional and Distributional Semantics: Theory and Applications
- The Education First-Cambridge Learner Corpus of English - a data driven
approach to second language learning (see Anna Korhonen's page)
- PANACEA - Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies
- SpaceBook - Spatial & Personal Adaptive Communication Environment
- FAUST - Feedback for User Adaptive Statistical Translation
- Computational Natural Language Processing and the Neuro-Cognition of Language (no project web-page, see Anna Korhonen's page).
- CRAB: Using Text Mining to Aid Cancer Risk Assessment
- Integrating pragmatic insights with HPSG (no project web-page, see Ann Copestake's page).
- Applying Computational Semantics (no project web-page, see Ann Copestake's page).
- Delph-in interfaces project, funded by Boeing (no project web-page, see Ann Copestake's page).
Past projects
The group's research projects have included ones on language processing resources and tools, on logic and formalisms, on front ends e.g. for database and unstructured information access, and on speech processing. More recent and current projects, funded under both UK and European Programmes, have involved further development of tools and processors, automatic summarising, text and spoken message retrieval; natural language processing for formal specifications; and the acquisition of lexical knowledge and construction of multilingual lexical knowledge bases. Projects with individual pages are:
- Lexical Acquisition for the Biomedical Domain (no project web-page, see Anna Korhonen's page).
- Studying the appropriateness of different formulations of a discourse relation in context (no project web-page, see Advaith Siddharthan's page).
- SciBorg Extracting the Science from Scientific Publications: see also the SciBorg wiki page.
- FlySlip: Integrating Literature, Experiments and Curation in Drosophila Genomics Research
- CitRAZ: Rhetorical Citation Maps and Domain-Independent Argumentative Zoning
- ACLEX: Accurate and Comprehensive Lexical Classification for Natural Language Processing Applications
- Accurate and Efficient Parsing of Biomedical Text
- DeepThought
- Multiword expressions
- Robust Accurate Statistical Parsing (RASP)
- Alvey Natural Language Tools
- Acquilex
- The Cambridge/Acquilex Lexical Database System
Further details on research can be found on individual homepages which can be accessed via the list of NLIP people. For a list of PhD topics, please also see the list of NLIP people.
We constructed a research demo system which builds web pages automatically from group bibliographies: see the (slightly outdated) output for the NLIP group.
