Research topics in NLP

  1. Optical character recognition (OCR) 光学字符识别

    Given an image representing printed text, determine the corresponding text.

  2. Question answering 问题回答

    Given a human-language question, determine its answer. Typical questions have a specific right answer (such as “What is the capital of Canada?”), but sometimes open-ended questions are also considered (such as “What is the meaning of life?”). Recent work has looked at even more complex questions.

  3. Recognizing textual entailment 认识文字蕴含

    Given two text fragments, determine whether one being true entails the other, entails the other’s negation, or allows the other to be either true or false.

  1. Relationship extraction 关系提取

    Given a chunk of text, identify the relationships among named entities (e.g., who is married to whom).
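
As a rough illustration of the "who is married to whom" case, the sketch below pulls relation triples out of text with a single hand-written pattern. The pattern, the `married_to` label, and the names in the sample sentence are all assumptions made for this example; practical systems rely on a named-entity recognizer and learned extractors rather than one regular expression.

```python
import re

# Assumed pattern: two capitalized names joined by the phrase "is married to".
NAME = r"[A-Z][a-z]+(?: [A-Z][a-z]+)*"
MARRIED = re.compile(rf"\b({NAME}) is married to ({NAME})")

def extract_marriages(text):
    """Return (person, 'married_to', person) triples found in the text."""
    return [(a, "married_to", b) for a, b in MARRIED.findall(text)]

# Invented sample sentence, for illustration only.
text = "Alice Smith is married to Bob Jones. The city is large."
print(extract_marriages(text))
```

A pattern like this only finds relations phrased exactly as written, which is why it serves as an illustration of the task's input and output rather than a usable extractor.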

  2. Sentiment analysis 情绪分析

    Extract subjective information, usually from a set of documents, often using online reviews to determine “polarity” about specific objects. It is especially useful for identifying trends of public opinion in social media for marketing purposes.
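
One simple way to picture polarity scoring is a word-list approach: sum hand-assigned weights for the opinion words in each review. The lexicon, its weights, and the sample reviews below are invented for this sketch; it is not the method of any particular tool, and it ignores negation and sarcasm.

```python
import re

# Toy polarity lexicon; words and weights are illustrative assumptions.
LEXICON = {"good": 1, "great": 2, "love": 2, "bad": -1, "terrible": -2, "hate": -2}

def polarity(review):
    """Sum the lexicon weights of the tokens in a review."""
    tokens = re.findall(r"[a-z']+", review.lower())
    return sum(LEXICON.get(t, 0) for t in tokens)

def label(score):
    """Map a numeric score to a coarse polarity label."""
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

reviews = ["Great camera, I love it", "Terrible battery", "It arrived on Monday"]
for review in reviews:
    print(label(polarity(review)), "-", review)
```

Aggregating such labels over many reviews of the same product gives a crude view of the opinion trend the description mentions.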

  3. Topic segmentation and recognition 主题细分与识别

    Given a chunk of text, separate it into segments, each of which is devoted to a topic, and identify the topic of each segment.
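
A minimal sketch of both halves of the task, assuming a crude lexical-cohesion heuristic: start a new segment wherever two adjacent sentences share no content words, then label each segment with its most frequent content word. The stop-word list, the heuristic, and the sample sentences are assumptions for illustration; real segmenters compare larger windows of text and use smoother similarity measures.

```python
import re
from collections import Counter

# Small assumed stop-word list; real systems use a much larger one.
STOPWORDS = {"the", "a", "an", "on", "of", "in", "and", "to"}

def content_words(sentence):
    """Lowercased words of a sentence, minus stop words."""
    return {w for w in re.findall(r"[a-z]+", sentence.lower()) if w not in STOPWORDS}

def segment(sentences):
    """Group sentences, breaking where neighbours share no content words."""
    if not sentences:
        return []
    segments = [[sentences[0]]]
    for prev, cur in zip(sentences, sentences[1:]):
        if content_words(prev) & content_words(cur):
            segments[-1].append(cur)
        else:
            segments.append([cur])
    return segments

def topic_label(segment_sentences):
    """Use the content word that occurs in the most sentences as the label."""
    counts = Counter(w for s in segment_sentences for w in content_words(s))
    return counts.most_common(1)[0][0]

sentences = [
    "The cat chased the mouse.",
    "The mouse hid under the sofa.",
    "Stock prices fell on Friday.",
    "Investors sold stock quickly.",
]
for seg in segment(sentences):
    print(topic_label(seg), "->", seg)
```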

  4. Word sense disambiguation 词义消歧

    Many words have more than one meaning; we have to select the meaning that makes the most sense in context. For this problem, we are typically given a list of words and associated word senses, e.g., from a dictionary or from an online resource such as WordNet.
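
To make the "list of words and associated word senses" concrete, here is a sketch of a simplified gloss-overlap approach (in the spirit of the Lesk algorithm, which the text above does not name): choose the sense whose dictionary gloss shares the most words with the surrounding sentence. The `SENSES` inventory and its glosses are hand-written assumptions standing in for a real dictionary or WordNet.

```python
import re

# Toy sense inventory (assumed); a real system would load glosses from a dictionary.
SENSES = {
    "bank": {
        "financial institution": "an institution that accepts deposits and lends money",
        "river side": "the sloping land beside a river or lake",
    }
}

STOPWORDS = {"a", "an", "the", "of", "or", "and", "that", "to", "in", "on"}

def tokens(text):
    """Set of lowercased words in the text, minus stop words."""
    return {w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS}

def disambiguate(word, sentence):
    """Pick the sense whose gloss overlaps most with the sentence."""
    context = tokens(sentence)
    glosses = SENSES[word]
    return max(glosses, key=lambda sense: len(context & tokens(glosses[sense])))

print(disambiguate("bank", "She went to the bank to deposit money"))
print(disambiguate("bank", "They fished from the bank of the river"))
```

Exact word matching is brittle ("deposit" does not match "deposits" here), so practical variants add stemming or compare against example sentences as well as glosses.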

  5. Automatic summarization 自动汇总

    Produce a readable summary of a chunk of text. Often used to provide summaries of text of a known type, such as articles in the financial section of a newspaper.
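
One basic way to produce such a summary is extractive: score each sentence by how frequent its words are in the whole text and keep the top-scoring sentences in their original order. The sketch below assumes that frequency heuristic, a small stop-word list, and an invented sample text; it is one possible baseline, not the approach of any specific system.

```python
import re
from collections import Counter

# Small assumed stop-word list.
STOPWORDS = {"the", "a", "an", "of", "in", "on", "and", "to", "was", "is", "because"}

def words(text):
    """Lowercased content words of the text."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]

def summarize(text, n=1):
    """Return the n highest-scoring sentences, kept in document order."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    freq = Counter(words(text))
    ranked = sorted(sentences, key=lambda s: sum(freq[w] for w in words(s)), reverse=True)
    top = ranked[:n]
    return " ".join(s for s in sentences if s in top)

text = (
    "Shares rose sharply today. "
    "Analysts said shares rose because earnings beat forecasts. "
    "The weather was mild."
)
print(summarize(text, n=1))
```

Summing word frequencies favors long sentences; dividing each score by sentence length is a common adjustment.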

  6. Coreference resolution 共指解析

    Given a sentence or larger chunk of text, determine which words (“mentions”) refer to the same objects (“entities”). Anaphora resolution is a specific example of this task, and is specifically concerned with matching up pronouns with the nouns or names to which they refer. The more general task of coreference resolution also includes identifying so-called “bridging relationships” involving referring expressions. For example, in a sentence such as “He entered John’s house through the front door”, “the front door” is a referring expression, and the bridging relationship to be identified is the fact that the door being referred to is the front door of John’s house (rather than of some other structure that might also be referred to).

  7. Discourse analysis 话语分析

    This rubric includes a number of related tasks. One task is identifying the discourse structure of connected text, i.e., the nature of the discourse relationships between sentences (e.g., elaboration, explanation, contrast). Another possible task is recognizing and classifying the speech acts in a chunk of text (e.g., yes-no question, content question, statement, assertion).

  8. Speech recognition 语音识别

    Given a sound clip of a person or people speaking, determine the textual representation of the speech. This is the opposite of text to speech and is one of the extremely difficult problems colloquially termed “AI-complete” (see above). In natural speech there are hardly any pauses between successive words (see below). Note also that in most spoken languages, the sounds representing successive letters blend into each other in a process termed coarticulation, so the conversion of the analog signal to discrete characters can be a very difficult process.

  9. Speech segmentation 语音分割

    Given a sound clip of a person or people speaking, separate it into words. A subtask of speech recognition and typically grouped with it.


  10. Text to speech 文字转语音

