Archivos de la categoría ‘Human Language Technologies (Q2)’

Explanation of three of the topics

Marzo 30, 2008

In this article I’ll make the asked explanation on three of the topics we have spoken about recently:

This first topic I’m going to talk about is the “Humaine” or “Human-machine interaction Network on emotions” one of the current projets of the German Research Center for Artificial Intelligence.

Humaine aims to lay the foundations for European development of systems that can register, model and influence human emotional and emotion-related states and processes – ‘emotion-oriented systems’. Such systems may be central to future interfaces, but their conceptual underpinnings are not sufficiently advanced to be sure of their real potential or the best way to develop them.

One of the reasons is that relevant knowledge is dispersed across many disciplines. Humaine brings together leading experts from the key disciplines in a programme designed to achieve intellectual integration. It identifies six thematic areas that cut across traditional groupings and offer a framework for an appropriate division of labour – theory of emotion; signal/sign interfaces; the structure of emotionally coloured interactions; emotion in cognition and action; emotion in communication and persuasion; and usability of emotion-oriented systems. Teams linked to each area will run a workshop in it and carry out joint research to define an exemplar embodying guiding principles for future work in their area.

The second topic on which I am going to focus is the one called “Whiteboard“; a completed project of the same research centre. This project focused on the “Multilevel annotation for dimamic free text processing”.

The project aimed at designing, implementing, investigating and evaluating a new system architecture that facilitated the combination of different language technologies for a range of practical applications. Language technologies offered numerous means for a partial analysis of texts that could be employed for information retrieval, information extraction, language checking, and many other applications. Processing methods and tools differed along several dimensions, e.g., wrt. levels of linguistic description, depth of analysis, or the way knowledge of language is derived (linguistically or statistically).

Methods often overlaped in their functionality but differed in their strengths and weaknesses. Finding optimal combinations of heterogeneous techniques and processing components was one of the most difficult tasks in language processing – the challenge of the Whiteboard project. The novel architecture to be developed and explored in Whiteboard was based on the concept of an annotated text. The different LT components enriched an XML. Each component can exploit or disregard previously assigned annotations. Its architecture had a single shared data structure, which at the same time was the input, throughput, and output of the system. The envisaged architecture permited the pragmatic combination of different processing approaches, most notably novel ways of the combination of shallow and deep methods.

Finally, the last topic I had picked to focus on is the “Neca” or “The net environment for embodied emotional conversational agents”; one of the previous projects of the Austrian Research Institute for Artificial Intelligence.

The objective of the NECA project was to develop a new generation of mixed multi-user / multi-agent virtual spaces populated by affective conversational agents. The agents are be able to express themselves through synchronised emotional speech and non-verbal expression, generated from an abstract representation. This is the first time that such expressive capabilities are featured in Internet applications. The agents’ usefulness were evaluated in two concrete application scenarios. From a technical point of view, the NECA platform provides a confederation of dedicated components including an affective reasoner, co-ordinated generation of verbal and nonverbal aspects of communication, and emotional speech synthesis, thus providing a basis for the development of new Internet applications with emotional agents.

Sources:

http://www.dfki.de/pas/f2w.cgi?ltp/humaine-e

http://www.dfki.de/pas/f2w.cgi?ltc/whiteboard-e

http://www.ofai.at/research/nlu/projects/nlproject_neca.html

Recent research topics

Marzo 30, 2008
In this article recent research topics mentioned on different sites of Human Language technologies will be pointed out.

Referring to Researc Centers, the following ones are the most remarkables:

- In the German Research Centre the following themes are the most elaborated in research:

  • exploiting – and automatically extending – ontologies for content processing.
  • tighter integration of shallow and deep techniques in processing.
  • enriching deep processing with statistical methods.
  • combining language checking with structuring tools in document authoring.
  • document indexing for German and English.
  • automatically associating recognized information with related information and thus building up collective knowledge.
  • automatically structuring and visualizing extracted information.
  • processing information encoded in multiple languages, among them Chinese and Japanese.

- The Stanford Natural Processing Language Processing group of California works in several grounds:

  • Basic research on conputational-linguistic.
  • Grammar induction.
  • Sentence understanding.
  • Word sense disanbiguation.
  • Automatic question answering.

- The Edinburgh Language Technology Group produces research in the following areas:

  • Combining Shallow Semantics and Domain Knowledge.
  • Text Mining for Biomedical Content Curation.
  • Cross-retail Multi-agent Retail Comparison .
  • Smart Qualitalive Data: Methods and Community tools for Data Mark-up.
  • Machine Learning for Named Entity Recognition.
  • Integrated Models and Tools for Fine-Grained Prosody in Discourse.
  • Joint Action Science and Technology.
  • AMI consorting projects that are developing technologies for meeting browsing and to assist people participating in meetings from a remote location.
  • Study of how pairs collaborate when in planning a route on a map (Collaborating using diagrams).

Between the most highlighted research networks the most remarkable one could be:

- The European Network of Excellence in Human Language Technologies (ELSNET) is a plataform made up to reach the following goals:

  • Make an analisys of the present and future views.
  • Share knowledge and experience.
  • Work out innovative actions.
  • Make a united enviroment examination.
  • To unite the Human Language Technologies making posible the European research and developing.

Within the associations, The Spanish society for the procesement of Natural Language it is also of great importance and analizes these themes:

  • The fixing up of lexical ambiguity.
  • Rescuing of information of great importance.
  • Linguistic technics to work with multilinguism.
  • Linguistic knowledge to make possible the semantic errors.

Finally, within the latest conferences on Natural Language Processing, I have focused on the one called “XXIV. Edition of Anual Congress of Spanish Society for the processment of Natural Language 2008 (SEPLN ‘08)”.

-The main thematic areas of this conference were:

    • Linguistic, mathematic and psicolinguistic models of the language.
    • Linguistic of Corpus.
    • Automatic translation.
    • Recognizing th voice.
    • Semantic, pragmatism and discurs.
    • PLN industrial aplications.
    • Automatic analyis of texts’s containings.

Sources:

http://www-nlp.stanford.edu/

http://www.ltg.ed.ac.uk/projects

http://www.dfki.de/lt/projects.php

http://www.elsnet.org/http://www.sepln.org/

http://www.sepln.org/

http://basesdatos.uc3m.es/sepln2008/web/