
790 20. Knowledge Representation and Question Answering
Although the extraction of semantic relations appears to be at an early stage of devel-
opment (the process has not yet been described in detail by the LCC research group),
preliminary results are very encouraging (see Section 20.4 for an example of the use
of semantic relations).
The approach for the mapping of English text into LLF has been used, for example,
in the LCC QA system PowerAnswer [1, 20].
In the next section, we turn our attention to the reasoning task, and briefly describe
the reasoning component of the LCC QA system.
20.3 The COGEX Logic Prover of the LCC QA System
The approach used in many recent QA systems is roughly based on detecting matching
patterns between the question and the textual sources provided, to determine which
ones are answers to the question. We call the textual sources available to the system
candidate answers. Because of the ambiguity of natural language and of the large
amount of synonyms, however, these systems have difficulties reaching high success
rates (see, e.g., [20]). In fact, although it is relatively easy to find fragments of text that
possibly contain the answer to the question, it is typically difficult to associate to them
some kind of measure allowing to select one or more best answers. Since the candidate
answers can be conflicting, the inability to rank them is a substantial shortcoming.
To overcome these limitations, the LCC QA system has been recently extended
with a prover called
COGEX [20]. In high-level terms, COGEX is used to analyze the
connection between the question in input and the candidate answers obtained using
traditional QA techniques. Consider the question “Did John visit New York City on
Dec, 1?” and assume that the QA system has access to data sources containing the
fragments “John flew to the City on Dec, 1” and “In the morning of Dec, 1, John went
down memory lane to his trip to Australia”.
COGEX is capable of identifying that the
connection between question and candidate answer requires the knowledge that “New
York City” and “City” denote the same location, and that “flying to a location” implies
that the location will be visited. The type and number of these differences is used as a
measure of how close a question and candidate answer are—in our example, we would
expect that the first answer will be considered the closest to the question (as the second
does not describe an actual travel on Dec, 1). This measure gives an ordering of the
candidate answers, and ultimately allows the selection of the best matches.
The analysis carried out by
COGEX is based on world knowledge extracted from
WordNet (e.g., the descriptionof the meaningof “fly (to a location)”)as well asknowl-
edge about natural language (allowing to link “New York City” and “City”). In this
context, the descriptions of the meaning of words are often called glosses.
To be used in the QA system, glosses from WordNet have been collected and
mapped into logic forms. The resulting pairs &word, gloss_LLF' provide definitions
of word. Part of the associations needed to link “fly” and “visit” in the example above
are encoded in
COGEX by axioms (encoding complete definitions, from WordNet, of
those verbs with the meanings used in the example) such as
4
:
4
To complete the connection, axioms for “ travel” and “go” are also needed.