Task-Oriented

Semantic Interpretation



John F. Sowa  and  Arun K. Majumdar

VivoMind LLC









Topics Covered

  1. Background Knowledge

  2. Conceptual Graphs

  3. Knowledge Soup

  4. Finding Analogies

  5. A Large Application

  6. Comparison with Other Approaches

See paper:  http://www.jfsowa.com/pubs/tosi.htm






1. Background Knowledge


Two kinds of knowledge needed for understanding language:

  • Lexical knowledge:  Knowledge about words, word senses, and typical patterns of relations between words.

  • Task-oriented knowledge:  Knowledge about a task, how to do it, and the environment in which it is done.

General world knowledge includes the totality of task oriented knowledge for all the tasks that people do.












Lexical Knowledge

A sentence with an ambiguous word:

John went to the board.

Word sense determined by subsequent context:

  • He kicked it to see if it was rotten.

  • He wrote the homework assignment for the next week.

  • He presented arguments for the proposed merger.





Task-Oriented Knowledge

A driver stops the car to ask a question:

Driver:  Where is the nearest gas station?

Pedestrian:  Turn right at the corner, and go about two blocks.

D:  Thanks.

A short while later, the driver comes back:

D:  That gas station is closed.

P:  Yes. It hasn't been open for years.

D:  Why didn't you tell me?

P:  You didn't ask.

The pedestrian did not have or use task-dependent knowledge about cars, drivers, and gas stations.




Organizing Background Knowledge

Five ways of organizing knowledge for NL processing:

  1. Lexical semantics: 
    • Knowledge about words and patterns of relations.

    • Broad coverage, but not deep enough or precise enough for logical deduction.

  2. Axiomatized semantics: 
    • Formal rules or axioms intended for deduction.

    • Ranging from expert systems with a few dozen rules to the million axioms of Cyc.

  3. Statistical approaches: 
    • Empirical methods derived from large corpora.

    • Many different techniques designed for different purposes.

  4. Knowledge soup: 
    • Loosely organized, dynamically changing, often contradictory mess in the human brain.

    • Simulated with combinations of the previous three methods.

  5. Task-oriented semantic interpretation: 
    • Simplification of the knowledge soup for a single task.

    • Large amount of lexical knowledge, but only axiomatized for one task.

    • Implemented in a NL processor called Intellitex.





2. Conceptual Graphs


Conceptual Graph Display Format:

John is going to Boston by bus.


English:  John is going to Boston by bus.

Conceptual Graph Interchange Format (KIF):

[Go: *x] [Person: John *y] [City: Boston *z] [Bus: *w]
   (Agnt ?x ?y) (Dest ?x ?z) (Inst ?x ?z)

Knowledge Interchange Format (KIF):

(exists ((?x Go) (?y Person) (?z City) (?w Bus))
        (and (Name ?y John) (Name ?z Boston)
             (Agnt ?x ?y) (Dest ?x ?z) (Inst ?x ?w)))

Predicate calculus:

(∃x:Go)(∃y:Person)(∃z:City)(∃w:Bus)
   (name(y,'John') ∧ name(z,'Boston') ∧
      agnt(x,y) ∧ dest(x,z) ∧ inst(x,w))





CG with Indexicals

A little story:

At 10:17 UTC, there was a cat named Yojo and a mouse. Yojo chased the mouse. Then he caught the mouse. Then he ate the head of the mouse.

The chase with indexicals

Nested contexts based on Peirce's existential graphs, but isomorphic to Kamp's discourse representation structures.






CG with Indexicals Resolved

The chase with indexicals resolved

Nested contexts with all references resolved

Predicate calculus:

(∃s1:Situation)(pTim(s1,"10:17 UTC")
  ∧ dscr(s1,
    (∃s1,s2,s3:Situation)(∃x:Cat)(∃y:Mouse)(name(x,"Yojo")
      ∧ dscr(s2, (∃u:Chase)(agnt(u,x) ∧ thme(u,y)))
      ∧ dscr(s3, (∃v:Catch)(agnt(v,x) ∧ thme(v,y)))
      ∧ dscr(s4,
        (∃w:Eat)(∃z:Head)(agnt(w,x) ∧ ptnt(w,z) ∧ part(y,w)))
      ∧ next(s2,s3) ∧ next(s3,s4)))).





Efficient Algorithms



Graph structure supports highly efficient algorithms:






3. Knowledge Soup



Crystallizing theories out of knowledge soup






Organizing the Knowledge Soup






Pointers from Words to Theories


words → word senses → canonical graphs → theories





Canonical Graphs


A conceptual graph that represents a stereotypical pattern of concept and relation types:


This canonical graph shows the typical lexical pattern for the concept type Give.

More specialized canonical graphs can show details for concept types at any level.




Theory Revision





Four operators for belief or theory revision







Special Case with Just One Theory


words → word senses → canonical graphs → theory






Task-Oriented Semantic Interpretation


Canonical graphs derived from lexical resources

TOSI adds task-oriented canonical graphs for particular applications:






Intellitex






4. Analogies

More primitive and more general than formal logic.

Every method of logic depends on repeated analogies:

Essential for nonmonotonic reasoning (belief revision).

Essential for semantic interpretation (abduction):






VivoMind Analogy Engine

Three methods of analogy:

  1. Matching labels: 

    • Identical type labels.
    • Subtype - supertype.
    • Siblings of same supertype.
    • More distant cousins.

  2. Matching subgraphs: 

    • Match isomorphic subgraphs independent of labels.
    • Combine adjacent nodes to make them isomorphic.

  3. Matching transformations: 

    • Find transformations that map subgraphs to one another.

Methods #1 and #2 take (N log N) time.

Method #3 takes polynomial time (analogies of analogies).




An Example

Analogy of Cat to Car
CatCar
headhood
eyeheadlight
corneaglass plate
mouthfuel cap
stomachfuel tank
bowelcombustion chamber
anusexhaust pipe
skeletonchasis
heartengine
pawwheel
furpaint

VAE used methods #1 and #2.

All source data taken from WordNet.






Matching Labels and Subgraphs

Corresponding parts have similar functions:

Approximate matching (missing esophagus and muffler):

Another pair of matching subgraphs:






Mapping Different Representations

Method #3 for relating data structures that represent equivalent information.

  • A physical structure describable in different ways:

  • English description:  "A red pyramid A, a green pyramid B, and a yellow pyramid C support a blue block D, which supports an orange pyramid E."

  • A relational database would use tables.

  • But many different options for chosing tables, rows and columns, and labels for the columns.





Representation in a Relational DB






CG Derived from Relational DB






CG Derived from English

"A red pyramid A, a green pyramid B, and a yellow pyramid C support a blue block D, which supports an orange pyramid E."






The Two CGs Look Very Different






Transformations Found by VAE



Top transformation applied to 5 subgraphs.

Bottom one applied to 3 subgraphs.

One application could be due to chance, but 3 or 5 contribute strong evidence for the mapping.






5. A Large Application

An earlier version of Intellitex was applied to three languages — English, COBOL, and JCL:

Same parser but different grammars for all three languages:






Results

Job finished in 8 weeks by two programmers, Arun Majumdar and André LeClerc.






Contradiction Found by VAE

From analyzing English documentation:

From analyzing COBOL programs:

What is the reason for this contradiction?




Quick Patch in 1979

A COBOL programmer made a quick patch:

For more than 20 years:

VAE discovered the two computer "employees".




Mismatch Found by VAE

Question:  Which location determines the market?

According to the documentation:  The business unit.

According to COBOL:  The client HQ.




Canonical Graphs

Same task-oriented canonical graphs for interpreting the semantics of English, COBOL, and JCL:


   [Process]->(Uses)->[Data]

   [Process]->(Generates)->[Data]

   [Process]->(Modifies)->[Data]

With subtypes for all the kinds of processes and data, including files and records, of COBOL and JCL.




6. Comparison with Other Approaches


Four widely used approaches in AI:

TOSI is an attempt to combine the strengths of all four, but within a flexible, modular, generalizable framework.




Task-Oriented Semantic Interpreter

Broad-coverage syntax and semantics:

TOSI supplements the lexicon with canonical graphs for the specific task:






Conclusions

Conceptual graphs represent

CG tools support






For Further Reading

A textbook on knowledge representation:

An article that goes into further detail on these topics:

A guided tour of ontology, conceptual structures, and logic:

An article about the templates used for information extraction, their use in shallow parsing, and their relationship to conceptual graphs:

A philosophical analysis of the problems and issues underlying ontology and knowledge representation:







Copyright ©2002, John F. Sowa