MENTAL
 Main Menu
 Applications
 Artificial Intelligence
 Knowledge Representation


Knowledge Representation
 KNOWLEDGE
REPRESENTATION

"Knowledge is language that names and describes" (Wittgenstein).

"Science is organized knowledge" (Herbert Spencer).

"Knowledge is linguistic" (Peirce)



The Scale "Data - Information - Knowledge"

The triad "Data - Information - Knowledge" constitutes a scale of increasing semantics, from the superficial to the deep, from form to substance. Since semantics is inexpressible, there is a certain fuzzy boundary between these three concepts. Nevertheless, we can point out the main characteristics of each of them:


Data

In general, a datum is an isolated value or is the value of an attribute or predicate of a certain unknown entity. The value can be quantitative or qualitative, with or without unit (in the case of numerical value). For example: the datum "33" we do not know if it is someone's age, a temperature, a length, a speed, etc.; the datum "blue" we do not know if it is the color of something, an example of a 4-letter word, etc. And if we know the numerical value and the unit (i.e. we have a magnitude), we do not know to which entity it corresponds. For example, 1, 80 meters can be the height of a person, the width of a closet, etc.


Information

A piece of information is a piece of data (or set of data) with context, with reference, associated with a particular entity. Information = data + interpretation. For example: "Pepe is 33 years old", "the temperature in this room is 33 degrees", "my speed is 33 km/hour", etc.


Knowledge

Knowledge corresponds to the top of the semantic scale. As semantics cannot be defined, neither can knowledge be defined. It is the same as with consciousness, truth and life.

We can say that knowledge is the subjective interpretation of information. Knowledge is information (or set of information) with meaning, which is internalized, interpreted or evaluated subjectively within a context. When knowledge is made explicit, it becomes a set of interrelated information.


Knowledge representation

Knowledge representation is mainly used in so-called "expert systems". An expert system is a system that collects the knowledge of an expert (or set of experts) in a given domain. They are usually rule-based.

Some examples of expert systems are: Dendral (the first expert system for aiding in the identification of molecular structures of unknown substances), Macsyma (symbolic manipulation of mathematical expressions), Mycin (diagnosis and treatment of blood diseases), Prospector (search for geological deposits), KRM (nonlinear dynamics), ARIS (airport information resources), etc.


Knowledge vs. Information

There are many differentiating aspects between knowledge and information.
Metadata, Metainformation and Metaknowledge

A metadata is a piece of information applied to a piece of data. For example, in "dark blue", "dark" is a datum applied to the datum "blue".

A meta-data is a piece of information applied to a piece of information. For example, in "33 fruits, 12 are apples", "12 apples" is a meta-information of the information "33 fruits". And in "John has two children", a meta-information would be to say that his children are named Elena and Ivan.

A metaknowledge is knowledge that refers to other knowledge. For example, it can refer to the evaluation of knowledge, how to use the knowledge, its priority or weight, etc.


The Problem of Knowledge Representation

In the area of knowledge representation we can distinguish the following problems:
Knowledge Representation Systems

There are numerous knowledge representation systems. The following stand out:

Frames

Frames, proposed by Marvin Minsky in 1975 are the knowledge representation system that have historically had the highest level of acceptance.

Their characteristics are the following: An object (in OOP, object-oriented programming) is very similar to a frame. Objects have properties, which are internal attributes and methods. An object can share (inherit) properties (attributes and methods) from objects at a higher level. Objects communicate by means of messages. Each message causes the object to react, internally, externally, or both. Similarly, frames have associated properties, which are attributes (slots), procedures and rules.

Since Minsky's theoretical formulation, there have been several implementations of the frameworks, such as KL-ONE, KRL, OWL, and CLASSICS. KL-ONE is the most relevant. CLASSICS is a descendant of KL-ONE.

Minsky admitted that frames do not constitute a complete theory, but that frames can explain many features of human consciousness. For example, that intelligence consists of selecting in each new situation the most appropriate general frame and adapting it by changing the details. And that learning consists in the construction of new frames.


Semantic networks

A semantic network is a set of nodes and arcs. Each node represents an entity, which can be an action, an attribute, an event, a structure, a class, a frame, etc. An arc is a relationship between two nodes (entities). There are many types of relationships, among which are: A semantic network is different from a network. A network is a data structure. A semantic network is a knowledge representation system. Therefore, a semantic network has a higher semantic level than a network.

In a semantic network, the network can be hierarchical (a taxonomic hierarchy) or relational. In the hierarchical one there is a top node to which is assigned one or more child nodes, which in turn have other child nodes and so on until the end (bottom) is reached, whose nodes can be either entities or instances of entities.

The concept of inheritance is fundamental in hierarchical semantic networks. The properties of a node are based on the properties of the higher nodes in the hierarchy.

The most common type of semantic network is the IS-A network. In fact, this type is often mentioned as a synonym for semantic network. A "IS-A" network is a taxonomic hierarchy consisting of a system of hierarchy and inheritance links between nodes. Classical natural taxonomies are a good example: a dog is a canid, a canid is a mammal, a mammal is an animal.

Semantic networks "IS-A" are very flexible, but AI researchers have highlighted some major problems and drawbacks, including the following:
Frames vs. semantic networks

There are overlaps and differences between semantic networks and frames: In general, the frame-based network has received the most attention, both theoretically (cognitive science and linguistics) and practically, because of its flexibility and possibilities.


Ontologies

An ontology is a set of concepts and relationships between those concepts. Those relationships include system consistency constraints. For example, in the block world, the concepts are "block" and "ground", and the relationship "on". A block is on another block or on the ground. A block cannot be on itself.

Currently, knowledge representation has been reoriented towards the broad domain of ontologies. Most ontology languages are declarative, and are based, to a greater or lesser extent, on frames or on first-order predicate logic.

New developments in languages and knowledge representation systems with ontologies have been reoriented towards the Web. They make use of XML, XML Schema, RDF, RDF Schema standards, as well as Web ontology languages such as Web Ontology Language.
Production Systems

A production system is based on the logic programming paradigm. It is composed of a fact base (specific knowledge) and a rule base (generic knowledge) of the type "condition → action".

To infer new knowledge by rules there are two mechanisms:
  1. Forward inference (forward chaining). It is the generalized modus ponens: from facts and rules, new facts are obtained, and from deduced facts new facts are obtained, and so on until all possibilities of inference are exhausted.

  2. Backwards inference (backward chaining). It is the inverse process to the previous one. It goes from a possible fact to its validation by existing rules.
Advantages: The rules are independent of each other, are simple and are easily updated.

Disadvantages: Many rules are required, it needs an inference engine, and it does not allow inheritance or sharing in general.

Prolog is the best known logic programming language.


MENTAL, a Universal Language for Knowledge Representation

MENTAL, as a language for knowledge representation, solves the problem posed above:
Examples

The "IS-A" (x is y) relation of semantic networks is a binary subject-predicate type relation and is simply expressed as < i>x/y, where x is the subject, and y is the predicate. Examples: The relation "HAVE-A" (x has property y) of semantic networks is a ternary relation of the form x/(property/y). Examples:

Addenda

MENTAL, a language Tertium Comparationis for translation

"Tertium Comparationis" means in Latin "the third (part) of the comparison". It is the quality that two things have in common when compared. MENTAL can be used as an intermediate language to represent the common structure between different languages, for the process of translation (especially automatic) of a text in one language A into another language B.

Normally the translation of a text from one language to another is done from the surface level. With MENTAL the text is passed from the language A to a deep level (the MENTAL code), to then "emerge" in another language B. The deep level of MENTAL, together with its flexibility and power, is the one that can best reflect and represent the knowledge associated with a text.


Bibliography