Knowledge retrieval


Knowledge retrieval seeks to return information in a structured form, consistent with human cognitive processes as opposed to simple lists of data items. It draws on a range of fields including epistemology, cognitive psychology, cognitive neuroscience, logic and inference, machine learning and knowledge discovery, linguistics, and information technology.

Overview

In the field of retrieval systems, established approaches include:
Both approaches require a user to read and analyze often long lists of data sets or documents in order to extract meaning.
The goal of knowledge retrieval systems is to reduce the burden of those processes by improved search and representation. This improvement is needed to leverage the increasing data volumes available on the Internet.

Comparison with data and information retrieval

Data Retrieval and Information Retrieval are earlier and more basic forms of information access.
Data RetrievalInformation RetrievalKnowledge Retrieval
MatchBoolean matchpartial match, best matchpartial match, best match
Inferencedeductive inferenceinductive inferencedeductive inference, inductive inference, associative reasoning, analogical reasoning
Modeldeterministic modelstatistical and probabilistic modelsemantic model, inference model
Queryartificial languagenatural languageknowledge structure, natural language
Organizationtable, indextable, indexknowledge unit, knowledge structure
Representationnumber, rulenatural language, markup languageconcept graph, predicate logic, production rule, frame, semantic network, ontology
Storagedatabasedocument collectionsknowledge base
Retrieved Resultsdata setsections or documentsa set of knowledge unit

Knowledge retrieval focuses on the knowledge level. We need to examine how to extract, represent, and use the knowledge in data and information. Knowledge retrieval systems provide knowledge to users in a structured way. Compared to data retrieval and information retrieval, they use different inference models, retrieval methods, result organization, etc. Table 1, extending van Rijsbergen’s comparison of the difference between data retrieval
and information retrieval, summarizes the main characteristics of data retrieval, information retrieval, and knowledge retrieval. The core of data retrieval and information retrieval is retrieval subsystems. Data retrieval gets results through Boolean match. Information retrieval uses partial match and best match. Knowledge retrieval is also based on partial match and best match.
From an inference perspective, data retrieval uses deductive inference, and information retrieval uses inductive inference. Considering the limitations from the assumptions of different logics, traditional logic systems cannot reason efficiently. Associative reasoning, analogical reasoning and the idea of unifying reasoning and search may be effective methods of reasoning at the web scale.
From the retrieval perspective, knowledge retrieval systems focus on semantics and better organization of information. Data retrieval and information retrieval organize the data and documents by indexing, while knowledge retrieval organize information by indicating connections between elements in those documents.

Frameworks for knowledge retrieval systems

From computer science perspective, a logic framework concentrating on fuzziness of knowledge queries has been proposed and investigated in detail. Markup languages for knowledge reasoning and relevant strategies have been investigated, which may serve as possible logic reasoning foundations for text based knowledge retrieval.
From cognitive science perspective, especially from cognitive psychology and cognitive neuroscience perspective, the neurobiological basis for knowledge retrieval in the human brain has been investigated, and may serve as a cognitive model for knowledge retrieval.

Related disciplines

Knowledge retrieval can draw results from the following related theories and technologies:
Topics listed under each entry serve as examples and do not form a complete list. And many related disciplines should be added as the field grows mature.