Resource Description Framework

The Resource Description Framework is a family of World Wide Web Consortium specifications originally designed as a metadata data model. It has come to be used as a general method for conceptual description or modeling of information that is implemented in web resources, using a variety of syntax notations and data serialization formats. It is also used in knowledge management applications.
RDF was adopted as a W3C recommendation in 1999. The RDF 1.0 specification was published in 2004, the RDF 1.1 specification in 2014.

Overview

The RDF data model is similar to classical conceptual modeling approaches. It is based on the idea of making statements about resources in expressions of the form subject–predicate–object, known as triples. The subject denotes the resource, and the predicate denotes traits or aspects of the resource, and expresses a relationship between the subject and the object.
For example, one way to represent the notion "The sky has the color blue" in RDF is as the triple: a subject denoting "the sky", a predicate denoting "has the color", and an object denoting "blue". Therefore, RDF uses subject instead of object in contrast to the typical approach of an entity–attribute–value model in object-oriented design: entity, attribute, and value.
RDF is an abstract model with several serialization formats, so the particular encoding for resources or triples varies from format to format.
This mechanism for describing resources is a major component in the W3C's Semantic Web activity: an evolutionary stage of the World Wide Web in which automated software can store, exchange, and use machine-readable information distributed throughout the Web, in turn enabling users to deal with the information with greater efficiency and certainty. RDF's simple data model and ability to model disparate, abstract concepts has also led to its increasing use in knowledge management applications unrelated to Semantic Web activity.
A collection of RDF statements intrinsically represents a labeled, directed multi-graph. This in theory makes an RDF data model better suited to certain kinds of knowledge representation than are other relational or ontological models. However, in practice, RDF data is often stored in relational database or native representations.
As RDFS and OWL demonstrate, one can build additional ontology languages upon RDF.

History

The initial RDF design, intended to "build a vendor-neutral and operating system-independent system of metadata," derived from the W3C's Platform for Internet Content Selection, an early web content labelling system, but the project was also shaped by ideas from Dublin Core, and from the Meta Content Framework, which had been developed during 1995–1997 by Ramanathan V. Guha at Apple and Tim Bray at Netscape.
A first public draft of RDF appeared in October 1997, issued by a W3C working group that included representatives from IBM, Microsoft, Netscape, Nokia, Reuters, SoftQuad, and the University of Michigan.
In 1999, the W3C published the first recommended RDF specification, the Model and Syntax Specification. This described RDF's data model and an XML serialization.
Two persistent misunderstandings about RDF developed at this time: firstly, due to the MCF influence and the RDF "Resource Description" initialism, the idea that RDF was specifically for use in representing metadata; secondly that RDF was an XML format rather than a data model, and only the RDF/XML serialisation being XML-based. RDF saw little take-up in this period, but there was significant work done in Bristol, around ILRT at Bristol University and HP Labs, and in Boston at MIT. RSS 1.0 and FOAF became exemplar applications for RDF in this period.
The recommendation of 1999 was replaced in 2004 by a set of six specifications: "The RDF Primer", "RDF Concepts and Abstract", "RDF/XML Syntax Specification ", "RDF Semantics", "RDF Vocabulary Description Language 1.0", and "The RDF Test Cases".
This series was superseded in 2014 by the following six "RDF 1.1" documents: "RDF 1.1 Primer," "RDF 1.1 Concepts and Abstract Syntax," "RDF 1.1 XML Syntax," "RDF 1.1 Semantics," "RDF Schema 1.1," and "RDF 1.1 Test Cases".

RDF topics

Vocabulary

The vocabulary defined by the RDF specification is as follows:

Classes

rdf

rdf:XMLLiteral – the class of XML literal values
rdf:Property – the class of properties
rdf:Statement – the class of RDF statements
rdf:Alt, rdf:Bag, rdf:Seq – containers of alternatives, unordered containers, and ordered containers
rdf:List – the class of RDF Lists
rdf:nil – an instance of rdf:List representing the empty list
rdfs
rdfs:Resource – the class resource, everything
rdfs:Literal – the class of literal values, e.g. strings and integers
rdfs:Class – the class of classes
rdfs:Datatype – the class of RDF datatypes
rdfs:Container – the class of RDF containers
rdfs:ContainerMembershipProperty – the class of container membership properties, rdf:_1, rdf:_2,..., all of which are sub-properties of rdfs:member
Properties

rdf

rdf:type – an instance of rdf:Property used to state that a resource is an instance of a class
rdf:first – the first item in the subject RDF list
rdf:rest – the rest of the subject RDF list after rdf:first
rdf:value – idiomatic property used for structured values
rdf:subject – the subject of the RDF statement
rdf:predicate – the predicate of the RDF statement
rdf:object – the object of the RDF statement

rdf:Statement, rdf:subject, rdf:predicate, rdf:object are used for reification.

rdfs

rdfs:subClassOf – the subject is a subclass of a class
rdfs:subPropertyOf – the subject is a subproperty of a property
rdfs:domain – a domain of the subject property
rdfs:range – a range of the subject property
rdfs:label – a human-readable name for the subject
rdfs:comment – a description of the subject resource
rdfs:member – a member of the subject resource
rdfs:seeAlso – further information about the subject resource
rdfs:isDefinedBy – the definition of the subject resource

This vocabulary is used as a foundation for RDF Schema, where it is extended.

Serialization formats

Several common serialization formats are in use, including:

Turtle, a compact, human-friendly format.
N-Triples, a very simple, easy-to-parse, line-based format that is not as compact as Turtle.
N-Quads, a superset of N-Triples, for serializing multiple RDF graphs.
JSON-LD, a JSON-based serialization.
N3 or Notation3, a non-standard serialization that is very similar to Turtle, but has some additional features, such as the ability to define inference rules.
RDF/XML, an XML-based syntax that was the first standard format for serializing RDF.
RDF/JSON, an alternative syntax for expressing RDF triples using a simple JSON notation.

RDF/XML is sometimes misleadingly called simply RDF because it was introduced among the other W3C specifications defining RDF and it was historically the first W3C standard RDF serialization format. However, it is important to distinguish the RDF/XML format from the abstract RDF model itself. Although the RDF/XML format is still in use, other RDF serializations are now preferred by many RDF users, both because they are more human-friendly, and because some RDF graphs are not representable in RDF/XML due to restrictions on the syntax of XML QNames.
With a little effort, virtually any arbitrary XML may also be interpreted as RDF using GRDDL, Gleaning Resource Descriptions from Dialects of Languages.
RDF triples may be stored in a type of database called a triplestore.

Resource identification

The subject of an RDF statement is either a uniform resource identifier or a blank node, both of which denote resources. Resources indicated by blank nodes are called anonymous resources. They are not directly identifiable from the RDF statement. The predicate is a URI which also indicates a resource, representing a relationship. The object is a URI, blank node or a Unicode string literal.
As of RDF 1.1 resources are identified by IRI's. IRI is a generalization of URI.
In Semantic Web applications, and in relatively popular applications of RDF like RSS and FOAF, resources tend to be represented by URIs that intentionally denote, and can be used to access, actual data on the World Wide Web. But RDF, in general, is not limited to the description of Internet-based resources. In fact, the URI that names a resource does not have to be dereferenceable at all. For example, a URI that begins with "http:" and is used as the subject of an RDF statement does not necessarily have to represent a resource that is accessible via HTTP, nor does it need to represent a tangible, network-accessible resource — such a URI could represent absolutely anything. However, there is broad agreement that a bare URI which returns a 300-level coded response when used in an HTTP GET request should be treated as denoting the internet resource that it succeeds in accessing.
Therefore, producers and consumers of RDF statements must agree on the semantics of resource identifiers. Such agreement is not inherent to RDF itself, although there are some controlled vocabularies in common use, such as Dublin Core Metadata, which is partially mapped to a URI space for use in RDF. The intent of publishing RDF-based ontologies on the Web is often to establish, or circumscribe, the intended meanings of the resource identifiers used to express data in RDF. For example, the URI:

http://www.w3.org/TR/2004/REC-owl-guide-20040210/wine#Merlot

is intended by its owners to refer to the class of all Merlot red wines by vintner, a definition which is expressed by the OWL ontology — itself an RDF document — in which it occurs. Without careful analysis of the definition, one might erroneously conclude that an instance of the above URI was something physical, instead of a type of wine.
Note that this is not a 'bare' resource identifier, but is rather a URI reference, containing the '#' character and ending with a fragment identifier.

Statement reification and context

The body of knowledge modeled by a collection of statements may be subjected to reification, in which each statement is assigned a URI and treated as a resource about which additional statements can be made, as in "Jane says that John is the author of document X". Reification is sometimes important in order to deduce a level of confidence or degree of usefulness for each statement.
In a reified RDF database, each original statement, being a resource, itself, most likely has at least three additional statements made about it: one to assert that its subject is some resource, one to assert that its predicate is some resource, and one to assert that its object is some resource or literal. More statements about the original statement may also exist, depending on the application's needs.
Borrowing from concepts available in logic, some RDF model implementations acknowledge that it is sometimes useful to group statements according to different criteria, called situations, contexts, or scopes, as discussed in articles by RDF specification co-editor Graham Klyne. For example, a statement can be associated with a context, named by a URI, in order to assert an "is true in" relationship. As another example, it is sometimes convenient to group statements by their source, which can be identified by a URI, such as the URI of a particular RDF/XML document. Then, when updates are made to the source, corresponding statements can be changed in the model, as well.
Implementation of scopes does not necessarily require fully reified statements. Some implementations allow a single scope identifier to be associated with a statement that has not been assigned a URI, itself. Likewise named graphs in which a set of triples is named by a URI can represent context without the need to reify the triples.

Query and inference languages

The predominant query language for RDF graphs is SPARQL. SPARQL is an SQL-like language, and a recommendation of the W3C as of January 15, 2008.
The following is an example of a SPARQL query to show country capitals in Africa, using a fictional ontology:

PREFIX ex:
SELECT ?capital ?country
WHERE

Other non-standard ways to query RDF graphs include:

RDQL, precursor to SPARQL, SQL-like
Versa, compact syntax, solely implemented in 4Suite.
RQL, one of the first declarative languages for uniformly querying RDF schemas and resource descriptions, implemented in RDFSuite.
SeRQL, part of Sesame
XUL has a template element in which to declare rules for matching data in RDF. XUL uses RDF extensively for databinding.
Validation and description

There are several proposals to validate and describe RDF:

SPARQL Inferencing Notation was based on SPARQL queries. It has been effectively deprecated in favor of SHACL.
SHACL is expresses constraints on RDF Graphs. SHACL is divided in two parts: SHACL Core and SHACL-SPARQL. SHACL Core consists of a list of built-in constraints such as cardinality, range of values and many others. SHACL-SPARQL consists of all features of SHACL Core plus the advanced features of SPARQL-based constraints and an extension mechanism to declare new constraint components.
ShEx is a concise language for RDF validation and description.
Examples

Example 1: Description of a person named Eric Miller

The following example is taken from the W3C website describing a resource with statements "there is a Person identified by http://www.w3.org/People/EM/contact#me, whose name is Eric Miller, whose email address is e.miller123example, and whose title is Dr."
The resource "http://www.w3.org/People/EM/contact#me" is the subject.
The objects are:

"Eric Miller",
mailto:e.miller123example, and
"Dr.".

The subject is a URI.
The predicates also have URIs. For example, the URI for each predicate:

"whose name is" is http://www.w3.org/2000/10/swap/pim/contact#fullName,
"whose email address is" is http://www.w3.org/2000/10/swap/pim/contact#mailbox,
"whose title is" is http://www.w3.org/2000/10/swap/pim/contact#personalTitle.

In addition, the subject has a type, which is person.
Therefore, the following "subject, predicate, object" RDF triples can be expressed:

http://www.w3.org/People/EM/contact#me, http://www.w3.org/2000/10/swap/pim/contact#fullName, "Eric Miller"
http://www.w3.org/People/EM/contact#me, http://www.w3.org/2000/10/swap/pim/contact#mailbox, mailto:e.miller123example
http://www.w3.org/People/EM/contact#me, http://www.w3.org/2000/10/swap/pim/contact#personalTitle, "Dr."
http://www.w3.org/People/EM/contact#me, http://www.w3.org/1999/02/22-rdf-syntax-ns#type, http://www.w3.org/2000/10/swap/pim/contact#Person

In standard N-Triples format, this RDF can be written as:

"Eric Miller".
.
"Dr.".
.

Equivalently, it can be written in standard Turtle format as:

@prefix eric: .
@prefix contact: .
@prefix rdf: .
eric:me contact:fullName "Eric Miller".
eric:me contact:mailbox .
eric:me contact:personalTitle "Dr.".
eric:me rdf:type contact:Person.

Or, it can be written in RDF/XML format as:

Eric Miller

Dr.

Example 2: The postal abbreviation for New York

Certain concepts in RDF are taken from logic and linguistics, where subject-predicate and subject-predicate-object structures have meanings similar to, yet distinct from, the uses of those terms in RDF. This example demonstrates:
In the English language statement 'New York has the postal abbreviation NY' , 'New York' would be the subject, 'has the postal abbreviation' the predicate and 'NY' the object.
Encoded as an RDF triple, the subject and predicate would have to be resources named by URIs. The object could be a resource or literal element. For example, in the N-Triples form of RDF, the statement might look like:

"NY".

In this example, "urn:x-states:New%20York" is the URI for a resource that denotes the US state New York, "http://purl.org/dc/terms/alternative" is the URI for a predicate, and "NY" is a literal string. Note that the URIs chosen here are not standard, and don't need to be, as long as their meaning is known to whatever is reading them.

Example 3: A Wikipedia article about Tony Benn

In a like manner, given that "http://en.wikipedia.org/wiki/Tony_Benn" identifies a particular resource, to say that the title of this resource is "Tony Benn" and its publisher is "Wikipedia" would be two assertions that could be expressed as valid RDF statements. In the N-Triples form of RDF, these statements might look like the following:

"Tony Benn".
"Wikipedia".

To an English-speaking person, the same information could be represented simply as:

The title of this resource, which is published by Wikipedia, is 'Tony Benn'

However, RDF puts the information in a formal way that a machine can understand. The purpose of RDF is to provide an encoding and interpretation mechanism so that resources can be described in a way that particular software can understand it; in other words, so that software can access and use information that it otherwise couldn't use.
Both versions of the statements above are wordy because one requirement for an RDF resource is that it be unique. The subject resource must be unique in an attempt to pinpoint the exact resource being described. The predicate needs to be unique in order to reduce the chance that the idea of Title or Publisher will be ambiguous to software working with the description. If the software recognizes http://purl.org/dc/elements/1.1/title, it will also know that this title is different from a land title or an honorary title or just the letters t-i-t-l-e put together.
The following example, written in Turtle, shows how such simple claims can be elaborated on, by combining multiple RDF vocabularies. Here, we note that the primary topic of the Wikipedia page is a "Person" whose name is "Tony Benn":

@prefix rdf: .
@prefix foaf: .
@prefix dc: .

dc:publisher "Wikipedia" ;
dc:title "Tony Benn" ;
foaf:primaryTopic .

Applications

DBpedia – Extracts facts from Wikipedia articles and publishes them as RDF data.
YAGO – Similar to DBpedia extracts facts from Wikipedia articles and publishes them as RDF data.
Wikidata – Collaboratively edited knowledge base hosted by the Wikimedia Foundation.
Creative Commons – Uses RDF to embed license information in web pages and mp3 files.
FOAF – designed to describe people, their interests and interconnections.
Haystack client – Semantic web browser from MIT CS & AI lab.
IDEAS Group – developing a formal 4D ontology for Enterprise Architecture using RDF as the encoding.
Microsoft shipped a product, Connected Services Framework, which provides RDF-based Profile Management capabilities.
MusicBrainz – Publishes information about Music Albums.
NEPOMUK, an open-source software specification for a Social Semantic desktop uses RDF as a storage format for collected metadata. NEPOMUK is mostly known because of its integration into the KDE SC 4 desktop environment.
Cochrane is a global publisher of clinical study meta-analyses in evidence based healthcare. They use an ontology driven data architecture to semantically annotate their published reviews with RDF based structured data.
RDF Site Summary – one of several "RSS" languages for publishing information about updates made to a web page; it is often used for disseminating news article summaries and sharing weblog content.
Simple Knowledge Organization System – a KR representation intended to support vocabulary/thesaurus applications
SIOC – designed to describe online communities and to create connections between Internet-based discussions from message boards, weblogs and mailing lists.
Smart-M3 – provides an infrastructure for using RDF and specifically uses the ontology agnostic nature of RDF to enable heterogeneous mashing-up of information
LV2 - a libre plugin format using Turtle to describe API/ABI capabilities and properties

Some uses of RDF include research into social networking. It will also help people in business fields understand better their relationships with members of industries that could be of use for product placement. It will also help scientists understand how people are connected to one another.
RDF is being used to have a better understanding of road traffic patterns. This is because the information regarding traffic patterns is on different websites, and RDF is used to integrate information from different sources on the web. Before, the common methodology was using keyword searching, but this method is problematic because it does not consider synonyms. This is why ontologies are useful in this situation. But one of the issues that comes up when trying to efficiently study traffic is that to fully understand traffic, concepts related to people, streets, and roads must be well understood. Since these are human concepts, they require the addition of fuzzy logic. This is because values that are useful when describing roads, like slipperiness, are not precise concepts and cannot be measured. This would imply that the best solution would incorporate both fuzzy logic and ontology.

Citations

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...

Resource Description Framework

Overview

History

RDF topics

Vocabulary

Classes

rdf

rdfs

Properties

rdf

rdfs

Serialization formats

Resource identification

Statement reification and context

Query and inference languages

Validation and description

Examples

Example 1: Description of a person named Eric Miller

Example 2: The postal abbreviation for New York

Example 3: A Wikipedia article about Tony Benn

Applications

Citations