rNews 1: Introduction to rNews

rNews: Embedded Metadata For The News Industry

rNews is a data model for embedding machine-readable publishing metadata in web documents and a set of suggested implementations. In this document, we'll provide an overview rNews and an implementation guide. We'll get started by reviewing the class diagram of the rNews data model. Following that we'll review each individual class. After that we will use rNews to annotate a sample news document. We will conclude with a guide for implementors of rNews.

A Sample Story

Of course, it would be unconscionably boring of us to simply rattle off the technical specification of each class. To stave off boredom we're introducing a real-world example: a real New York Times story appearing in a make-believe newspaper called "The IPTC Herald Tribune."

Figure 1: Expert from our sample story.
Our sample article is headlined "Allies Are Split on Goal and Exit Strategy in Libya," and we'll be using it to show how to apply the rNews data model to real-world data. But before that, we need to review the data model itself.

 

The rNews Data Model

The typical news-focused website presents news in two major formats articles and media. Articles are composed of many attributes including dateline, body text, and word count. Media items come in many types such as audio, video, and image and are composed of attributes such as height, width and encoding. Articles and media items also share a number of common attributes including title, creation date, and copyright notice. The rNews data model, shown below, is built around these observations. This diagram shows the complete class diagram of the rNews data model.

rNews diagram
Figure 2: The complete class diagram of the rNews data model.

As you can see, the rNews data model is centered around two classes NewsItem class and the Concept class. We'll start by describing the NewsItem class. This class models attributes common to both articles and media items. The NewsItem class is extended by four subclasses Article, ImageObject, VideoObject and AudioObject. Each of these classes models attributes particular to each NewsItem subtype. An Article object may have one or more associatedMedia objects of type ImageObject, VideoObject and AudioObject. A NewsItem may be associated with one or more UserComments objects.

The other central class is the Concept class. Below this class is shown below in greater detail.

Figure: Relations between the Concept class and its subclasses.

The Concept class is used to model abstract concepts. It is extended by subclasses that model specific types of real-world concepts. The Place class in conjunction with the GeoCoordinates class extends the Concept class with geospatial attributes. The Person and the Organization classes model attributes particular to each entity type. Lastly the Place, Person and Organization classes may be associated with a PostalAddress.

A number of relations exist between the NewsItem class and the Concept class and its subclasses. An NewsItem may be about or it may mention a Concept object. Additionally a NewsItem may have several relations to both Person and Organization objects.

  • If a Person or an Organization is primarily responsible for the creation of a NewsItem, then a creator relation exists between the two objects.
  • If a Person or an Organization has substantially but not primarily contributed to the creation of a NewsItem, then a contributor relation exists between the two objects.
  • If a Person or an Organization is responsible for the editing of a NewsItem, then a editor relation exists between the two objects.
  • If a Person or an Organization is primarily responsible for the distribution of a NewsItem, then a provider relation exists between the two objects.
  • If a Person or an Organization is the copyright holder responsible for the creation of a NewsItem, then a copyrightHolder relation exists between the two objects.
  • If an Organization is the entity on whose behalf the creator of the NewsItem was working, then a sourceOrganization relation exists between the two objects.
  • If a Person is legally accountable responsible for a NewsItem, then a accountablePerson relation exists between the two objects.

 

The difference between an Organization that is the provider of a NewsItem and an Organization that is the sourceOrganization for a NewsItem may be confusing, so lets consider an example. Suppose The Boston Globe writes a general interest article about an impending snowstorm and the Associated Press distributes the article. Then this article's sourceOrganization is "Boston Globe" but its provider is "The Associated Press."

Like NewsItem objects, UserComments objects may be associated with a creator, but this creator may only be of type Person

The Table Below summarizes all class relations in rNews.

 

Subject ClassVerbObject ClassDefinition
NewsItem about Concept Indicates that the NewsItem is specifically about a concept.
NewsItem accountablePerson Person Specifies the person that is legally accountable for the NewsItem.
NewsItem comment UserComments Comments, typically from users, on this NewsItem.
NewsItem contributor Person | Organization A secondary contributor to the NewsItem
NewsItem copyrightHolder Person | Organization The party holding the legal copyright to the NewsItem.
NewsItem creator Person | Organization The author of the NewsItem.
NewsItem editor Person | Organization Specifies the person or organization who edited the NewsItem
NewsItem mentions Concept Indicates that the NewsItem. contains a reference to, but is not necessarily about a concept.
NewsItem provider Person | Organization Specifies the person or organization that distributed the NewsItem.
NewsItem sourceOrganization Organization The organization on whose behalf the creator of the NewsItem was working.
Article associatedMedia "ImageObject |
AudioObject |
VideoObject"
An image or audio or video object associated with an Article.
ImageObject associatedArticle Article An Article associated with the Media Object
ImageObject associatedMedia "ImageObject |
AudioObject |
VideoObject"
An image or audio or video object associated with an Article.
AudioObject associatedArticle Article An Article associated with the Media Object
AudioObject associatedMedia "ImageObject |
AudioObject |
VideoObject"
An image or audio or video object associated with an Article.
VideoObject associatedArticle Article An Article associated with the Media Object
VideoObject associatedMedia "ImageObject |
AudioObject |
VideoObject"
An image or audio or video object associated with an Article.
UserComments creator Person The creator of the comment.
UserComments discusses NewsItem Specifies the NewsItem associated with this Comment.
Place address PostalAddress A real-world postal address associated with this entity.
Place geoCoordinates GeoCoordinates The geo coordinates of the location.
Person address PostalAddress A real-world postal address associated with this entity.
Organization address PostalAddress A real-world postal address associated with this entity.

 

Want to comment on rNews: we invite you to post your comment to the rNews Forum.