First International Workshop on Consuming Linked Data (COLD2010)

November 8,2010
Shanghai, China

Abstract

The quantity of published Linked Data is increasing dramatically. However, applications that consume Linked Data are not yet widespread. Current approaches lack methods for seamless integration of Linked Data from multiple sources, dynamic discovery of available data and data sources, provenance and information quality assessment, application development environments, and appropriate end user interfaces. Addressing these issues requires well-founded research, including the development and investigation of concepts that can be applied in systems which consume Linked Data from the Web. The workshop on consuming Linked Data (COLD) aims to provide a platform for discussion and work on these open research problems. The main objective is to provide a venue for scientific discourse -- including systematic analysis and rigorous evaluation -- of concepts, algorithms and approaches for consuming Linked Data.

News

Objectives

The term Linked Data refers to a practice for publishing and interlinking structured data on the Web. Since the practice has been proposed in 2006, a grass-roots movement has started to publish and to interlink multiple open databases on the Web following the Linked Data principles. Due to conference workshops, tutorials, and general evangelism an increasing number of data publishers such as the BBC, Thomson Reuters, The New York Times, the Library of Congress, and the UK and US governments have adopted Linked Data principles. The ongoing effort resulted in bootstrapping the Web of Data which, today, comprises billions of RDF triples including millions of links between data sources. The published datasets include data about books, movies, music, radio and television programs, reviews, scientific publications, genes, proteins, medicine, and clinical trials, geographic locations, people, companies, statistical and census data, etc.

Access to Linked Data presents exciting opportunities for the next generation of Web-based applications: data from different providers can be aggregated and fragmentary information from multiple sources can be integrated to achieve a more comprehensive view. While a few applications, such as the BBC music guide have used Linked Data to significant benefit, the deployment methodology has been to harvest the data of interest from the Web to create a private, disconnected repository for each specific application. Such an approach can only be the beginning; new concepts to consume Linked Data are required in order to exploit the Web of Linked Data to its full potential. The concepts, patterns and tools necessary are very different from situations when resource identifiers are local or known a-priori, whole-repository queries are possible, access to the repository is reliable and relevant data sources are known to be trustworthy.

Several open issues that make the development of Linked Data based applications a challenging or still impossible task. These issues include the lack of approaches for seamless integration of Linked Data from multiple sources, for dynamic, on-the-fly discovery of available data, for information quality assessment, and for elaborate end user interfaces. These open issues can only be addressed appropriately when they are conceived as research problems that require the development and systematic investigation of novel approaches. The First International Workshop on Linked Data (COLD) aims to provide a platform for the presentation and discussion of such approaches. Our main objective is to receive submissions that present scientific discussion (including systematic evaluation) of concepts and approaches, instead of exposition of features implemented in Linked Data based applications. For practical systems without formalization or evaluation we refer interested participants to other offerings at ISWC, such as the Semantic Web Challenge or the Demo Track. As such, we see our workshop as orthogonal to these events.

Tentative Program

The exact timing and the length of the slots may still change. So please check again shortly before the workshop.

Topics of Interest

Relevant topics for COLD 2010 include but are not limited to:

  • Web scale data management (indexing, crawling, etc.)
  • Query processing over multiple linked datasets
  • Search in the Web of Data
  • Auto-discovery
    • of URIs,
    • of additional data that is not from the authoritative source of a URI,
    • of relevant linked datasets in general
  • Caching and replication
  • Dataset dynamics
    • processing change notifications,
    • keeping consistency,
    • temporal tracking of linked datasets
  • Reasoning on Linked Data from multiple sources
  • Knowledge discovery deriving insights from the Web of Data
  • Information quality of Linked Data
    • information quality assessment,
    • trustworthiness,
    • provenance
  • User interface research for the interaction with the Web of Data
    • user interaction and usability,
    • visualizing Linked Data,
    • natural language interfaces

Important Dates

  • Paper submission deadline: September 3, 2010, 23.59 Hawaii time (extended from August 27, 2010)
  • Acceptance notification: September 24, 2010
  • Camera-ready versions of accepted papers: October 7, 2010
  • Workshop date: November 8, 2010

Submissions

We seek full technical research papers only with a length of up to 12 pages. Paper submissions must be formatted in the style of the Springer Publications format for Lecture Notes in Computer Science (LNCS).

Please submit your paper via EasyChair at http://www.easychair.org/conferences/?conf=cold2010

Submissions that do not comply with the formatting of LNCS or that exceed the 12 page limit will be rejected without review.

We note that the author list does not need to be anonymized, as we do not have a double-blind review process in place.

Submissions will be peer reviewed by three independent reviewers. Accepted papers have to be presented at the workshop proceedings.

Proceedings

Proceedings will be published online at CEUR-WS.

Workshop Organization

The workshop will be co-located with the 9th International Semantic Web Conference (ISWC) in Shanghai, China, and will be held on November 8, 2010.

The workshop will also consist of:

  • Opening session: This will permit introduction of the workshop topics, goals, participants, and expected outcomes.
  • Keynote speaker: Chris Welty (IBM) will speak about "Linked Data in Watson: Using Linked Data as Evidence for Natural Language Processing."
  • Research Track: Accepted research papers will be presented at the workshop.
  • Communication: Networked communication will be encouraged during the workshop using IRC, microblogging and other services, provided with the official hashtag (#cold2010) to follow the live-stream of the event.

Organizing Committee

Steering Committee

The role of the steering committee is to offer help and advice to the workshop organizing committee in matters of the overall technical organization. Members of the steering committee are:

Programme Committee

Contact

For further information about the workshop, please contact the workshops chairs at cold.org.ws@googlemail.com

Contains RDFa