CS5353: Assignment 1

From Minas

Tasks:

  1. Select 2 datasets in a given domain that you want to integrate;
  2. Encode the datasets into RDF/XML;
  3. Use an OWL editor, e.g., SWOOP or Protégé, to create a simple taxonomy based on rdfs:Class and rdfs:subClassOf. The taxonomy should define one class for the tuples of each selected RDF/XML dataset, plus a common superclass of those dataset classes;
  4. By using SPARQL in Pellet, the datasets in (2), and the taxonomy in (3), query for instances of classes that belong to both datasets;
  5. By using SPARQL in Pellet, query for instances that belong to a single dataset;
  6. Present the project to the class (each group will have 15 minutes to present and 5 minutes to answer questions).
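
As a rough illustration of tasks 2, 4 and 5, the sketch below encodes two tiny datasets as RDF/XML using only the Python standard library. Everything domain-specific in it is an assumption made for the example and not part of the assignment: the namespace http://example.org/cs5353#, the classes River, Lake and WaterBody, the property name, and the sample rows. The two SPARQL strings show one possible reading of tasks 4 and 5; they are not executed here and assume the taxonomy from task 3 declares River and Lake as subclasses of WaterBody.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
EX = "http://example.org/cs5353#"  # hypothetical namespace, chosen for this sketch

# Ask ElementTree to serialize with readable prefixes instead of ns0, ns1, ...
ET.register_namespace("rdf", RDF)
ET.register_namespace("ex", EX)


def encode_dataset(class_name, rows):
    """Return an RDF/XML string with one typed node per (id, fields) tuple."""
    root = ET.Element("{%s}RDF" % RDF)
    for row_id, fields in rows:
        # A typed node element: <ex:River rdf:about="..."> states rdf:type ex:River.
        node = ET.SubElement(
            root,
            "{%s}%s" % (EX, class_name),
            {"{%s}about" % RDF: EX + row_id},
        )
        # Each tuple field becomes a property element with a literal value.
        for prop, value in fields.items():
            ET.SubElement(node, "{%s}%s" % (EX, prop)).text = str(value)
    return ET.tostring(root, encoding="unicode")


# Hypothetical sample data for the two datasets.
rivers_xml = encode_dataset(
    "River", [("r1", {"name": "Rio Grande"}), ("r2", {"name": "Pecos"})]
)
lakes_xml = encode_dataset("Lake", [("l1", {"name": "Caddo Lake"})])

# One reading of task 4: with subclass reasoning, instances of the common
# superclass include the instances of both dataset classes.
SUPERCLASS_QUERY = """
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX ex:  <http://example.org/cs5353#>
SELECT ?x WHERE { ?x rdf:type ex:WaterBody }
"""

# One reading of task 5: instances of a single dataset class.
SINGLE_DATASET_QUERY = """
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX ex:  <http://example.org/cs5353#>
SELECT ?x WHERE { ?x rdf:type ex:River }
"""

print(rivers_xml)
print(lakes_xml)
```

The taxonomy itself is deliberately left out of the generated files, since task 3 asks for it to be built in an OWL editor.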

Deliverables:

  • The RDF/XML datasets
  • SPARQL queries and their results
  • A 4-page project report describing how the taxonomy in (3) was used to integrate the datasets in (2)
  • The PowerPoint file used during the presentation