Kieliteknologian oppiaine

Semantic Web for Language Technology: Ctl310semw 2004k

- KIT-verkosto | Yleisen kielitieteen laitos| Kieliteknologian opetus| Helsingin yliopisto -
Kurssin sivut
Kurssin kalenteri
Kurssin materiaali
Luennoitsija
Kurssin kuvaus

Kurssin materiaali

On-line Resources.

1. The Semantic Web.

2. XML, metadata and ontologies.

3. RDF: Resource Description Framework.

4. Jena: RDF in Java. More RDF: RDF/XML.

5. RDF in SQL databases.

6. RDF query languages: RDQL.

7. RDF Schema (RDFS).

8. Ontologies. Protégé and RDFS.

9. Building an ontology.

10. Ontology languages: RDFS, DAML+OIL, OWL.

  • Lecture notes
    • Tutorial on OWL (ISWC2003) by Sean Bechofer, Ian Horrocks, Peter Patel-Schneider.
  • Further reading
  • Practical work with Protégé and OWL
    • Copy the shell script semw-protege-beta to your directory and make it executable.
      Start Protégé by: venus$ semw-protege-beta &
    • Import(*) the example koala ontology and work through the Protégé OWL Tutorial.
      (*)How to do imports. "Import from format" only handles CLIPS format, not OWL.
      The correct way to import an ontology (e.g. koala.owl) is to create a new ontology (e.g. mykoala.owl) and use the Metadata tab in Protégé to specify the ontology to be imported. This adds an <owl:imports> statement to the new ontology. See OWL Plugin FAQs and Protégé+OWL+Imports.
      There is also an incorrect way that can be done by the following 3 steps:
      1. Start a new project (Project -> New). Choose "OWL Files" in "Select Format".
      2. Save the project immediately with the required name (e.g. koala.pprj).
      This also creates a new "empty" OWL file (koala.owl). Close the project.
      3. Outside Protégé, overwrite the "empty" OWL file (koala.owl) with the "real" OWL file (koala.owl) from the existing ontology. Then reopen the project in Protégé.

11. OWL: Web Ontology Language.

12. Jena and OWL: Jena ontology API.

13. Linguistic ontologies: WordNet, OntoWordNet, FrameNet.

14. Semantic Web information extraction.

  • Lecture notes
  • Further reading
  • Practical work with GATE
    • Study GATE's DAML+OIL Exporter. Copy the DAML+OIL ontology to your directory.
    • Start GATE by: venus$ semw-gate & and repeat the first week's practical work.
      Make a corpus containing Shakespeare's sonnet130.html and run ANNIE on it.
    • Load the exporter (Processing resources -> New -> DAML+OIL Exporter).
      Specify the ontology (file:Exporter.daml) and leave exportFilePath blank.
    • Create a new corpus pipeline (Applications -> New -> Corpus Pipeline).
      Select the DAML Exporter as the only processing resource, and run it (-> Run).
      A mini-ontology sonnet130.html.daml is automatically populated with Persons.
  • Assignment 6: Semantic Web information extraction

© Graham Wilcock 2004.