Objectives
Semantic Information Management (SIM) is of paramount importance as it offers many applications ranging from semantic search to semantic advertising to semantic publishing. Concepts such as semantic networks, social semantic web, linked data, ontologies, schemas, semantic annotations and vocabularies have been drawing attention in information management field. Similarly, information professionals have to be conversant with such acronyms as XML, RDF, OWL, and SKOS in their daily work.
This course aims to make students familiar with the basic concepts of Semantic Web and its supporting technologies (ie, XML and RDF), schemas and ontologies (ie, RDFS and OWL), and semantic annotations and vocabularies (ie, RDFa, DC, SKOS) along with their use in information management applications.
Learning Outcomes
Students will:
- understand the underlying ideas and principles of Semantic Web and its layered architecture;
- become familiar with its main technologies including ontologies and semantic query languages (eg, SPARQL); and
- learn how to apply these technologies to develop information management applications.
Grading
Attendance will be strictly observed. You are expected to actively participate the class discussions and come prepared to the classroom by reading the assigned articles and chapters. Reading assignments will be listed in the weekly class schedule in the course web site and/or course blog. Students are expected to complete the assignments. Students who took programming and web design courses are strongly encouraged to take this course.
Evaluation
Required and Recommended Readings
- De Virgilio et al. (eds.) Semantic Web Information Management: A Model-Based Perspective.
(The full text of this book is available in PDF through the library. Note that you should use proxy go get access to the book from outside the campus.) - Berners-Lee, T., Hendler, J. and Lassila, O. (2001). The Semantic Web. Scientific American, 284(5), 34-43. (the first paper on Semantic Web concept)
- Shadbolt, N., Hall, W. & Berners-Lee, T. (2006 May). The Semantic Web revisited. IEEE Intelligent Systems, 21(3): 96-101.
- Özdemir, S. (2012 Eylül). Günümüz teknolojileri ne kadar “akıllı”? Bilişim Dergisi, no. 146, s. 100-107.(Please take a look at other articles in this special dossier of Bilişim Dergisi on Semantic Web).
- Akçapınar Sezer, E. (2012 Eylül). Anlamsal Web’in ontolojilerle hayata geçmesi, arama/sorgulamada köklü değişiklikler yaratacak. Bilişim Dergisi, no. 146, s. 112-115.
- Singhal, Amit (May 16, 2012). “Introducing the Knowledge Graph: Things, Not Strings“. Official Blog (of Google). Retrieved February 06, 2016. (watch the 2’44’’ video on how Google’s Knowledge Graph works).
- Singhal, Amit (2013). Google I/O 2013 keynote. (starts from 1:51:01; watch till 2:10:48).
- Warren, P.W. & Davies, N.J. (2007). Managing the risks from information —through semantic information management. BT Technology Journal, 25(1): 178-191. (filename: semantikbilgiyonetimi.pdf) Should be essential reading; a bit technical).
- Brooks, T.A. (2002). The Semantic Web, universalist ambition and some lessons from librarianship. Information Research, 7(4).
- Yadagiri, N. & Ramesh, P. The Semantic Web and the libraries: An overview. International Journal of Library Science, 7(1): 80-94
Course Materials & Readings
Unit 1: Introduction and Overview
Unit 2: Architecture and Infrastructure of Web
- Lecture Notes: Architecture and Infrastructure of Web
- Works to be done
- Download & install “Brackets”
- Sign up for “Web Protege Account”
- Create an account on “GitHub” (use your “namesurname” pattern for username). Enter your GitHub URL to Google Spreadsheet document
- Enter your Gmail account to Google Spreadsheet document
- Enroll “Semantic Web Technologies” course
- Download Semantic Web Information Management: A Model-Based Perspective.
(The full text of this book is available in PDF through the library. Note that you should use proxy go get access to the book from outside the campus.)
- Articles to be read
- Introduction part (pages from 1 to 7) of Semantic Web Information Management: A Model-Based Perspective.
- Berners-Lee, T., Hendler, J. and Lassila, O. (2001). The Semantic Web. Scientific American, 284(5), 34-43. (the first paper on Semantic Web concept)
- Shadbolt, N., Hall, W. & Berners-Lee, T. (2006 May). The Semantic Web revisited. IEEE Intelligent Systems, 21(3): 96-101.
Unit 3: Semantic Web
- Lecture Notes: Semantic Web
- Works to be done
- Articles to be read
- Wikipedia article on Semantic Web. (Also available in Turkish)
- Berners-Lee, T. (2007, November 21). Giant global graph. (this has something on graphs; we could talk about this in the coming weeks as well)
- Özdemir, S. (2012 Eylül). Günümüz teknolojileri ne kadar “akıllı”? Bilişim Dergisi, no. 146, s. 100-107. (Please take a look at other articles in this special dossier of Bilişim Dergisi on Semantic Web).
- Akçapınar Sezer, E. (2012 Eylül). Anlamsal Web’in ontolojilerle hayata geçmesi, arama/sorgulamada köklü değişiklikler yaratacak. Bilişim Dergisi, no. 146, s. 112-115.
Unit 4: Data and Metadata Management
- Lecture Notes: Data and Metadata Management
- Articles to be read
Unit 5: XML and XML Schema
- Lecture Notes: XML and XML Schema
- Articles to be read
- W3Schools XML Tutorial
- Berners-Lee, T. (1998 September) Semantic Web Road map. (Good overview of the Semantic Web architecture)
- Berners-Lee, T. (1998 September 17) What the Semantic Web can represent. (It explains what the Semantic Web is not.)
Unit 6: RDF and RDF Schema
- Lecture Notes: RDF and RDF Schema
- Articles to be read
- O. Erling & I. Mikhailov, Towards web scale RDF. Proc. SSWS, 2008
- “Chapter 3: The Semantic Web Languages” In De Virgilio et al. (eds.) Semantic Web Information Management: A Model-Based Perspective. (pp. 25-37) (The full text of this book is available in PDF through the library: http://link.springer.com/book/10.1007%2F978-3-642-04329-1 Note that you should use proxy go get access to the book from outside the campus.)
- RDF 1.1 Primer (W3C Working Group Note 25 February 2014): https://www.w3.org/TR/2014/NOTE-rdf11-primer-20140225/
- RDF Schema 1.1 (W3C Recommendation 25 February 2014): https://www.w3.org/TR/rdf-schema/
- Graph theory, https://en.wikipedia.org/wiki/Graph_theory
- Optional reading: “What makes you you”, by Tim Urban: http://waitbutwhy.com/2014/12/what-makes-you-you.html (Remember the example about the definition of human being with 2 legs, 2 arms etc. that I gave you in the class. If one leg is missing, the machine can no longer infer that this person is a “human being”! Read this interesting piece on “what makes you you”!
Midterm Project: Datas & Shemas
- Deadline: April 4, 2017
- How to submit:
- You have to use your GitHub account to submit your project.
- Your project name have to be “bby464Midterm”
- Your project URL have to be “https://github.com/username/bby464Midterm”
- The content of your project (Data):
- Author table (min 5)
- Publisher table (min 3)
- Book table (min 15)
- The content of your project (Files):
- Database Schema (Spreadsheet based, PDF)(db-schema.pdf)
- Database File (Spreadsheet based, PDF)(db-file.pdf)
- XML Schema (XSD)(xml-schema.xsd)
- XML XSL (XSL)(xml-style.xsl)(bonus)
- XML File (XML)(xml-file.xml)
- RDF Triples Document (Doc based, PDF)(rdf-document.pdf)
- RDF / XML File (XML)(rdf-file.xml)
(You can use DCMI Metadata Terms)
Unit 7: Ontologies and Vocabularies
- Lecture Notes: Protege, Open Source Ontology Editor
- Articles to be read
- Protege User Guide
- Schriml, M.L. et al. (2012). Disease ontology: A backbone for disease semantic integration. Nucleic Acid Research, 40: D940-D946.
-
- Web sites to check
- Protege: http://protege.stanford.edu/
- Disease Ontology: http://disease-ontology.org/
- ICD10 data: http://www.icd10data.com/
- MeSH: https://www.nlm.nih.gov/cgi/mesh/2016/MB_cgi?mode=&index=9322
- SNOMED CT: https://en.wikipedia.org/wiki/SNOMED_CT
Unit 8: Semantic Search and Knowledge Discovery
- Lecture Notes: Semantic Search
- Articles to be read
- Zadeh, L.A. (2006). From search engines to question answering systems – The problems of world knowledge, relevance, deduction and precisiation. Sanchez, E. (ed). Fuzzy logic and the Semantic Web (s. 163-210). Amsterdam: Elsevier. (If you say this papers is “tl;dr”, the crux of this paper was summarized in: Tonta, Y. (2012). Bilgi sınıflama, bilgi düzenleme ve bilgi erişim. In: Külcü, Ö., Çakmak, T. & Özel, N. (eds.). Prof. Dr. K. Gülbün Baydur’a armağan içinde (p. 155-172; especially starting from p. 166: “Arama Motorlarından Soru Yanıtlama Sistemlerine”). Ankara: Hacettepe Üniversitesi Bilgi ve Belge Yönetimi Bölümü.
- Süzen, A.A. & Taşdelen, K. (2016). Anlamsal Web Teknolojisi İle İçerik Arama. El-Cezeri Journal of Science and Engineering, 3: 118-124.
Unit 9: Social Semantic Search
- Guest Speaker: Dr. Güven Köse, Mantis Software Company
- Presentation: Social Semantic Search
Unit 10: Linked Data
- TED Talks
- Tim Berners-Lee, The next Web of open, linked data. (available in Turkish subtitles)
- Tim Berners-Lee, The year open data went worldwide. (available in Turkish subtitles)
- Hans Rosling, The best stats you ever seen: Link design to data.
- RelFinder: Interactive Relationship Discovery in RDF: http://www.visualdataweb.org/relfinder.php
Final Project: Ontology
- Deadline: May 23, 2017
- How to submit:
- You have to use your GitHub account to submit your project.
- Your project name have to be “bby464Final”
- Your project URL have to be “https://github.com/username/bby464Final”
- Project File Properties:
- Protege (http://webprotege.stanford.edu/) Output File
- Download Format (RDF / XML)
- Name of your Protege Project: BBY464NameSurname
- Project Details:
- You can choose any topic which you know well
- You have to use while you define / describe your ontology:
- Classes
- Object Properties
- Data Properties
- Individuals