Difference between revisions of "Data Extraction From Wikis"

From AIRWiki
Line 1: Line 1:
 
{{Project
 
{{Project
  | title=Use of ontology for estraction of structured datas from wikis
+
  | title=Use of ontologies for the extraction of structured data from wikis
  | short_descr=Java application for estraction of data from wiki and re-organize them in a ontology  
+
  | short_descr=Development of a Java application for the extraction of data from wikis and their reorganization inside an ontology  
 
  | tutor=DavideEynard
 
  | tutor=DavideEynard
 
  | students=CarloMiglierina
 
  | students=CarloMiglierina
Line 16: Line 16:
  
 
=== Project name ===
 
=== Project name ===
''Use of ontology for estraction of structured datas from wikis''
+
''Use of ontologies for the extraction of structured data from wikis''
  
 
=== Project short description ===
 
=== Project short description ===
Wikipedia is the largest and most known example of wiki. There are a lot of information in this wiki, and a lot of users create and edit pages. But this free encyclopedia has a disadvantage: the datas are not structured and so it is not possible to do advanced researches. Moreover computers can't process those datas. The aim of this project is to create a Java application that can extract datas and put them in an ontology, in order to have structured datas. Using the ontology it is possible do advanced researches, and computers can process the datas. For example this application was used to organize datas of the characters of "The lord of the rings".
+
Wikipedia is the largest and most known example of wiki. There is a lot of information inside wikis that are built using its same technology, and a lot of users who create and edit their pages. But these free encyclopedias have a disadvantage: data is not structured and so it is not possible to do advanced researches. Moreover, computers cannot process these data. The aim of this project is to create a Java application that extracts semi-structured data from wiki templates and infoboxes and puts them inside an ontology, in order to have structured data. Using the ontology it is possible to do advanced researches, as computers can process these data. As an example, this application has been used to organize data about the characters of "The lord of the rings".
  
 
=== Dates ===
 
=== Dates ===

Revision as of 14:45, 8 October 2009

Use of ontologies for the extraction of structured data from wikis
Short Description: Development of a Java application for the extraction of data from wikis and their reorganization inside an ontology
Coordinator:
Tutor: DavideEynard (eynard@elet.polimi.it)
Collaborator:
Students: CarloMiglierina (carlo.miglierina@gmail.com)
Research Area: Social Software and Semantic Web
Research Topic:
Start: 2008/10/28
End: 2009/09/05
Status: Closed
Level: Bs
Type: Thesis

Part 1: project profile

Project name

Use of ontologies for the extraction of structured data from wikis

Project short description

Wikipedia is the largest and best-known example of a wiki. Wikis built on the same technology hold a great deal of information, and many users create and edit their pages. These free encyclopedias have a drawback, however: their data are not structured, so advanced searches are not possible and computers cannot process the data. The aim of this project is to create a Java application that extracts semi-structured data from wiki templates and infoboxes and places them in an ontology, yielding structured data. The ontology makes advanced searches possible, because computers can process the structured data. As an example, the application has been used to organize data about the characters of "The Lord of the Rings".
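
The extraction step described above can be illustrated with a minimal Java sketch. It is not the project's actual code: the class name TemplateToTriples, both method names and the http://example.org/ namespace are hypothetical, and the parser assumes a template written with one "| key=value" parameter per line, ignoring nested templates and multi-line values. Each parameter becomes one subject-predicate-object statement, printed in an N-Triples-like form that an ontology library could then load.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class TemplateToTriples {

    // Hypothetical namespace for properties derived from template parameter names.
    static final String PREDICATE_BASE = "http://example.org/prop/";

    // Parses a template written with one "| key=value" parameter per line.
    // Assumption: no nested templates and no values spanning several lines.
    public static Map<String, String> parseTemplate(String wikitext) {
        Map<String, String> fields = new LinkedHashMap<>();
        for (String rawLine : wikitext.split("\n")) {
            String line = rawLine.trim();
            if (!line.startsWith("|")) {
                continue; // template name, closing braces, or unrelated text
            }
            // Drop a closing "}}" that shares the line with the last parameter.
            if (line.endsWith("}}")) {
                line = line.substring(0, line.length() - 2);
            }
            int eq = line.indexOf('=');
            if (eq < 0) {
                continue; // positional parameter, skipped in this sketch
            }
            String key = line.substring(1, eq).trim();
            String value = line.substring(eq + 1).trim();
            if (!key.isEmpty() && !value.isEmpty()) {
                fields.put(key, value);
            }
        }
        return fields;
    }

    // Turns each parameter into one statement about the given subject IRI.
    public static List<String> toTriples(String subject, Map<String, String> fields) {
        List<String> triples = new ArrayList<>();
        for (Map.Entry<String, String> e : fields.entrySet()) {
            String predicate = PREDICATE_BASE + e.getKey().replace(' ', '_');
            // Escape backslashes and quotes so the literal stays well-formed.
            String literal = e.getValue().replace("\\", "\\\\").replace("\"", "\\\"");
            triples.add("<" + subject + "> <" + predicate + "> \"" + literal + "\" .");
        }
        return triples;
    }

    public static void main(String[] args) {
        String wikitext = "{{Project\n"
                + " | title=Use of ontologies for the extraction of structured data from wikis\n"
                + " | tutor=DavideEynard\n"
                + " | students=CarloMiglierina\n"
                + "}}";
        Map<String, String> fields = parseTemplate(wikitext);
        for (String triple : toTriples("http://example.org/wiki/Data_Extraction_From_Wikis", fields)) {
            System.out.println(triple);
        }
    }
}
```

The sample input reuses the Project template of this page. A real extractor would also have to handle nested templates, wiki links inside values, and the mapping from parameter names to the properties of the target ontology.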

Dates

Start date: 2008/10/28

End date: 2009/09/05

People involved

Project Advisor

Davide Eynard

Students

Students currently working on the project

Carlo Miglierina

Part 2: project description

Project Documentation (in Italian)

Project Presentation (in Italian)