Difference between revisions of "Extraction"

From AIRWiki
 
(23 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
{{Project
 
{{Project
|title=Extraction
+
|title=Automatic Extraction of Domain Ontologies from Text
|short_descr=This thesis is to be developed together with Noustat S.r.l. (see http://www.noustat.it), which is developing research activities directed toward the optimization of knowledge management services, in collaboration with another company operating in this field. The project aims to remove the ontology-building bottleneck: building an ontology is a long and expensive activity that usually requires the direct collaboration of a domain expert. Automatically building the ontology from a set of textual documents related to a specific domain is expected to improve the ability to provide the knowledge management service, both by reducing the time-to-application and by increasing the number of domains that can be covered. For this project, unsupervised learning methods will be applied in sequence, exploiting the topological properties of the ultrametric spaces that emerge from the taxonomic structure of the concepts present in the texts, and associative methods will extend the concept network to lateral, non-hierarchical relationships.
+
|image=Ontology_example.png
|tutor=AndreaBonarini;DavideEynard;MatteoMatteucci
+
|short_descr=This project aims to remove the ontology-building bottleneck by semi-automatically building an ontology from a set of textual documents related to a specific domain.
 +
|tutor=DavideEynard;MatteoMatteucci
 
|students=FabioMarfia
 
|students=FabioMarfia
 
|resarea=Machine Learning
 
|resarea=Machine Learning
 +
|restopic=Knowledge Learning from Text
 
|start=2009/10/01
 
|start=2009/10/01
|status=Active
+
|status=Closed
 
|level=Ms
 
|level=Ms
 
|type=Thesis
 
|type=Thesis
 
}}
 
}}
This is an empty page for the Extractor project, born out of the
+
 
[[Automatic_generation_of_domain_ontologies]] proposal.
+
== '''Part 1: project profile''' ==
 +
 
 +
=== Project name ===
 +
''On the Use of Correspondence Analysis to Extract Ontologies and Other Semantics from Text''
 +
 
 +
 
 +
=== Project short description ===
 +
This project aims to remove the knowledge acquisition bottleneck by (semi-)automatically building a seed knowledge base from a set of textual documents related to a specific domain.
 +
 
 +
=== Dates ===
 +
 
 +
* Start date: 2009/07/30
 +
* End date: 2010/05/05
 +
 
 +
 
 +
=== People involved ===
 +
 
 +
==== Politecnico di Milano people ====
 +
 
 +
* Ing. [[User:MatteoMatteucci|Matteo Matteucci]]
 +
* Ing. [[User:DavideEynard|Davide Eynard]]
 +
 
 +
 
 +
=== Students ===
 +
 
 +
* Ing. [[User:FabioMarfia|Fabio Marfia]]
 +
 
 +
 
 +
== '''Part 2: project description''' ==
 +
* [http://docs.google.com/Doc?docid=0Ac5SBJf9Fj2UZGR4NmtkcmpfMThmajczazdjeg&hl=en An early project description]
 +
* [http://docs.google.com/Doc?docid=0Ac5SBJf9Fj2UZGR4NmtkcmpfMTFkNGN2NThmcQ&hl=en The shared project document]
 +
* [http://davide.eynard.it/noustat/ A collection of basic tutorials on PCA and Correspondence Analysis]
 +
* [http://davide.eynard.it/noustat/papers%20murtagh/ A collection of papers by Fionn Murtagh]
 +
* [http://davide.eynard.it/noustat/papers%20ontology%20learning/ A collection of papers about ontology learning from text]
 +
* [http://airwiki.elet.polimi.it/mediawiki/images/d/d8/On_the_use_of_CA_to_learn_knowledge.pdf Master Thesis Work]

Latest revision as of 14:43, 21 January 2011

Automatic Extraction of Domain Ontologies from Text
Short Description: This project aims to remove the ontology-building bottleneck by semi-automatically building an ontology from a set of textual documents related to a specific domain.
Coordinator:
Tutor: DavideEynard (eynard@elet.polimi.it), MatteoMatteucci (matteo.matteucci@polimi.it)
Collaborator:
Students: FabioMarfia (marfia@elet.polimi.it)
Research Area: Machine Learning
Research Topic: Knowledge Learning from Text
Start: 2009/10/01
Status: Closed
Level: Ms
Type: Thesis

Part 1: project profile

Project name

On the Use of Correspondence Analysis to Extract Ontologies and Other Semantics from Text


Project short description

This project aims to remove the knowledge acquisition bottleneck by (semi-)automatically building a seed knowledge base from a set of textual documents related to a specific domain.

Dates

  • Start date: 2009/07/30
  • End date: 2010/05/05


People involved

Politecnico di Milano people

  • Ing. Matteo Matteucci
  • Ing. Davide Eynard

Students

  • Ing. Fabio Marfia

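Part 2: project description

  • An early project description: http://docs.google.com/Doc?docid=0Ac5SBJf9Fj2UZGR4NmtkcmpfMThmajczazdjeg&hl=en
  • The shared project document: http://docs.google.com/Doc?docid=0Ac5SBJf9Fj2UZGR4NmtkcmpfMTFkNGN2NThmcQ&hl=en
  • A collection of basic tutorials on PCA and Correspondence Analysis: http://davide.eynard.it/noustat/
  • A collection of papers by Fionn Murtagh: http://davide.eynard.it/noustat/papers%20murtagh/
  • A collection of papers about ontology learning from text: http://davide.eynard.it/noustat/papers%20ontology%20learning/
  • Master Thesis Work: http://airwiki.elet.polimi.it/mediawiki/images/d/d8/On_the_use_of_CA_to_learn_knowledge.pdf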