Mining Wikipedia categories
Title: | Wikipedia category map |
Image:wikipedia_categories.png |
Description: | Wikipedia articles are organized in a hierarchy of categories, manually assigned by users. This process amounts to a huge collective effort to categorize human knowledge; the result is a large and disordered graph that can provide valuable information for a variety of applications (natural language processing, information retrieval, ontology building, ...).
The aim of the project is to develop a tool for visualizing this graph. The project can be extended to a thesis in various directions, such as the development of advanced visualization features or the creation and population of an ontology. | |
Tutor: | DavidLaniado (david.laniado@gmail.com), RiccardoTasso (tasso@elet.polimi.it), MarcoColombetti (colombet@elet.polimi.it) | |
Start: | Now | |
Students: | 1 - 2 | |
CFU: | 5 - 20 | |
Research Area: | Social Software and Semantic Web | |
Research Topic: | Wiki Analysis | |
Level: | Bs, Ms | |
Type: | Course, Thesis | |
Status: | Proposal |
- Tools and instruments
- The software can be implemented in any programming language; we have already developed a Java prototype that queries the Wikipedia API, which can be used as a starting point.
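As a rough illustration of what such a starting point might look like, the following Python sketch collects subcategory links through the MediaWiki `categorymembers` query and exports them as a Graphviz DOT graph. It is not the Java prototype mentioned above; the function names (`subcategory_params`, `fetch_subcategories`, `crawl`, `to_dot`), the choice of the English Wikipedia endpoint, the User-Agent string and the depth limit are all assumptions made for the example.

```python
import json
import urllib.parse
import urllib.request
from collections import deque

# Assumption: the English Wikipedia endpoint; other language editions differ only in host.
API_URL = "https://en.wikipedia.org/w/api.php"


def subcategory_params(category, cmcontinue=None):
    """Build the query parameters that list the direct subcategories of a category."""
    params = {
        "action": "query",
        "list": "categorymembers",
        "cmtitle": category,   # e.g. "Category:Physics"
        "cmtype": "subcat",    # only subcategories, not articles or files
        "cmlimit": "500",
        "format": "json",
    }
    if cmcontinue:
        params["cmcontinue"] = cmcontinue  # continuation token for long result lists
    return params


def fetch_subcategories(category):
    """Return the titles of all direct subcategories (performs HTTP requests)."""
    titles, cmcontinue = [], None
    while True:
        query = urllib.parse.urlencode(subcategory_params(category, cmcontinue))
        request = urllib.request.Request(
            API_URL + "?" + query,
            headers={"User-Agent": "category-map-sketch/0.1"},  # hypothetical client name
        )
        with urllib.request.urlopen(request) as response:
            data = json.load(response)
        titles += [member["title"] for member in data["query"]["categorymembers"]]
        cmcontinue = data.get("continue", {}).get("cmcontinue")
        if not cmcontinue:
            return titles


def crawl(root, max_depth, fetch=fetch_subcategories):
    """Breadth-first crawl from root; returns a set of (parent, child) edges."""
    edges, seen, queue = set(), {root}, deque([(root, 0)])
    while queue:
        category, depth = queue.popleft()
        if depth == max_depth:
            continue
        for child in fetch(category):
            edges.add((category, child))
            # A category can have several parents and the graph may contain
            # cycles, so each category is expanded only once.
            if child not in seen:
                seen.add(child)
                queue.append((child, depth + 1))
    return edges


def to_dot(edges):
    """Serialize the edges as a Graphviz DOT digraph for visualization."""
    def quote(title):
        return '"' + title.replace('"', '\\"') + '"'

    lines = ["digraph categories {"]
    for parent, child in sorted(edges):
        lines.append(f"  {quote(parent)} -> {quote(child)};")
    lines.append("}")
    return "\n".join(lines)
```

Calling `to_dot(crawl("Category:Physics", 2))` would query the live API and return DOT text that a tool such as Graphviz can render. The `fetch` parameter of `crawl` lets the traversal be tried offline with a stub function before any requests are sent.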