Difference between revisions of "Techniques to analyze the Wikipedia Social Network"

From AIRWiki
(Project short description)
(Project short description)
 
(11 intermediate revisions by 2 users not shown)
Line 6: Line 6:
 
=== Project short description ===
 
=== Project short description ===
  
Wikipedia is the most large and known example of wiki. Thousands of users interacts within it to build a neutral encyclopedia. But not every user interact equally: there are users who spend much more time on Wikipedia than others and may influence it.
+
Wikipedia is the largest and best-known example of a wiki. Thousands of users cooperate within it to build a neutral encyclopedia. But not every user interacts equally: there are users who spend much more time on Wikipedia than others and may influence it.
Our work tries to define a network of social interactions between Wikipedia users implicitly extracted by Wikipedia dumps, to analyze its sociometric stars and to discover its subcommunities.
+
With its growing importance in the spread of information some questions are arising: how much can the content of Wikipedia be trusted? Who really writes on it? And what are their aims?
 +
Our work tries to study the community of users acting on a generic wiki with an automatic, general and modular process.
 +
The first step consists of extracting data from the MediaWiki dump. The second calculates each user's contribution to each page of the wiki. Then the users who can be considered the "best" contributors for each page are selected and used to build a social network. A standard set of analyses is applied to this network, such as a macroscopic study and centrality measures.
 +
 
 +
[[Image:Schemaprocessi.png]]
  
 
=== Dates ===
 
=== Dates ===
 
Start date: 2008/07/01
 
Start date: 2008/07/01
  
End date: who knows?
+
End date: 2009/04/20
  
 
=== People involved ===
 
=== People involved ===
Line 26: Line 30:
  
 
== '''Part 2: project description''' ==
 
== '''Part 2: project description''' ==
Soon available
+
 
 +
[[Media:Tesi Tasso - 23.03.2009.pdf|Project Documentation (in Italian)]]
 +
 
 +
[[Media:Presentazione Tasso - 20.04.2009.pdf|Project Presentation (in Italian)]]
 +
 
 +
= Related resources =
 +
 
 +
* [http://trust.cse.ucsc.edu/ WikiTrust project]
 +
 
 +
== Software for Social Network Analysis ==
 +
* [http://vlado.fmf.uni-lj.si/pub/networks/pajek/ Pajek: Program for Large Network Analysis]
 +
* [http://jung.sourceforge.net/ JUNG Java Universal Network/Graph Framework]
 +
* [http://cneurocvs.rmki.kfki.hu/igraph/ The igraph library]

Latest revision as of 17:57, 11 May 2009

Part 1: project profile

Project name

Techniques to analyze the Wikipedia Social Network

Project short description

Wikipedia is the largest and best-known example of a wiki. Thousands of users cooperate within it to build a neutral encyclopedia. But not every user interacts equally: there are users who spend much more time on Wikipedia than others and may influence it. With its growing importance in the spread of information some questions are arising: how much can the content of Wikipedia be trusted? Who really writes on it? And what are their aims? Our work tries to study the community of users acting on a generic wiki with an automatic, general and modular process. The first step consists of extracting data from the MediaWiki dump. The second calculates each user's contribution to each page of the wiki. Then the users who can be considered the "best" contributors for each page are selected and used to build a social network. A standard set of analyses is applied to this network, such as a macroscopic study and centrality measures.

Schemaprocessi.png
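
The process described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the project's actual implementation: the sample dump is hand-written and simplified (real MediaWiki exports declare an XML namespace and carry many more fields), and the contribution measure (characters added per revision), the selection rule (top `top_k` users per page) and the edge rule (two users are linked when both are "best" contributors of the same page) are all assumptions chosen for brevity. The function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict
from itertools import combinations

# Hand-written sample that mimics the layout of a MediaWiki XML export.
# Assumption: no XML namespace, only title/username/text are present.
SAMPLE_DUMP = """
<mediawiki>
  <page>
    <title>Graph</title>
    <revision><contributor><username>alice</username></contributor><text>aaaa</text></revision>
    <revision><contributor><username>bob</username></contributor><text>aaaabbbbbb</text></revision>
    <revision><contributor><username>carol</username></contributor><text>aaaabbbbbbc</text></revision>
  </page>
  <page>
    <title>Wiki</title>
    <revision><contributor><username>bob</username></contributor><text>bbb</text></revision>
    <revision><contributor><username>dave</username></contributor><text>bbbddddd</text></revision>
  </page>
</mediawiki>
"""


def extract_revisions(xml_text):
    """Step 1: yield (page title, username, revision text) in dump order."""
    root = ET.fromstring(xml_text.strip())
    for page in root.iter("page"):
        title = page.findtext("title")
        for rev in page.iter("revision"):
            user = rev.findtext("contributor/username")
            text = rev.findtext("text") or ""
            yield title, user, text


def contributions(revisions):
    """Step 2: credit each user with the characters their revisions added.

    Assumption: contribution = growth in text length, never negative.
    """
    contrib = defaultdict(lambda: defaultdict(int))
    last_len = defaultdict(int)
    for title, user, text in revisions:
        contrib[title][user] += max(len(text) - last_len[title], 0)
        last_len[title] = len(text)
    return contrib


def best_contributors(contrib, top_k=2):
    """Step 3: keep the top_k users per page (ties broken by name)."""
    return {
        title: sorted(users, key=lambda u: (-users[u], u))[:top_k]
        for title, users in contrib.items()
    }


def build_network(best):
    """Step 4: undirected edges weighted by the number of shared pages."""
    edges = defaultdict(int)
    for users in best.values():
        for a, b in combinations(sorted(users), 2):
            edges[(a, b)] += 1
    return dict(edges)


def degree_centrality(edges):
    """Step 5: fraction of the other nodes each user is connected to."""
    degree = defaultdict(int)
    for a, b in edges:
        degree[a] += 1
        degree[b] += 1
    n = len(degree)
    return {u: d / (n - 1) for u, d in degree.items()} if n > 1 else {}


if __name__ == "__main__":
    best = best_contributors(contributions(extract_revisions(SAMPLE_DUMP)))
    network = build_network(best)
    print(network)
    print(degree_centrality(network))
```

On this sample, bob is a top contributor of both pages and therefore ends up as the most central user. Degree centrality is used here only because it is the simplest centrality measure; a dedicated package such as those listed under "Software for Social Network Analysis" would be the natural choice for a real dump.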

Dates

Start date: 2008/07/01

End date: 2009/04/20

People involved

Project Advisor

David Laniado

Students

Students currently working on the project

Riccardo Tasso - rayman (ta) fastwebnet (tod) it

Part 2: project description

Project Documentation (in Italian)

Project Presentation (in Italian)

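Related resources

WikiTrust project (http://trust.cse.ucsc.edu/)

Software for Social Network Analysis

Pajek: Program for Large Network Analysis (http://vlado.fmf.uni-lj.si/pub/networks/pajek/)

JUNG Java Universal Network/Graph Framework (http://jung.sourceforge.net/)

The igraph library (http://cneurocvs.rmki.kfki.hu/igraph/)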