Abstract
The quality of Wikipedia articles is debatable. On the one hand, existing research indicates that not only are people willing to contribute articles but the quality of these articles is close to that found in conventional encyclopedias. On the other hand, the public has never stopped criticizing the quality of Wikipedia articles, and critics never have trouble finding low-quality ones. Why do Wikipedia articles vary widely in quality? We investigate the relationship between collaboration and Wikipedia article quality. We show that article quality depends not only on the different types of contributors but also on how they collaborate. Drawing on an empirical study, we classify contributors by their roles in editing individual Wikipedia articles. We then identify various patterns of collaboration based on provenance or, more specifically, on who does what to Wikipedia articles. Our research helps identify collaboration patterns that are preferable or detrimental for article quality, thus providing insights for designing tools and mechanisms to improve the quality of Wikipedia articles.
Index Terms
- Who does what: Collaboration patterns in the wikipedia and their impact on article quality