Abstract
Weblogs, or Blogs, have facilitated people to express their thoughts, voice their opinions, and share their experiences and ideas. Individuals experience a sense of community, a feeling of belonging, a bonding that members matter to one another and their niche needs will be met through online interactions. Its open standards and low barrier to publication have transformed information consumers to producers. This has created a plethora of open-source intelligence, or "collective wisdom" that acts as the storehouse of over-whelming amounts of knowledge about the members, their environment and the symbiosis between them. Nonetheless, vast amounts of this knowledge still remain to be discovered and exploited in its suitable way. In this paper, we introduce various state-of-the-art research issues, review some key elements of research such as tools and methodologies in Blogosphere, and present a case study of identifying the influential bloggers in a community to exemplify the integration of some major aspects discussed in this paper. Towards the end, we also compare and contrast the blogosphere and social networks and the research therein.
- E. Adar, L. Zhang, L. Adamic, and R. Lukose. Implicit structure and the dynamics of blogspace. In Proceedings of the 13th International World Wide Web Conference, 2004.Google Scholar
- Nitin Agarwal, Magdiel Galan, Huan Liu, and Shankar Subramanya. Clustering blogs with collective wisdom. In Proceedings of the International Conference on Web Engineering, 2008. Google ScholarDigital Library
- Nitin Agarwal, Huan Liu, John J. Salerno, and Philip S. Yu. Searching for Familiar Strangers on Blogosphere: Problems and Challenges. In NSF Symposium on Next-Generation Data Mining and Cyber-enabled Discovery and Innovation (NGDM), 2007.Google Scholar
- Nitin Agarwal, Huan Liu, Lei Tang, and Philip S. Yu. Identifying the influential bloggers. In Proccedings of the First ACM International Conference on Web Search and Data Mining (Video available at: http://videolectures.net/wsdm08 agarwal iib/), 2008. Google ScholarDigital Library
- G. Attardi and M. Simi. Blog mining through opinionated words. In Proceedings of the fifteenth Text REtrieval Conference (TREC), 2006.Google Scholar
- A. L. Barabasi and R. Albert. Emergence of scaling in random networks. Science, 286(509), 1999.Google Scholar
- A. Blanchard. Blogs as virtual communities: Identifying a sense of community in the julie/julia project. Into the Blogosphere: Rhetoric, Community and Culture.http://blog.lib.umn.edu/blogosphere, 2004.Google Scholar
- A. Blanchard and M. Markus. The experienced sense of a virtual community: Characteristics and processes. The DATA BASE for Advances in Information Systems, 35(1), 2004. Google ScholarDigital Library
- A. Blum, T. H. C. Mugizi, and M. R. Rwebangira. A random-surfer web-graph model. In Third Workshop on Analytic Algorithmics and Combinatorics (ANALCO06), 2006.Google ScholarCross Ref
- Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107--117, 1998. Google ScholarDigital Library
- Christopher H. Brooks and Nancy Montanez. Improved annotation of the blogosphere via autotagging and hierarchical clustering. In WWW '06: Proceedings of the 15th international conference on World Wide Web, pages 625--632, New York, NY, USA, 2006. ACM Press. Google ScholarDigital Library
- Alvin Chin and Mark Chignell. A social hypertext model for finding community in blogs. In HYPERTEXT'06: Proceedings of the seventeenth conference on Hypertext and hypermedia, pages 11--22, New York, NY, USA, 2006. ACM Press. Google ScholarDigital Library
- Thayne Coffman and Sherry Marcus. Dynamic classification of groups through social network analysis and hmms. In Proceedings of IEEE Aerospace Conference, 2004.Google ScholarCross Ref
- Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, and Richard Harshman. Indexing by latent semantic analysis. Journal of the American Society for information science, 1990.Google Scholar
- Daniel Drezner and Henry Farrell. The power and politics of blogs. In American Political Science Association Annual Conference, 2004.Google Scholar
- L. Efimova and S. Hendrick. In search for a virtual settlement: An exploration of weblog community boundaries, 2005.Google Scholar
- T. Elkin. Just an online minute.. online forecast. http://publications.mediapost.com/index.cfm?fuseaction=Articles.showArticle art aid=29803.Google Scholar
- Thomas L. Friedman. The World Is Flat: A Brief History of the Twenty-First Century. Farrar, Straus and Giroux, 2005.Google Scholar
- Michael Gamon, Anthony Aue, Simon Corston-Oliver, and Eric Ringger. Pulse: Mining Customer Opinions from Free Text. In Proceedings of the 6th International Symposium on Intelligent Data Analysis, 2005. Google ScholarDigital Library
- Kathy E. Gill. How can we measure the influence of the blogosphere? In Proceedings of the WWW'04: work-shop on the Weblogging Ecosystem: Aggregation, Analysis and Dynamics, 2004.Google Scholar
- Dan Gillmor. We the Media: Grassroots Journalism by the People, for the People. O'Reilly, 2006. Google ScholarDigital Library
- Jennifer Golbeck and James Hendler. Inferring binary trust relationships in web-based social networks. ACM Trans. Inter. Tech., 6(4):497--529, 2006. Google ScholarDigital Library
- Jacob Goldenberg, Barak Libai, and Eitan Muller. Talk of the network: A complex systems look at the underlying process of word-of-mouth. Marketing Letters, 12:211--223, 2001.Google ScholarCross Ref
- D. Gruhl, David Liben-Nowell, R. Guha, and A. Tomkins. Information diffusion through blogspace. SIGKDD Exploration Newsletter, 6(2):43--52, 2004. Google ScholarDigital Library
- R. Guha, Ravi Kumar, Prabhakar Raghavan, and Andrew Tomkins. Propagation of trust and distrust. In WWW '04: Proceedings of the 13th international conference on World Wide Web, pages 403--412, New York, NY, USA, 2004. ACM Press. Google ScholarDigital Library
- Z. Gyongyi, P. Berkhin, Hector Garcia-Molina, and J. Pedersen. Link spam detection based on mass estimation. In Proceedings of the 32nd International Conference on Very Large Data Bases (VLDB), 2006. Google ScholarDigital Library
- Z. Gyongyi, H. Garcia-Molina, and J. Pedersen. Combating web spam with trustrank. In Proceedings of the 30th International Conference on Very Large Data Bases (VLDB), 2004. Google ScholarDigital Library
- Akshay Java, Pranam Kolari, Tim Finin, and Tim Oates. Modeling the spread of influence on the blogosphere. In Proceedings of the 15th International World Wide Web Conference, 2006.Google Scholar
- Anubhav Kale, Amit Karandikar, Pranam Kolari, Akshay Java, Tim Finin, and Anupam Joshi. Modeling trust and influence in the blogosphere using link polarity. In International Conference on Weblogs and Social Media, 2007.Google Scholar
- Ed Keller and Jon Berry. One American in ten tells the other nine how to vote, where to eat and, what to buy. They are The Influentials. The Free Press, 2003.Google Scholar
- David Kempe, Jon Kleinberg, and Eva Tardos. Maximizing the spread of influence through a social network. In Proceedings of the KDD, pages 137--146, New York, NY, USA, 2003. ACM Press. Google Scholar
- J. Kleinberg. Authoritative sources in a hyperlinked environment. In 9th ACM-SIAM Symposium on Discrete Algorithms, 1998. Google ScholarDigital Library
- P. Kolari, T. Finin, and A. Joshi. SVMs for the blogosphere: Blog identification and splog detection. In AAAI Spring Symposium on Computational Approaches to Analyzing Weblogs, 2006.Google Scholar
- P. Kolari, A. Java, T. Finin, T. Oates, and A. Joshi. Detecting spam blogs: A machine learning approach. In Proceedings of the 21st National Conference on Artificial Intelligence (AAAI), 2006. Google ScholarDigital Library
- Apostolos Kritikopoulos, Martha Sideri, and Iraklis Varlamis. Blogrank: ranking weblogs based on connectivity and similarity features. In AAA-IDEA '06: Proceedings of the 2nd international workshop on Advanced architectures and algorithms for internet delivery and applications, page 8, New York, NY, USA, 2006. ACM Press. Google ScholarDigital Library
- R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. Trawling the web for emerging cyber communities. In The 8th International World Wide Web Conference, 1999. Google ScholarDigital Library
- Ravi Kumar, Jasmine Novak, Prabhakar Raghavan, and Andrew Tomkins. On the Bursty Evolution of Blogspace. In Proceedings of the 12th international conference on World Wide Web, pages 568--576, New York, NY, USA, 2003. ACM Press. Google ScholarDigital Library
- J. Leskovec, M. McGlohon, C. Faloutsos, N. Glance, and M. Hurst. Cascading behavior in large blog graphs. In SIAM International Conference on Data Mining, 2007.Google ScholarCross Ref
- Beibei Li, Shuting Xu, and Jun Zhang. Enhancing clustering blog documents by utilizing author/reader comments. In ACM-SE 45: Proceedings of the 45th annual southeast regional conference, pages 94--99, New York, NY, USA, 2007. ACM Press. Google ScholarDigital Library
- Yu-Ru Lin, Hari Sundaram, Yun Chi, Junichi Tatemura, and Belle L. Tseng. Splog detection using self-similarity analysis on blog temporal dynamics. In Proceedings of the 3rd international workshop on Adversarial information retrieval on the web (AIRWeb), pages 1--8, New York, NY, USA, 2007. ACM Press. Google ScholarDigital Library
- Bing Liu. Web Data Mining: Exploring Hyperlinks, Contents and Usage Data. Springer, 2006. Google ScholarDigital Library
- A. Ntoulas, M. Najork, M. Manasse, and D. Fetterly. Detecting spam web pages through content analysis. In Proceedings of the 15th international conference on World Wide Web (WWW), 2006. Google ScholarDigital Library
- Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project, 1998.Google Scholar
- David M. Pennock, Gary W. Flake, Steve Lawrence, Eric J. Glover, and C. Lee Giles. Winners don't take all: Characterizing the competition for links on the web. Proceedings of the National Academy of Sciences, 99(8):5207-5211, 2002.Google ScholarCross Ref
- Josep M. Pujol, Ramon Sangesa, and Jordi Delgado. Extracting reputation in multi agent systems by means of social network topology. In Proceedings of the first international joint conference on Autonomous agents and multiagent systems (AAMAS), pages 467--474, New York, NY, USA, 2002. ACM Press. Google ScholarDigital Library
- Matthew Richardson and Pedro Domingos. Mining knowledge-sharing sites for viral marketing. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge Discovery and Data mining, pages 61--70, New York, NY, USA, 2002. ACM Press. Google ScholarDigital Library
- Jordi Sabater and Carles Sierra. Reputation and social network analysis in multi-agent systems. In AAMAS '02: Proceedings of the first international joint conference on Autonomous agents and multiagent systems (AAMAS), pages 475--482, New York, NY, USA, 2002. ACM Press. Google ScholarDigital Library
- Loren Terveen and David W. McDonald. Social matching: A framework and research agenda. ACM Trans. Comput.-Hum. Interact., 12(3):401--434, 2005. Google ScholarDigital Library
- D. J. Watts and S. H. Strogatz. Collective dynamics of 'small-world networks. Nature, 393(6684):440442, 1998.Google ScholarCross Ref
- Bin Yu and Munindar P. Singh. Detecting deception in reputation management. In Proceedings of the second international joint conference on Autonomous Agents and Multiagent Systems (AAMAS), pages 73--80, New York, NY, USA, 2003. ACM Press. Google ScholarDigital Library
Index Terms
- Blogosphere: research issues, tools, and applications
Recommendations
A study of communities and influence in blogosphere
IDAR '08: Proceedings of the 2nd SIGMOD PhD workshop on Innovative database researchBlogging becomes a popular way for a Web user to publish information on the Web. Bloggers write blog posts, share likes and dislikes, voice opinions, provide suggestions, and report news. In this work we study influential bloggers in both community as ...
The political blogosphere and the 2004 U.S. election: divided they blog
LinkKDD '05: Proceedings of the 3rd international workshop on Link discoveryIn this paper, we study the linking patterns and discussion topics of political bloggers. Our aim is to measure the degree of interaction between liberal and conservative blogs, and to uncover any differences in the structure of the two communities. ...
Conversations in the Blogosphere: An Analysis "From the Bottom Up"
HICSS '05: Proceedings of the Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS'05) - Track 4 - Volume 04The "blogosphere" has been claimed to be a densely interconnected conversation, with bloggers linking to other bloggers, referring to them in their entries, and posting comments on each other's blogs. Most such characterizations have privileged a subset ...
Comments