TY - GEN
T1 - The value of using big data technologies in computational social science
AU - Ch'ng, Eugene
N1 - Publisher Copyright:
© Copyright 2014 ACM.
Copyright:
Copyright 2017 Elsevier B.V., All rights reserved.
PY - 2014/8/4
Y1 - 2014/8/4
N2 - The discovery of phenomena in social networks has prompted renewed interests in the field. Data in social networks however can be massive, requiring scalable Big Data architecture. Conversely, research in Big Data needs the volume and velocity of social media data for testing its scalability. Not only so, appropriate data processing and mining of acquired datasets involve complex issues in the variety, veracity, and variability of the data, after which visualisation must occur before we can see fruition in our efforts. This article presents topical, multimodal, and longitudinal social media datasets from the integration of various scalable open source technologies. The article details the process that led to the discovery of social information landscapes within the Twitter social network, highlighting the experience of dealing with social media datasets, using a funneling approach so that data becomes manageable. The article demonstrated the feasibility and value of using scalable open source technologies for acquiring massive, connected datasets for research in the social sciences.
AB - The discovery of phenomena in social networks has prompted renewed interests in the field. Data in social networks however can be massive, requiring scalable Big Data architecture. Conversely, research in Big Data needs the volume and velocity of social media data for testing its scalability. Not only so, appropriate data processing and mining of acquired datasets involve complex issues in the variety, veracity, and variability of the data, after which visualisation must occur before we can see fruition in our efforts. This article presents topical, multimodal, and longitudinal social media datasets from the integration of various scalable open source technologies. The article details the process that led to the discovery of social information landscapes within the Twitter social network, highlighting the experience of dealing with social media datasets, using a funneling approach so that data becomes manageable. The article demonstrated the feasibility and value of using scalable open source technologies for acquiring massive, connected datasets for research in the social sciences.
KW - Computational social science
KW - Data mining
KW - Open source
KW - Social network analysis
KW - Twitter
UR - http://www.scopus.com/inward/record.url?scp=84986001286&partnerID=8YFLogxK
U2 - 10.1145/2640087.2644162
DO - 10.1145/2640087.2644162
M3 - Conference contribution
AN - SCOPUS:84986001286
T3 - ACM International Conference Proceeding Series
BT - Proceedings of the 3rd ASE International Conference on Big Data Science and Computing, BIGDATASCIENCE 2014
PB - Association for Computing Machinery
T2 - 3rd ASE International Conference on Big Data Science and Computing, BIGDATASCIENCE 2014
Y2 - 4 August 2014 through 7 August 2014
ER -