The value of using big data technologies in computational social science

Eugene Ch'ng

doi:10.1145/2640087.2644162

School of International Communications

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

Abstract

The discovery of phenomena in social networks has prompted renewed interest in the field. Data in social networks, however, can be massive, requiring scalable Big Data architecture. Conversely, research in Big Data needs the volume and velocity of social media data for testing its scalability. Moreover, appropriate data processing and mining of acquired datasets involve complex issues in the variety, veracity, and variability of the data, after which visualisation must occur before we can see our efforts come to fruition. This article presents topical, multimodal, and longitudinal social media datasets from the integration of various scalable open source technologies. The article details the process that led to the discovery of social information landscapes within the Twitter social network, highlighting the experience of dealing with social media datasets, using a funneling approach so that data becomes manageable. The article demonstrates the feasibility and value of using scalable open source technologies for acquiring massive, connected datasets for research in the social sciences.

Original language	English
Title of host publication	Proceedings of the 3rd ASE International Conference on Big Data Science and Computing, BIGDATASCIENCE 2014
Publisher	Association for Computing Machinery
ISBN (Electronic)	9781450328913
DOIs	https://doi.org/10.1145/2640087.2644162
Publication status	Published - 4 Aug 2014
Event	3rd ASE International Conference on Big Data Science and Computing, BIGDATASCIENCE 2014 - Beijing, China; Duration: 4 Aug 2014 → 7 Aug 2014

Publication series

Name	ACM International Conference Proceeding Series
Volume	04-07-August-2014

Conference

Conference	3rd ASE International Conference on Big Data Science and Computing, BIGDATASCIENCE 2014
Country/Territory	China
City	Beijing
Period	4/08/14 → 7/08/14

Keywords

Computational social science
Data mining
Open source
Social network analysis
Twitter

ASJC Scopus subject areas

Software
Human-Computer Interaction
Computer Vision and Pattern Recognition
Computer Networks and Communications

Access to Document

10.1145/2640087.2644162

The Value of Using Big Data Technologies in Computational Social Science (Accepted author manuscript, 375 KB; Licence: CC BY-NC)

Cite this

@inproceedings{ee58158793fa4ed78baa8ef81f35bc85,

title = "The value of using big data technologies in computational social science",

abstract = "The discovery of phenomena in social networks has prompted renewed interests in the field. Data in social networks however can be massive, requiring scalable Big Data architecture. Conversely, research in Big Data needs the volume and velocity of social media data for testing its scalability. Not only so, appropriate data processing and mining of acquired datasets involve complex issues in the variety, veracity, and variability of the data, after which visualisation must occur before we can see fruition in our efforts. This article presents topical, multimodal, and longitudinal social media datasets from the integration of various scalable open source technologies. The article details the process that led to the discovery of social information landscapes within the Twitter social network, highlighting the experience of dealing with social media datasets, using a funneling approach so that data becomes manageable. The article demonstrated the feasibility and value of using scalable open source technologies for acquiring massive, connected datasets for research in the social sciences.",

keywords = "Computational social science, Data mining, Open source, Social network analysis, Twitter",

author = "Eugene Ch'ng",

note = "Publisher Copyright: {\textcopyright} Copyright 2014 ACM. Copyright: Copyright 2017 Elsevier B.V., All rights reserved.; 3rd ASE International Conference on Big Data Science and Computing, BIGDATASCIENCE 2014 ; Conference date: 04-08-2014 Through 07-08-2014",

year = "2014",

month = aug,

day = "4",

doi = "10.1145/2640087.2644162",

language = "English",

series = "ACM International Conference Proceeding Series",

publisher = "Association for Computing Machinery",

booktitle = "Proceedings of the 3rd ASE International Conference on Big Data Science and Computing, BIGDATASCIENCE 2014",

}

Ch'ng, E 2014, The value of using big data technologies in computational social science. in Proceedings of the 3rd ASE International Conference on Big Data Science and Computing, BIGDATASCIENCE 2014., 2644162, ACM International Conference Proceeding Series, vol. 04-07-August-2014, Association for Computing Machinery, 3rd ASE International Conference on Big Data Science and Computing, BIGDATASCIENCE 2014, Beijing, China, 4/08/14. https://doi.org/10.1145/2640087.2644162

The value of using big data technologies in computational social science. / Ch'ng, Eugene.
Proceedings of the 3rd ASE International Conference on Big Data Science and Computing, BIGDATASCIENCE 2014. Association for Computing Machinery, 2014. 2644162 (ACM International Conference Proceeding Series; Vol. 04-07-August-2014).

TY - GEN

T1 - The value of using big data technologies in computational social science

AU - Ch'ng, Eugene

PY - 2014/8/4

Y1 - 2014/8/4

N2 - The discovery of phenomena in social networks has prompted renewed interests in the field. Data in social networks however can be massive, requiring scalable Big Data architecture. Conversely, research in Big Data needs the volume and velocity of social media data for testing its scalability. Not only so, appropriate data processing and mining of acquired datasets involve complex issues in the variety, veracity, and variability of the data, after which visualisation must occur before we can see fruition in our efforts. This article presents topical, multimodal, and longitudinal social media datasets from the integration of various scalable open source technologies. The article details the process that led to the discovery of social information landscapes within the Twitter social network, highlighting the experience of dealing with social media datasets, using a funneling approach so that data becomes manageable. The article demonstrated the feasibility and value of using scalable open source technologies for acquiring massive, connected datasets for research in the social sciences.

AB - The discovery of phenomena in social networks has prompted renewed interests in the field. Data in social networks however can be massive, requiring scalable Big Data architecture. Conversely, research in Big Data needs the volume and velocity of social media data for testing its scalability. Not only so, appropriate data processing and mining of acquired datasets involve complex issues in the variety, veracity, and variability of the data, after which visualisation must occur before we can see fruition in our efforts. This article presents topical, multimodal, and longitudinal social media datasets from the integration of various scalable open source technologies. The article details the process that led to the discovery of social information landscapes within the Twitter social network, highlighting the experience of dealing with social media datasets, using a funneling approach so that data becomes manageable. The article demonstrated the feasibility and value of using scalable open source technologies for acquiring massive, connected datasets for research in the social sciences.

KW - Computational social science

KW - Data mining

KW - Open source

KW - Social network analysis

KW - Twitter

UR - http://www.scopus.com/inward/record.url?scp=84986001286&partnerID=8YFLogxK

U2 - 10.1145/2640087.2644162

DO - 10.1145/2640087.2644162

M3 - Conference contribution

AN - SCOPUS:84986001286

T3 - ACM International Conference Proceeding Series

BT - Proceedings of the 3rd ASE International Conference on Big Data Science and Computing, BIGDATASCIENCE 2014

PB - Association for Computing Machinery

T2 - 3rd ASE International Conference on Big Data Science and Computing, BIGDATASCIENCE 2014

Y2 - 4 August 2014 through 7 August 2014

ER -
