Srijan Kumar @ Georgia Tech

Research Interests

Online malicious actors and dangerous content threaten public health, democracy, science, and society. To combat these threats, I build technological solutions, including accurate and robust models for early identification, prediction and attibution, as well as social mitigation solutions, such as empowering people to counter online harms. I have conducted the largest study of malicious sockpuppetry across nine platforms, ban evasion/recidivism on online platforms, and some of the earliest works on online misinformation. I am the one of the first to investigate of the reliability of web safety models used in practice, including Facebook's TIES and Twitter's Birdwatch. My work is one of the first to study whole-of-society solutions to mitigate online misinformation.

My research interests lie in comprehensively studying some of the biggest threats to Web Safety and Integrity from complementary angles:

"Multi-X" Detection of Misinformation and Malicious Actors: Multi-Platform, Multi-Modal, and Multi-Lingual

Enhancing the Adversarial Robustness and Trustworthiness of Web Models

Building Graphs and Networks Models for Accurate and Early Detection

Studying Recommender Systems' Impact and Building Responsible Recommender Systems

Developing AI-Powered Social Solutions to Combat Online Harms

In detail, my research interests spans the following topics:

(1) AI for Security: I develop methods to efficiently characterize the behavior of and detect both harmful content and malicious actors. Accurate characterization and early detection can greatly improve the safety, integrity, and well-being of online users, communities, and platforms. I have worked on the following type of bad behavior:

Misinformation and Information Integrity: Mis/dis/mal-information manipulate public opinion on societally-important topics and reduce trust in democratic processes (e.g., election misinformation), turn people against public health policies (e.g., COVID-19 vaccine and masking misinformation), and lead to disbelief in scientific evidence (e.g., climate change misinformation). I have conducted studies of misinformation across platforms, modalities, and languages [ICWSM 2022a, b], conducted the first study on the impact of misinformation on mental health [Scientific Reports, 2022], and studied how people spread misinformation on social media [IEEE BigData 2020]. Towards finding solutions to curb misinformation, I have shown that peer correction, i.e., people correcting others, accounts for 96% of all counter misinformation messages online [IEEE BigData 2020]. .

Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine Misinformation, The ACM Web Conference 2023
Examining the impact of sharing COVID-19 misinformation online on mental health, Scientific Reports 2022.
The Role of the Crowd in Countering Misinformation: A Case Study of the COVID-19 Infodemic, IEEE BigData 2020.
Cross-Platform Multimodal Misinformation: Taxonomy, Characteristics and Detection for Textual Posts and Videos, ICWSM 2022.
Overcoming Language Disparity in Online Content Classification with Multimodal Learning, ICWSM 2022.
HawkEye: A Robust Reputation System for Community-based Misinformation Detection, ASONAM 2021.
False Information on Web and Social Media: A Survey
Disinformation on the Web: Impact, Characteristics and Detection of Wikipedia Hoaxes, WWW 2016.

Group malicious behavior, fraud, deception, fakes, and more: People and state-sponsored organizations can deceptively use multiple accounts to manipulate public opinion and harass others. I conducted the first study of ban evasion [WWW 2022] and sockpuppetry across nine platforms, and created methods to detect them [WWW 2017]. This research was given the best paper award honorable mention at the WWW 2017 conference. Moreover, fake reviewers and reviews on e-commerce platform lead to economic loss and reduced trust of the platform. I devised a graph-based method to detect fake reviewers on such platforms [ACM WSDM 2018]. This system has been used in production at Flipkart. I have also created behavior-based models to efficiently detect vandalism on Wikipedia [ACM SIGKDD 2015], trolling on social media [ASONAM 2014], deception in games [AL 2015], and lying in video-based conversations [ICWSM 2021].

Characterizing, Detecting, and Predicting Online Ban Evasion, WWW 2022.
An Army of Me: Sockpuppets in Online Discussion Communities, WWW 2017.
Rev2: Fraudulent User Prediction in Rating Platforms, ACM WSDM 2018.
Deception Detection in Group Video Conversations using Dynamic Interaction Networks, AAAI ICWSM 2021.
VEWS: A Wikipedia Vandal Early Warning System, ACM SIGKDD 2015.
Linguisitic Harbingers of Betrayal: A Case Study on an Online Strategic Game, ACL 2015.
Accurately Detecting Trolls in Slashdot Zoo via Decluttering, ASONAM 2014.

Online hate speech and content: Online hate typically targets marginalized communities, deteriorates the mental health of victims, and has even led to real-world crimes. I have conducted the longest study of anti-Asian hate speech during the COVID-19 pandemic [ASONAM 2021a] and how communities conflict with one another [WWW 2018].

Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media during the COVID-19 Crisis, ASONAM 2021.
Community Interaction and Conflict on the Web, WWW 2018

(2) Secure, Robust, and Responsible AI: Machine learnind and deep learning models are being used for high-stakes tasks. However, their trustworthiness, reliability, and robustness against manipulation by smart adversaries and to unintentional changes in data is not known. I have explored how adversaries can manipulate recommender systems for their gains. I have conducted the first investigate to quantify the trustworthiness of Facebook's TIES deep learning-based fraud detection models [ACM SIGKDD 2021], recommender systems [ACM CIKM 2022], graph-based models [ACM CIKM 2021b], and community-driven counter misinformation platform used at Twitter's Birdwatch [ASONAM 2021b].

Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning , ACL 2023.

Adversarial Robustness of Prompt-based Few-Shot Learning for Natural Language Understanding, ACL 2023.

Temporal Dynamics-Aware Adversarial Attacks on Discrete-Time Dynamic Graph Models, KDD 2023.

Imperceptible Adversarial Attacks on Discrete-Time Dynamic Graph Models, TGL@NeurIPS, 2022.

Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions, EMNLP 2022.

PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-based Classification Models, ACM SIGKDD 2021.

Rank List Sensitivity of Recommender Systems to Interaction Perturbations, ACM CIKM 2022.

HawkEye: A Robust Reputation System for Community-based Misinformation Detection, ASONAM 2021.

Evaluating Graph Vulnerability and Robustness using TIGER, ACM CIKM 2021.

(3) Graphs and Networks: Modeling and predicting over large-scale networks is crucial to mine actionable insights from large inter-connect data, including social networks, e-commerce networks, knowledge graphs, spatio-temporal networks, and interaction networks. My relevant works include:

Representation Learning in Continuous-Time Dynamic Signed Networks, CIKM 2023.

Temporal Dynamics-Aware Adversarial Attacks on Discrete-Time Dynamic Graph Models, KDD 2023.

Imperceptible Adversarial Attacks on Discrete-Time Dynamic Graph Models, TGL@NeurIPS, 2022.

Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks, ACM SIGKDD 2019.

Deception Detection in Group Video Conversations using Dynamic Interaction Networks., ICWSM 2021.

Rev2: Fraudulent User Prediction in Rating Platforms, ACM WSDM 2018.

Higher-Order Label Homogeneity and Spreading in Graphs, WWW 2020.

Edge Weight Prediction in Weighted Signed Networks, IEEE ICDM 2016.

(4) Recommender systems and Behavior Modeling: Recommender systems power much of the content and products that we see online. I develop user-based and graph-based efficient recommender systems that are accurate, scalable, and trustworthy [ACM CIKM 2021, ACM SIGKDD 2019]. I also investigate how malicious actors can manipulate deep learning-powered recommender systems for their ulterior motives. I create new techniques to quantify this robustness and innovate new adversarially-robust deep recommender system architectures, to usher an era of trustworthy recommendations. Relevant works include:

Predicting Human Behavior: The Next Frontiers, Science 2017.

Rank List Sensitivity of Recommender Systems to Interaction Perturbations, ACM CIKM 2022.

M2TRec: Metadata-aware Multi-task Transformer for Large-scale and Cold-start free Session-based Recommendations, ACM RecSys 2022.

Influence-guided Data Augmentation for Neural Tensor Completion, ACM CIKM 2021.

Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks, ACM SIGKDD 2019.

Publications

For my complete list of publications, please refer to my Google Scholar profile.

Highlights (selected from the full list below)

Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine Misinformation [PDF] NEW!
Bing He, Mustaque Ahamad, Srijan Kumar
WebConf'23 - The ACM Web Conference, 2023
[Project page with data and code] [Demo Video] Best Paper Award nominee

Representation Learning in Continuous-Time Dynamic Signed NetworksNEW!
Kartik Sharma, Mohit Raghavendra, Yeon-Chang Lee, M Anand Kumar, Srijan Kumar
CIKM'23 - ACM CIKM 2023

Temporal Dynamics-Aware Adversarial Attacks on Discrete-Time Dynamic Graph Models NEW!
Kartik Sharma, Rakshit Trivedi, Rohit Sridhar, Srijan Kumar
KDD'23 - ACM SIGKDD 2023

Predicting Information Pathways Across Online CommunitiesNEW!
Yiqiao Jin, Yeon-Chang Lee, Kartik Sharma, Meng Ye, Karan Sikka, Ajay Divakaran, Srijan Kumar
KDD'23 - ACM SIGKDD 2023

Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning NEW!
Shivaen Ramshetty, Gaurav Verma, Srijan Kumar
ACL'23 - ACL, 2023

Examining the impact of sharing COVID-19 misinformation online on mental health [PDF] NEW!
Gaurav Verma, Ankur Bhardwaj, Talayeh Aledavood, Munmun De Choudhury, Srijan Kumar
Scientific Reports - Scientific Reports 12, 8045 (2022)

Rank List Sensitivity of Recommender Systems to Interaction Perturbations NEW!
Sejoon Oh, Berk Ustun, Julian McAuley, Srijan Kumar
ACM CIKM 2022 - 31st ACM International Conference on Information and Knowledge Management
[Project page with data and code]

Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions NEW!
Gaurav Verma, Vishwa Vinay, Ryan Rossi, Srijan Kumar
EMNLP 2022 - The 2022 Conference on Empirical Methods in Natural Language Processing
[Project page with data and code]

Cross-Platform Multimodal Misinformation: Taxonomy, Characteristics and Detection for Textual Posts and Videos NEW!
Nicholas Micallef, Marcelo Sandoval-Castaneda, Mustaque Ahamad, Adi Cohen, Srijan Kumar, Nasir Memon
AAAI ICWSM 2022 - The AAAI 16th International Conference on Web and Social Media

Characterizing, Detecting, and Predicting Online Ban Evasion. [PDF] NEW!
Manoj Niverthi, Gaurav Verma, Srijan Kumar
ACM WWW 2022 - The ACM Web Conference, 2022
[Project page with data and code]

Overcoming Language Disparity in Online Content Classification with Multimodal Learning NEW!
Gaurav Verma, Rohit Mujumdar, Jay Wang, Munmun De Choudhury, Srijan Kumar
AAAI ICWSM 2022 - The AAAI 16th International Conference on Web and Social Media
[Project page with data and code]

PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-based Classification Models. [PDF]
Bing He, Mustaque Ahamad, Srijan Kumar
ACM SIGKDD 2021 – 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
[Project page with data and code] [Link to presentation (pptx)] [Link to presentation (pdf)]

Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media during the COVID-19 Crisis
Bing He, Caleb Ziems, Sandeep Soni, Naren Ramakrishnan, Diyi Yang, Srijan Kumar
IEEE/ACM ASONAM 2021 – The 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
[Project page with data and code]

The Role of the Crowd in Countering Misinformation: A Case Study of the COVID-19 Infodemic. [PDF]
Nicholas Micallef*, Bing He*, Srijan Kumar, Mustaque Ahamad, Nasir Memon (* = equal contribution)
IEEE Big Data 2020 -- Full Paper, research track (top 15%)
[Project page with data and code]

Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks [PDF]
Srijan Kumar, Xikun Zhang, Jure Leskovec
ACM SIGKDD, 2019 – 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2019
[Github page with code and data] [Slides] [Short explanation video]

List of all publications

Conference, Journal, and Other Publications

Representation Learning in Continuous-Time Dynamic Signed NetworksNEW!
Kartik Sharma, Mohit Raghavendra, Yeon-Chang Lee, M Anand Kumar, Srijan Kumar
CIKM'23 - ACM CIKM 2023
Temporal Dynamics-Aware Adversarial Attacks on Discrete-Time Dynamic Graph Models NEW!
Kartik Sharma, Rakshit Trivedi, Rohit Sridhar, Srijan Kumar
KDD'23 - ACM SIGKDD 2023
Predicting Information Pathways Across Online CommunitiesNEW!
Yiqiao Jin, Yeon-Chang Lee, Kartik Sharma, Meng Ye, Karan Sikka, Ajay Divakaran, Srijan Kumar
KDD'23 - ACM SIGKDD 2023
Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning NEW!
Shivaen Ramshetty, Gaurav Verma, Srijan Kumar
ACL'23 - ACL, 2023
Adversarial Robustness of Prompt-based Few-Shot Learning for Natural Language Understanding NEW!
Venkata Prabhakara Sarath Nookala, Gaurav Verma, Subhabrata Mukherjee, Srijan Kumar
ACL Findings'23 - ACL Findings, 2023
Advances in AI for web integrity, equity, and well-being NEW!
Srijan Kumar
Frontiers in Big Data, 2023 - Rising Stars in Data Science research topic
Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine Misinformation [PDF] NEW!
Bing He, Mustaque Ahamad, Srijan Kumar
WebConf'23 - The ACM Web Conference, 2023
[Project page with data and code] [Demo Video] Best Paper Award nominee
Characterizing and Predicting Social Correction on Twitter [PDF] NEW!
Yingchen Ma, Bing He, Nathan Subrahmanian, Srijan Kumar
WebSci'23 - ACM Web Science 2023
[Project page with data and code]
Imperceptible Adversarial Attacks on Discrete-Time Dynamic Graph Models [PDF] NEW!
Kartik Sharma, Rakshit Trivedi, Rohit Sridhar, Srijan Kumar
TGL@NeurIPS - Temporal Graph Learning workshop at NeurIPS (2022)
Examining the impact of sharing COVID-19 misinformation online on mental health [PDF] NEW!
Gaurav Verma, Ankur Bhardwaj, Talayeh Aledavood, Munmun De Choudhury, Srijan Kumar
Scientific Reports - Scientific Reports 12, 8045 (2022)
Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions [PDF] NEW!
Gaurav Verma, Vishwa Vinay, Ryan Rossi, Srijan Kumar
EMNLP 2022 - The 2022 Conference on Empirical Methods in Natural Language Processing
[Project page with data and code]
Rank List Sensitivity of Recommender Systems to Interaction Perturbations NEW!
Sejoon Oh, Berk Ustun, Julian McAuley, Srijan Kumar
ACM CIKM 2022 - 31st ACM International Conference on Information and Knowledge Management
[Project page with data and code]
Implicit Session Contexts for Next-Item Recommendations
Sejoon Oh, Ankur Bharadwaj, Jongseok Han, Sungchul Kim, Ryan Rossi, Srijan Kumar
ACM CIKM 2022 - 31st ACM International Conference on Information and Knowledge Management - short paper
[Project page with data and code]
Overcoming Language Disparity in Online Content Classification with Multimodal Learning [PDF]
Gaurav Verma, Rohit Mujumdar, Jay Wang, Munmun De Choudhury, Srijan Kumar
AAAI ICWSM 2022 - The AAAI 16th International Conference on Web and Social Media
[Project page with data and code]
Cross-Platform Multimodal Misinformation: Taxonomy, Characteristics and Detection for Textual Posts and Videos [PDF]
Nicholas Micallef, Marcelo Sandoval-Castaneda, Mustaque Ahamad, Adi Cohen, Srijan Kumar, Nasir Memon
AAAI ICWSM 2022 - The AAAI 16th International Conference on Web and Social Media
Characterizing, Detecting, and Predicting Online Ban Evasion. [PDF]
Manoj Niverthi*, Gaurav Verma*, Srijan Kumar
ACM WWW 2022 - The ACM Web Conference, 2022
[Project page with data and code]
M2TRec: Metadata-aware Multi-task Transformer for Large-scale and Cold-start free Session-based Recommendations
Walid Shalaby, Sejoon Oh, Amir Afsharinejad, Xiquan Cui, Srijan Kumar
ACM RecSys 2022 - The ACM Conference Series on Recommender Systems (LBR), 2022
M2P2: Multimodal Persuasion Prediction using Adaptive Fusion [PDF]
Chongyang Bai, Haipeng Chen, Srijan Kumar, Jure Leskovec, V.S. Subrahmanian
IEEE TMM 2021 – IEEE Transactions on Multimedia
[Project page with data and code]
PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-based Classification Models. [PDF]
Bing He, Mustaque Ahamad, Srijan Kumar
ACM SIGKDD 2021 – 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
[Project page with data and code] [Link to presentation (pptx)] [Link to presentation (pdf)]
Influence-guided Data Augmentation for Neural Tensor Completion. [PDF]
Sejoon Oh, Sungchul Kim, Ryan Rossi, Srijan Kumar
ACM CIKM 2021 – 30th ACM International Conference on Information and Knowledge Management, 2021
[Project page with data and code]
Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media during the COVID-19 Crisis
Bing He, Caleb Ziems, Sandeep Soni, Naren Ramakrishnan, Diyi Yang, Srijan Kumar
IEEE/ACM ASONAM 2021 – The 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
[Project page with data and code]
HawkEye: A Robust Reputation System for Community-based Misinformation Detection. [PDF]
Rohit Mujumdar, Srijan Kumar
IEEE/ACM ASONAM 2021 – The 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Evaluating Graph Vulnerability and Robustness using TIGER.
Scott Freitas, Diyi Yang, Srijan Kumar, Hanghang Tong, Polo Chau
ACM CIKM 2021 – 30th ACM International Conference on Information and Knowledge Management, 2021
[Project page]
Deception Detection in Group Video Conversations using Dynamic Interaction Networks. [PDF]
Srijan Kumar, Chongyang Bai, VS Subrahmanian, Jure Leskovec
AAAI ICWSM 2021 – 15th International AAAI Conference on Web and Social Media, 2021
[Project page with data]
The Role of the Crowd in Countering Misinformation: A Case Study of the COVID-19 Infodemic. [PDF]
Nicholas Micallef*, Bing He*, Srijan Kumar, Mustaque Ahamad, Nasir Memon (* = equal contribution)
IEEE Big Data 2020 -- Full paper in research track
[Project page with data and code]
Higher-Order Label Homogeneity and Spreading in Graphs. [PDF]
Dhivya Eswaran, Srijan Kumar, Christos Faloutsos
ACM Web (WWW), 2020 – The ACM Web Conference, 2020
[Github page with code and data]
User Engagement with Digital Deception
Maria Glenski, Svitlana Volkova, Srijan Kumar
Peer-reviewed book chapter in 'Disinformation, Misinformation, and Fake News in Social Media 2020' by Springer.
Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks [PDF]
Srijan Kumar, Xikun Zhang, Jure Leskovec
ACM SIGKDD, 2019 – 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2019 [Oral presentation, research track (top 9%)]
New dataset released: Account blocks on Wikipedia and Reddit. Link below.
[Github page with code and data] [Slides] [Short explanation video]
Included in the curriculum at: UCSD, Purdue University, LMU Munchen.
Predicting the Visual Focus of Attention Prediction in Multi-person Discussion Videos [PDF]
Chongyang Bai, Srijan Kumar, Jure Leskovec, Miriam Metzger, Jay Nunamaker, V.S. Subrahmanian
IJCAI, 2019 – International Joint Conference on Artificial Intelligence, 2019
New dataset released: 62 dynamic networks of who-interacts-with-whom. Link below.
[Dataset] [Project page with demo]
Predicting Dominance in Multi-person Videos [PDF]
Chongyang Bai, Maksim Bolonkin, Srijan Kumar, Jure Leskovec, Judee Burgoon, Norah Dunbar, V.S. Subrahmanian
IJCAI, 2019 – International Joint Conference on Artificial Intelligence, 2019
[Dataset] [Project page with demo]
Community Interaction and Conflict on the Web [PDF]
Srijan Kumar, William L. Hamilton, Jure Leskovec, Dan Jurafsky
ACM Web (WWW), 2018 – The ACM Web Conference, 2018
New dataset released: Reddit community-to-community interlinks and harassment attacks
[Project page: Data and Code] [Presentation slides (pptx)] [Presentation slides (pdf)]

Included in the curriculum at: University of Waterloo

Press: Russian spam accounts are still a big problem for Reddit (Engadget), What Reddit Tells Us About Political Coalitions and Conflicts (The Atlantic), Most Reddit battles are started by 1 percent of communities (Engadget), Tiny percent of Reddit communities spark majority of conflicts (CNET), One Percent of Subreddits Are Responsible for Most of the Raids on Reddit (VICE), and more by Inverse, TheNextWeb, theregister.co.uk
Rev2: Fraudulent User Prediction in Rating Platforms [PDF]
Srijan Kumar, Bryan Hooi, Disha Makhija, Mohit Kumar, Christos Faloutsos, V.S. Subrahmanian
ACM WSDM, 2018 – 11th ACM International Web Search and Data Mining Conference, 2018
New dataset released: Fraudsters on Amazon, Bitcoin networks, and Epinions.
[Project page: Data and Codes] [Presentation slides (pptx)] [Poster]
Included in the curriculum at: Stanford University
False Information on Web and Social Media: A Survey [PDF]
Srijan Kumar, Neil Shah
Invited book chapter in Social Media Analytics: Advances and Applications, CRC Press, 2018
Breaking Bad: Forecasting Adversarial Android Bad Behavior
S. Li*, Srijan Kumar*, Tudor Dumitras, and V.S. Subrahmanian. (* indicates equal contribution).
CyberSecurity, 2018 - From Database to Cybersecurity, 2018.
Measuring the Evolution of a Scientific Field through Citation Frames. [PDF]
David Jurgens, Srijan Kumar, Raine Hoover, Dan McFarland, Dan Jurafsky
TACL, 2018 – Transactions of the Association for Computational Linguistics, 2018
[Project page with data] [Code]
Demand-Driven Single- and Multitarget Mixture Preparation Using Digital Microfluidic Biochips.
Shalu, Srijan Kumar, A. Singla, Sudip Roy, K. Chakrabarty, P. P. Chakrabarti, and B. B. Bhattacharya.
TODAES, 2018 - ACM Transactions on Design Automation of Electronic Systems.
An Army of Me: Sockpuppets in Online Discussion Communities. [PDF]
Srijan Kumar, Justin Cheng, Jure Leskovec, V.S. Subrahmanian.
ACM Web (WWW), 2017 – 26th International World Wide Web (The ACM Web) Conference, 2017

Best Paper Award Honorable Mention

[Presentation Slides]

Included in the curriculum at: University of Michigan, Virginia Tech University, Stanford University, Penn State University, Saarland University, and University of Freiburg.

Documentary: Familiar Shapes by Heather D. Freeman

Press: Sock puppet accounts unmasked by the way they write and post (New Scientist), Tool unmasks online puppeteers (New Scientist, print version), Spotting sockpuppets with science (TechCrunch), Sock Puppet Accounts on the Internet Getting You Down? Here’s How to Spot Them (WOWscience)
Predicting Human Behavior: The Next Frontiers.
V.S. Subrahmanian, Srijan Kumar.
Science, 2017 – Science, vol. 355, issue 6324, pp. 489, 2017
Spectral Lens: Explainable Diagnostics, Tools and Discoveries in Directed, Weighted Graphs
Sebastian Goebl, Srijan Kumar, Christos Faloutsos
IEEE ICDM, 2017 – IEEE International Conference on Data Mining, 2017
Data-Driven Approaches towards Malicious Behavior Modeling
Meng Jiang, Srijan Kumar, V.S. Subrahmanian, Christos Faloutsos
ACM SIGKDD, 2017 (Tutorial) – Tutorial at 23rd ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2017
Antisocial Behavior on the Web: Characterization and Detection
Srijan Kumar, Justin Cheng, Jure Leskovec
ACM Web (WWW), 2017 (Tutorial) – 26th International World Wide Web (The ACM Web) Conference, 2017
Edge Weight Prediction in Weighted Signed Networks. [PDF]
Srijan Kumar, Francesca Spezzano, V.S. Subrahmanian, Christos Faloutsos
IEEE ICDM, 2016 – IEEE International Conference on Data Mining, 2016

Top 10 most cited papers of ICDM in the last 5 years. [Link]

New datasets released: Weighted, signed, temporal networks from Bitcoin OTC and Bitcoin Alpha.
[Project page: Code] [Bitcoin OTC] [Bitcoin Alpha]
Disinformation on the Web: Impact, Characteristics and Detection of Wikipedia Hoaxes. [PDF]
Srijan Kumar, Robert West, Jure Leskovec.
ACM Web (WWW), 2016 – 25th International World Wide Web (The ACM Web) Conference, 2016
New datasets released: Hoax articles on Wikipedia
[Project page: Data and Code]

Included in the curriculum at: UIUC, University of Waterloo, McGill University, Texas A&M University, University of Hawaii, University of Freiburg, Leibniz University, Hannover, University of Waterloo, University of Alberta, University of Wellington, New Zealand, and Bari BigData winter school 2017.

Press: Don't Ask Wikipedia To Cure the Internet (WIRED), Can Wikipedia Solve YouTube's Conspiracy Theory Problem? (Motherboard)
Structure and Dynamics of Signed Citation Networks. [PDF]
Srijan Kumar.
ACM Web (WWW), 2016 companion - 25th International World Wide Web (The ACM Web) Conference companion, 2016.
[Project page: Data and Code]
Identifying Malicious Actors on Social Media
Srijan Kumar, Francesca Spezzano, V.S. Subrahmanian
IEEE/ACM ASONAM, 2016 tutorial – Advances in Social Network Analysis and Mining, 2016
Stubborn Mining: Generalizing Selfish Mining and Combining with an Eclipse Attack. [PDF]
Kartik Nayak*, Srijan Kumar*, Andrew Miller and Elaine Shi. (* indicates equal contribution).
Euro S&P, 2016 - IEEE European Symposium on Security and Privacy, 2016.
VEWS: A Wikipedia Vandal Early Warning System. [PDF]
Srijan Kumar, Francesca Spezzano, V. S. Subrahmanian.
ACM SIGKDD, 2015 – 21th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2015
New datasets released: Vandals and vandalism on Wikipedia
[Project page: Data and Code]
Linguisitic Harbingers of Betrayal: A Case Study on an Online Strategic Game. [PDF]
Vlad Niculae, Srijan Kumar, Jordan Boyd-Graber, Cristian Danescu-Niculescu-Mizil.
ACL, 2015 – 51st Conference of the Association for Computational Linguistics, 2015
New datasets released: Deception in Diplomacy, a conversation-based online game
[Project page: Data and Code]
Press: Should you worry about people who are too polite? (CNN), When Diplomacy Leads to Betrayal (The Wall Street Journal), Here’s a Good Reason to Be Wary of Overly Polite People (New York Magazine) and more here.
Layout-Aware Mixture Preparation of Biochemical Fluids on Application-Specific Digital Microfluidic Biochips. [PDF]
Sudip Roy, Partha P. Chakrabarti, Srijan Kumar, Krishnendu Chakrabarty, Bhargab B. Bhattacharya.
ACM TODAES, 2015 - ACM Transactions on Design Automation of Electronic Systems, 2015.
Demand-Driven Mixture Preparation and Droplet Streaming using Digital Microfluidic Biochips. [PDF]
Sudip Roy, Srijan Kumar, P. P. Chakrabarty, B. B. Bhattacharya and K. Chakrabarty.
ACM/IEEE DAC, 2014 - ACM/IEEE Design Automation Conference, 2014.
Accurately Detecting Trolls in Slashdot Zoo via Decluttering. [PDF]
Srijan Kumar, Francesca Spezzano, V. S. Subrahmanian.
IEEE/ACM ASONAM, 2014 – Advances in Social Network Analysis and Mining, 2014
New datasets released: Trolls in Slashdot, a signed social networking platform
[Project page: Data and Code]
Automatic Classification and Analysis of Interdisciplinary fields in Computer science. [PDF]
T. Chakrabarty, Srijan Kumar, D. Reddy, Suhansanu Kumar, Niloy Ganguly and Animesh Mukherjee.
ASE/IEEE SocialCom, 2013 - ASE/IEEE International Conference on Social Computing.
Routing-Aware Resource Allocation for Mixture Preparation in Digital Microfluidic Biochips. [PDF]
Sudip Roy, P. P. Chakrabarty, Srijan Kumar, B. B. Bhattacharya and K. Chakrabarty.
ISVLSI, 2013 - IEEE International Symposium on VLSI, 2013.
Efficient Mixture Preparation of Biochemical Fluids using Digital Microfluidic Biochip. [PDF]
Srijan Kumar, Sudip Roy, P. P. Chakrabarti, B. B. Bhattacharya and K. Chakrabarty.
IEEE DDECS, 2013 - Sixteenth IEEE Symposium on Design and Diagnostics of Electronic Circuits and Systems, 2013.

For all publications, please see my Google Scholar.

Group

The CLAWS - Computational Data Science Lab for the Web and Social Media - at Georgia Tech develops data science and applied machine learning solutions to solve the most pressing challenges facing the users, communities, and platforms on web and social media. We focus on pertinent online threats of malicious actors and dangerous content. We investigate the social and technological factors behind these issues and innovate multi-pronged solutions to overcome these challenges.

Sponsors: We are grateful for grants and gifts from NSF (CNS-2154118, IIS-2027689, ITE-2137724, ITE-2230692, CNS-2239879), DARPA, CDC, IDEaS, The Home Depot, Adobe, Google, Facebook, and Microsoft.

Postdocs:

Dr. Yeon-Chang Lee: graphs, recommender systems (visiting postdoc)
Dr. Yibo Hu: social networks

Data/Research Engineers:

Andrew Zhao: Data Engineer
Ananya Malik: Data Engineer

Ph.D. Students:

Sejoon Oh: adversarial ML, recommender systems; ML@GT Fellow, Twitch PhD Fellowship finalist, Kwanjeong Educational Foundation Fellow
Bing He: misinformation (co-advised with Prof. Mustaque Ahamad)
Gaurav Verma: misinformation, multimodality; Snap PhD Research Fellow (2022), Adobe PhD Fellowship finalist (2022), College of Computing Rising Star Doctoral Student Research Awardee (2022)
Kartik Sharma: graphs and networks
Yiqiao Jin: social networks, misinformation
Eric Ma: counter misinformation
Cuong Nguyen: social media and social networks

Masters and Undergraduate Students:

Rynaa Grover: MS
Mehul Soni: MS
Manoj Niverthi: MS, BS 2022
Yingchen (Eric) Ma: MS
Nigel Neo: MS
Adhira Choudhury: HS, BS
Ethan Kim: BS
Nathan Subrahmanian: BS

Alumni:

Ananya Malik: MS -> Data Engineer at Georgia Techa
Sara Abdali: postdoc 2022 -> Microsoft
Andy Chung: BS; NSF CS4Grad fellowship awardee
Shivaen Ramshetty: MS 2022 -> Toyota Research Corporation
Sarath Nookala: MS 2022 -> Meta
Aaron Reich: MS 2022 -> Co-founder, Alpine Health Systems
Harshal Gajjar: MS 2022 -> C3AI
Jongseok Han: MS 2022 -> Walmart
Kritika Gupta: MS
Manoj Niverthi: BS 2022 -> MS at Georgia Tech
Sivagami Nambi: MS 2022 -> Amazon
Mohit Raghavendra: BS 2022 -> MS at Georgia Tech
Soyoung Oh: MS 2022 -> Ph.D. at EPFL
Zhen Jiang: BS 2022 -> MS at UC Berkeley
Rohit Sridhar: MS 2022 -> Ph.D. at Georgia Tech
Adhira Choudhury: HS 2022 -> BS at Georgia Techa
Vivek Anand: MS 2023
Matthew Yang: MS 2021
Bharat Mamidibathula: MS 2021 -> Pinterest
Shreeshaa Kulkarni: MS 2021 -> Facebook
Ankur Bhardwaj: MS 2021 -> Walmart
Rohit Mujumdar: MS 2021 -> NCR
Andrew Wang: BS
Zan Huang: MS 2020 -> Kuaishou
Sunny Dhamnani: MS -> Facebook

Datasets

Malicious, fake, fraud behavior and content:

Dynamic networks:

Web of Trust Network: Bitcoin OTC platform. Signed, weighted, temporal.
Web of Trust Network: Bitcoin Alpha platform Signed, weighted, temporal.
Reddit: Community-to-community link network. Temporal, weighted.
Wikipedia: User to page edit. Temporal, weighted, attributed.
Reddit: User to subreddit posting activity.Temporal, weighted, attributed.
MOOC platform: Student activity. Temporal.
LastFM: User activity (listening to songs). Temporal.

Processed datasets:

Reddit: User and subreddit embeddings.

Service

Paper Reviewing and Program Committee:

Associate Editor of Frontiers in Big Data – Data Science, 06/2022 - present
Associate Editor of ACM SIGKDD Explorations, 07/2022 - present

Proposal Reviewing for NSF and other Agencies:

NSF SaTC, 2022
NSF CISE III, 2022, 2019
NSF SBIR/STTR, 2020, 2021, 2022 (x2)
NSF CISE NeTS CRII (ad-hoc reviewer), 2019

Journal Reviewing

PNAS, 2023
Nature Machine Intelligence, 2023 (x2)
ACM Transactions on Knowledge Discovery from Data, 2022 (x2)
EPJ Data Science, 2022, 2021
ACM Transactions on Intelligent Systems and Technology (TIST), 2022, 2017, 2016, 2015
ACM Computing Surveys, 2021
Network Science in 2020
IEEE Internet Computing in 2020
Journal of Machine Learning Research, 2019
Data Mining and Knowledge Discovery, 2019
Social Network Analysis and Mining, 2018
Online Information Review, 2018
IEEE Transactions on Computational Social Systems (TCSS), 2018, 2016, 2015
Science, 2017
IEEE Intelligent Systems (IS), 2017, 2016, 2015
Information Systems Frontiers in 2017

Senior Program Committee/Area Chair:

Senior PC, AAAI International Conference on Artificial Intelligence (AAAI), 2023, 2021
Area Chair, ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT), 2023
Senior PC, AAAI International Conference on Web and Social Media (ICWSM), 2023, 2022
Senior PC, Web and Society Track of The Web Conference (WWW), 2023
Area Chair, ACM SIGKDD, 2023

Program Committee:

ACM Web Search and Data Mining Conference (WSDM), 2022, 2019, 2018
AAAI International Conference on Web and Social Media (ICWSM), 2021, 2019, 2018
The Web Conference (WWW), 2020, 2019
ACM Conference on Hypertext and Social Media (HT), 2019
AAAI International Conference on Artificial Intelligence (AAAI), 2019
SIAM International Conference on Data Mining (SDM), 2019
European Symposium on Societal Challenges in Computational Social Science (EuroCSS), 2019
ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2018, 2017
World Wide Web Conference (WWW), 2018
IEEE Conference on Information and Knowledge Management (CIKM), 2018, 2017
IEEE/ACM International Conference Series on Social Network Analysis and Mining (ASONAM), 2018, 2016
IEEE International Conference on Data Science and Advanced Analytics (DSAA), 2018
International Joint Conference on Artificial Intelligence (IJCAI-ECAI), 2018

External Mentorship:

Mentor at the Doctoral Consortium at the ACM WSDM Conference, 2022
Mentor at the Ph.D. Symposium at ACM CIKM Conference, 2022
Mentor at the Undergraduate Consortium at the ACM SIGKDD Conference, 2022

Teaching

CSE 8803 DSN: Data Science for Social Networks (Fall 2023)
CSE 6240: Web Search and Text Mining (Spring 2023)
CSE 8803 DSN: Data Science for Social Networks (Fall 2022)
CSE 6240: Web Search and Text Mining (Spring 2022)
CSE 8803 DSN: Data Science for Social Networks (Fall 2021)
CSE 6240: Web Search and Text Mining (Spring 2021)
CSE 6240: Web Search and Text Mining (Spring 2020)

Honors and Awards

Recent News