Moreno-Ortiz, Antonio.

Making Sense of Large Social Media Corpora Keywords, Topics, Sentiment, and Hashtags in the Coronavirus Twitter Corpus / [electronic resource] : by Antonio Moreno-Ortiz. - 1st ed. 2024. - XII, 192 p. 105 illus., 102 illus. in color. online resource.

Chapter 1 - Introduction -- Chapter 2 Managing large Twitter datasets -- Chapter 3. Keywords -- Chapter 4. Topics -- Chapter 5. Sentiment -- Chapter 6. Entities -- Chapter 7. Other social media semantic items: hashtags and emojis -- Chapter 8. Lessons learned.

Open Access

This open access book offers a comprehensive overview of available techniques and approaches to explore large social media corpora, using as an illustrative case study the Coronavirus Twitter corpus. First, the author describes in detail a number of methods, strategies, and tools that can be used to access, manage, and explore large Twitter/X corpora, including both user-friendly applications and more advanced methods that involve the use of data management skills and custom programming scripts. He goes on to show how these tools and methods are applied to explore one of the largest Twitter datasets on the COVID-19 pandemic publicly released, covering the two years when the pandemic had the strongest impact on society. Specifically, keyword extraction, topic modelling, sentiment analysis, and hashtag analysis methods are described, contrasted, and applied to extract information from the Coronavirus Twitter Corpus. The book will be of interest to students and researchers in fields that make use of big data to address societal and linguistic concerns, including corpus linguistics, sociology, psychology, and economics. Antonio Moreno-Ortiz is a lecturer at the Faculty of Arts of the University of Malaga, Spain.

9783031527197

10.1007/978-3-031-52719-7 doi


Linguistics--Methodology.
Social media.
Communication in medicine.
Applied linguistics.
Research Methods in Language and Linguistics.
Social Media.
Health Communication.
Applied Linguistics.

P29.52-41.22

407.21