Deep Dive Tutorial on Tweet Preprocessing

Name: Deep Dive Tutorial on Tweet Preprocessing
Start: 2022-04-06T21:00:00+03:00
End: 2022-04-06T22:00:00+03:00

Istanbul Twitter Developer Community

Apr 6, 2022, 6:00 – 7:00 PM

59

RSVPs

Do you want to go further from traditional text preprocessing approaches? This deep dive tutorial on Tweet preprocessing is for you. You will have a chance to learn how to handle informal social media text with new approaches in computational text analyses. We will show several Python libraries and some useful codes to apply to real data from Twitter. 

About this event

Do you want to go further from traditional text preprocessing approaches? This deep dive tutorial on Tweet preprocessing is for you. In our next event, we will host Assistant Professor Steven Wilson from Oakland University. We will examine how informal social media text poses challenges for traditional preprocessing methods and explore several approaches to complement state-of-the-art computational text analyses. We will show several Python libraries and some useful codes to apply to real data from Twitter.
Many traditional text preprocessing pipelines remove special characters, out of vocabulary words, emojis, URLs, special social media features like mentions and hashtags. In some cases,  these are replaced with standard tokens like <URL> or <OOV> instead of removing them. If your goal is just to examine the standard language, this would be enough for you. Is there a way to encode these removed parts of texts? This tutorial will show you alternative approaches you can take to handling these features.

Speaker

Steven R. Wilson

Oakland University

Asst. Prof.

Host

Akin Unver

Ozyegin University

Associate Professor

Organizer

Yunus Emre Tapan

Northeastern University

Istanbul Twitter Developer Community Lead

See bio

Contact Us