Deep Dive Tutorial on Tweet Preprocessing

Istanbul Twitter Developer Community

Apr 6, 2022, 6:00 – 7:00 PM

59
RSVPs

Do you want to go further from traditional text preprocessing approaches? This deep dive tutorial on Tweet preprocessing is for you. You will have a chance to learn how to handle informal social media text with new approaches in computational text analyses. We will show several Python libraries and some useful codes to apply to real data from Twitter.

About this event

Do you want to go further from traditional text preprocessing approaches? This deep dive tutorial on Tweet preprocessing is for you. In our next event, we will host Assistant Professor Steven Wilson from Oakland University. We will examine how informal social media text poses challenges for traditional preprocessing methods and explore several approaches to complement state-of-the-art computational text analyses. We will show several Python libraries and some useful codes to apply to real data from Twitter.

Many traditional text preprocessing pipelines remove special characters, out of vocabulary words, emojis, URLs, special social media features like mentions and hashtags. In some cases,  these are replaced with standard tokens like <URL> or <OOV> instead of removing them. If your goal is just to examine the standard language, this would be enough for you. Is there a way to encode these removed parts of texts? This tutorial will show you alternative approaches you can take to handling these features.

Speaker

  • Steven R. Wilson

    Oakland University

    Asst. Prof.

Host

  • Akin Unver

    Ozyegin University

    Associate Professor

Organizer

  • Yunus Emre Tapan

    Northeastern University

    Istanbul Twitter Developer Community Lead

Contact Us