The closest things we have to an AI


A Data Scientist’s take on defending Machine Learning models

source: https://pasadenaweekly.com/permanent-record-is-accused-spy-edward-snowdens-defense-brief-to-the-american-people/

Introduction


Support your argument with data!

source: https://assets.weforum.org/article/image/large_9M3Mr4G6HoxKpUnt1kMxQ2FOH5Z0dOy0aRWwumfWFmw.jpg

Data collection


Natural Language Processing is a catchy phrase these days

Relevance


Part II — Case Study

Natural Language Processing is a catchy phrase these days

This is Part 2 of a pair of tutorials on text pre-processing in python. In the first part, I laid out the theoretical foundations. In this second part, I’ll demonstrate the steps described in Part 1 in python on texts in different languages while discussing their differing effect arising from different structures of languages.

Relevance

  1. Removing stopwords
  2. Removing both extremely frequent and infrequent words
  3. Stemming…


Part I — Theoretical Background

Introduction


A vector map of Budapest. https://www.shutterstock.com/image-vector/black-white-vector-city-map-budapest-1035519106


A map of Budapest. Source: https://hebstreits.com/product/budapest-hungary-downtown-vector-map/


Source: https://aperiodical.com/tag/statistics/

Bagging: combining regression/classification trees


Source: https://aperiodical.com/tag/statistics/

Bootstrapping

Mor Kapronczay

ML&NLP Engineer @ Bold360AI. Text mining and predictive statistics enthusiast.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store