deutsch-nlp

September 11, 2019

Overview

I took five German authors from the Kaggle German Literature dataset and trained a Multinomial Naive Bayes classifier on TF-IDF features. The resulting pipeline takes in German text and predicts which of the five authors wrote it.
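A minimal sketch of what such a pipeline could look like, assuming scikit-learn (this README does not name the library). The sentences and the `author_a`/`author_b` labels are toy placeholders rather than data from the Kaggle set, and `predict_author` is a hypothetical helper name.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Toy placeholder corpus (NOT the Kaggle data): a few German sentences
# with hypothetical author labels, just enough to exercise the pipeline.
texts = [
    "Der Wald war dunkel und der Wind sang in den Tannen.",
    "Im Wald sang ein Vogel, und der Wind trug das Lied fort.",
    "Das Gericht tagte in einem engen Zimmer unter dem Dach.",
    "Vor dem Gericht stand ein Beamter und prüfte die Akten.",
]
authors = ["author_a", "author_a", "author_b", "author_b"]

# TF-IDF turns each text into a weighted bag-of-words vector;
# Multinomial Naive Bayes then models those weights per author.
model = make_pipeline(TfidfVectorizer(lowercase=True), MultinomialNB())
model.fit(texts, authors)


def predict_author(text: str) -> str:
    """Return the predicted author label for one German text."""
    return str(model.predict([text])[0])


prediction = predict_author("Das Gericht prüfte die Akten.")
print(prediction)
```

With the real dataset, `texts` and `authors` would be loaded from the corpus files, and a held-out split would be needed to measure per-author recall.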

Resources

To Do

  • add translation API to pipeline
  • improve recall for Kafka texts