cNLP Basic NLP text processing tools implemented in C TODO n-grams Bag of Words TF-IDF Lemmatization Stemming Word embeddings