Natural Language Processing

  • Tokenizers
  • Embeddings
  • Frameworks
    • NLTK, popularity is declining
    • spaCy, widely used in production systems
    • Hugging Face, “democratizing NLP/ML”, known for its advanced pre-trained models.
    • Flair, built by Humboldt University of Berlin, based on PyTorch
    • Stanza, developed by Stanford
    • TensorFlow and PyTorch systems
    • LangChain, with integration with LLMs
    • Jina AI