Releases: dlite-tools/NLPiper
Releases · dlite-tools/NLPiper
0.3.1
0.3.0
📝 Changelog:
-
✨ Features:
- Add Spacy and TorchText Embeddings transformer
-
📚 Documentation:
- Improve documentation and migrate links to new repository host
0.2.1
📝 Changelog:
-
🛠️ Improvements:
- Allow
DocumentandTokento be set extra attributes
- Allow
-
♻️ Refactoring:
- Move
BaseTransformout of init for better integration with IDEs
- Move
-
📚 Documentation:
- Improve documentation
cleanersfor sphinix better compatibility and explanation on transforms
- Improve documentation
0.2.0
📝 Changelog:
-
✨ Features:
- Add transformer for embeddings
- Add ner and embeddings token wise
- Add embeddings document wise
- Improve Stanza integration allowing to extract lemma and ner directly to the token object
-
🐛 Bug:
- Fix bug on importing nlpiper
-
📚 Documentation:
- Update and improve documentation
0.1.0
📝 Changelog:
- ✨ Features:
- Create transformation pipeline to a document
- Rollback transformation steps applied to a document
- Add document Cleaners:
CleanURL,CleanEmail,CleanNumber,CleanPunctuation,CleanEOF,CleanMarkup,CleanAccents - Add document Tokenizers:
BasicTokenizer,MosesTokenizer,StanzaTokenizer - Add document Normalizers:
CaseTokens,RemovePunctuation,RemoveStopWords,VocabularyFilter,SpellCheck,Stemmer - Add Embeddings:
GensimEmbeddings