💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
-
Updated
Nov 21, 2025 - Python
E5A6
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
Retrieval and Retrieval-augmented LLMs
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
A curated list of pretrained sentence and word embedding models
xmnlp:提供中文分词, 词性标注, 命名体识别,情感分析,文本纠错,文本转拼音,文本摘要,偏旁部首,句子表征及文本相似度计算等功能
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
unified embedding model
SGPT: GPT Sentence Embeddings for Semantic Search
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
A Python vector database you just need - no more, no less.
Compute Sentence Embeddings Fast!
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
A Structured Self-attentive Sentence Embedding
Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.
A curated list of research papers in Sentence Reprsentation Learning and a sts leaderboard of sentence embeddings.
Add a description, image, and links to the sentence-embeddings topic page so that developers can more easily learn about it.
To associate your repository with the sentence-embeddings topic, visit your repo's landing page and select "manage topics."