Stars
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
[CVPR2024] Diffusion-based Blind Text Image Super-Resolution (Official)
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
This is the repo of the medical dialogue dataset 'imcs21' in CBLUE@Tianchi
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
A modular graph-based Retrieval-Augmented Generation (RAG) system
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
React UI + elegant infrastructure for AI Copilots, in-app AI agents, AI chatbots, and AI-powered Textareas 🪁
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
🔍 AI search engine - self-host with local or cloud LLMs
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations. Turn any online or local LLM into your personal, autonomous AI (e.g gpt, claud…
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
“alibabacloud-nls-python-sdk提供使用阿里云智能语音服务的能力,包括语音识别、语音合成、文件转写等。”