LLM Document OpenSource List

  • 2023-Databerry : The no-code platform for connecting custom data to large language models.

  • 2023-xturing : xturing provides fast, efficient and simple fine-tuning of LLMs, such as LLaMA, GPT-J, GPT-2, OPT, Cerebras-GPT, Galactica, and more. By providing an easy-to-use interface for personalizing LLMs to your own data and application, xTuring makes it simple to build and control LLMs. The entire process can be done inside your computer or in your private cloud, ensuring data privacy and security.

  • 2023-LMFlow : An extensible, convenient, and efficient toolbox for finetuning large machine learning models, designed to be user-friendly, speedy and reliable, and accessible to the entire community.

  • 2023-Argilla : Argilla is an open-source data curation platform for LLMs. Using Argilla, everyone can build robust language models through faster data curation using both human and machine feedback. We provide support for each step in the MLOps cycle, from data labeling to model monitoring.


  • 2023-ReadPilot : Read Pilot analyzes online articles and generate Q&A cards for you. Powered by OpenAI & Next.js.

  • 2023-DocsGPT : GPT-powered chat for documentation search & assistance.

  • 2023-pdfGPT : PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The only open source solution to turn your pdf files in a chatbot!

  • 2023-nextjs-openai-doc-search : Template for building your own custom ChatGPT style doc search powered by Next.js, OpenAI, and Supabase.

  • 2023-WebWhiz : WebWhiz allows you to create an AI chatbot that knows everything about your product and can instantly respond to your customer’s queries.

  • 2023-Quivr : Quivr is your second brain in the cloud, designed to easily store and retrieve unstructured information. It’s like Obsidian but powered by generative AI.

  • 2023-BriefGPT : Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI.

  • 2023-Psychic : Psychic is an open source integration platform to extract and transform unstructured data from SaaS applications like Notion, Slack, Zendesk, Confluence, and Google Drive. Instead of building one integration for each data sources, you can build one integration that works for all data sources, and manage each connection from a GUI. Psychic is designed for startups that use LLMs and vector databases.

  • 2023-GanymedeNil/document.ai : 基于向量数据库与 GPT3.5 的通用本地知识库方案(A universal local knowledge base solution based on vector database and GPT3.5)

  • 2023-Haystack : 🔍 Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs (GPT-4, ChatGPT and alike). Haystack offers production-ready tools to quickly build complex question answering, semantic search, text generation applications, and more.

  • 2023-OpenChat : Run and create custom ChatGPT-like bots with OpenChat, embed and share these bots anywhere, the open-source chatbot console.

  • 2023-anything-llm : A full-stack application that turns any documents into an intelligent chatbot with a sleek UI and easier way to manage your workspaces.

  • 2023-Dialoqbase : Dialoqbase is a web application that allows you to create custom chatbots with your own knowledge base. It uses powerful language models to generate responses and PostgreSQL for vector search and storing the knowledge base