Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request]: RAG #52

Open
scorpfromhell opened this issue Jul 25, 2024 · 0 comments
Open

[Feature Request]: RAG #52

scorpfromhell opened this issue Jul 25, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@scorpfromhell
Copy link

Problem Description

If question & answers/FAQs/documentations are made available in local storage or indexdb, they can be used for doing retrieval augmented generation.

Solution Description

Currently the chat responses are based only on the data provided during the pre-training. That data might either be outdated or insufficient. To overcome that RAG can be considered.

Content can be either stored locally or can be fetched from search engines or specified sites using tool calling.

Content can be stored locally in:

  1. local storage if curated question-answer pairs exist in limited quantity
  2. indexdb if curated question-answer pairs exist in a large number that can't fit into local storage
  3. Voy, a WASM based vector db, can be used to store the content after embedding.

The content can either be uploaded from local files or synced via REST API (can be provided via settings or a button next to prompts).

Retrieval can be done using elasticlunr.js for plain text and Transformers.js for semantic search in case of embeddings.

Alternatives Considered

Something similar has been done in https://github.com/jacoblee93/fully-local-pdf-chatbot

But it does not allow:

  1. Persistence of content on which RAG needs to be done
  2. Synchronisation of locally stored content via REST API
  3. Retrieving content from the Internet (tool calling)

Additional Context

No response

@scorpfromhell scorpfromhell added the enhancement New feature or request label Jul 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant