Continual learning with LLM #1063

Open
1 of 3 tasks
sonichi opened this issue Jun 3, 2023 · 2 comments · May be fixed by #1098

Comments

@sonichi
Contributor

sonichi commented Jun 3, 2023

Motivation: support continual learning with LLMs, for example:

  • Use the historical interaction log to improve an agent's performance as interaction sessions accumulate.
  • Summarize data that does not fit in the context window at once.
  • Personalized chat.
  • Long chat.

One possible solution is to implement a continual learning agent and a teaching agent such that:

  • The teaching agent feeds the learning goal and learning data to the learning agent. The learning agent maintains learning results in a particular form.
  • The learning agent can use an LLM to update the learning results after each batch of learning data.
  • When the LLM is used for learning in the learning agent, it needs only the current learning results and the new batch of learning data, so learning can be performed online and continually.
  • The teaching agent allows human input, similar to the user proxy agent.
  • Both the learning agent and the teaching agent can be serialized and deserialized to continue the learning process.
  • Optionally, the learning agent tells the teaching agent the maximum number of tokens of learning data it can receive.
  • Optionally, the teaching agent tells the learning agent the minimum number of tokens needed for the next batch of learning data.
  • Optionally, the teaching agent can provide feedback to the learning agent, including a numeric score for how good the current learning result(s) are, and textual feedback.
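
To make the proposal concrete, here is a minimal sketch of the loop described above. It is not an existing API: the class names `LearningAgent`, `TeachingAgent`, and `JsonState`, the `UpdateFn` signature, and the `toy_update` function are all hypothetical, and the LLM call is replaced by a plain callable. Token counting is approximated by whitespace word count. The optional minimum-token hint and the feedback channel are left out.

```python
import json
from dataclasses import dataclass, asdict
from typing import Callable, List

# Hypothetical signature: (goal, current_result, new_batch) -> updated_result.
# In practice this would wrap an LLM call; here it is any callable.
UpdateFn = Callable[[str, str, str], str]


class JsonState:
    """Shared (de)serialization so a learning session can be resumed later."""

    def to_json(self) -> str:
        return json.dumps(asdict(self))

    @classmethod
    def from_json(cls, text: str):
        return cls(**json.loads(text))


@dataclass
class LearningAgent(JsonState):
    """Keeps a running learning result and updates it one batch at a time."""
    goal: str = ""
    result: str = ""
    max_tokens: int = 50  # optional budget reported to the teaching agent

    def learn(self, batch: str, update: UpdateFn) -> str:
        # Only the current result and the new batch are passed in,
        # so earlier batches never need to be revisited.
        self.result = update(self.goal, self.result, batch)
        return self.result


@dataclass
class TeachingAgent(JsonState):
    """Holds the learning data and hands it out in token-bounded batches."""
    goal: str
    data: List[str]
    cursor: int = 0  # index of the next unseen item

    def next_batch(self, max_tokens: int) -> str:
        # Whitespace word count is a crude stand-in for a real tokenizer.
        # A single oversized item is still sent alone so the loop progresses.
        picked, used = [], 0
        while self.cursor < len(self.data):
            cost = len(self.data[self.cursor].split())
            if picked and used + cost > max_tokens:
                break
            picked.append(self.data[self.cursor])
            used += cost
            self.cursor += 1
        return "\n".join(picked)

    def teach(self, learner: LearningAgent, update: UpdateFn) -> str:
        learner.goal = self.goal
        while self.cursor < len(self.data):
            learner.learn(self.next_batch(learner.max_tokens), update)
        return learner.result


def toy_update(goal: str, result: str, batch: str) -> str:
    # Placeholder for the LLM: appends the first word of each new line.
    heads = [line.split()[0] for line in batch.splitlines() if line.split()]
    return " ".join(filter(None, [result] + heads))
```

Calling `teacher.teach(learner, toy_update)` runs the whole loop. Because both agents serialize to JSON, the loop can also be stopped after any batch and resumed from `LearningAgent.from_json(...)` and `TeachingAgent.from_json(...)`.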
@sonichi sonichi added the enhancement New feature or request label Jun 3, 2023
@sonichi sonichi self-assigned this Jun 3, 2023
@sonichi sonichi changed the title Continual learning agent and teaching agent Continual learning with LLM Jun 6, 2023
@qingyun-wu
Contributor

This implementation could be used to answer the following research question: how can the historical interaction log be used to improve an agent's performance as interaction sessions accumulate?

@qingyun-wu
Contributor

qingyun-wu commented Jun 8, 2023

One possible implementation is to just have a learning component in the AssistantAgent and a teaching component in the UserProxyAgent.
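
A rough sketch of that composition, under heavy assumptions: `AssistantSketch` and `UserProxySketch` are stand-ins written for this example, not the real `AssistantAgent` and `UserProxyAgent` classes, and `LearningComponent`, `TeachingComponent`, and their methods are hypothetical names. The `update` callable stands in for an LLM call.

```python
from typing import Callable, List, Optional


class LearningComponent:
    """Hypothetical component holding the running learning result."""

    def __init__(self, update: Callable[[str, str], str]):
        self.update = update  # (current_result, new_batch) -> new_result
        self.result = ""

    def absorb(self, batch: str) -> None:
        self.result = self.update(self.result, batch)


class TeachingComponent:
    """Hypothetical component that hands out learning data batch by batch."""

    def __init__(self, batches: List[str]):
        self.batches = list(batches)

    def next_batch(self) -> Optional[str]:
        return self.batches.pop(0) if self.batches else None


class AssistantSketch:
    """Stand-in for an assistant agent that owns a learning component."""

    def __init__(self, learner: LearningComponent):
        self.learner = learner

    def receive(self, message: str) -> str:
        self.learner.absorb(message)
        return self.learner.result


class UserProxySketch:
    """Stand-in for a user proxy agent that owns a teaching component."""

    def __init__(self, teacher: TeachingComponent):
        self.teacher = teacher

    def run(self, assistant: AssistantSketch) -> str:
        reply = ""
        # Feed batches until the teaching component has nothing left.
        while (batch := self.teacher.next_batch()) is not None:
            reply = assistant.receive(batch)
        return reply
```

The point of the design is that the two existing agent roles stay unchanged and the learning and teaching behaviour is attached as components, so no new agent types are needed.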

4 participants