Continual learning with LLM #1063

sonichi · 2023-06-03T16:11:14Z

Reason: to support continual learning with LLM, such as:

Use historical interaction log to improve the performance of an agent with the accumulation of interaction sessions.
Summarize data that don't fit in the context window at once.
Personalized chat.
Long chat.

One possible solution is:
Implement a continual learning agent and a teaching agent such that:

The teaching agent feeds learning goal and learning data to the learning agent. The learning agent maintains learning results of a particular form.
The learning agent can use LLM to update learning results after each batch of learning data.
When LLM is used for learning in the learning agent, it only needs the current learning results and the new batch of learning data. So the learning can be performed online and continually.
The teaching agent allows human input similar to user proxy agent.
Both the learning agent and the teaching agent can be serialized and deserialized to continue the learning process.
Optionally, the learning agent tells the teaching agent the maximal number of tokens of the learning data to receive.
Optionally, the teaching agent tells the learning agent the minimal number of tokens needed for the next batch of learning data.
Optionally, the teaching agent can provide feedback to the learning agent, including: numeric value about how good is the current learning result(s); and textual feedback.

Tasks

Give feedback

qingyun-wu · 2023-06-06T22:52:04Z

This implementation could be used to answer the following research question: How to use historical interaction log to improve the performance of an agent with the accumulation of interaction sessions?

qingyun-wu · 2023-06-08T22:55:52Z

One possible implementation is to just have a learning component in the AssistantAgent and a teaching component in the UserProxyAgent.

sonichi added the enhancement New feature or request label Jun 3, 2023

sonichi added this to the Release of a new version for an initial autogen.agent package milestone Jun 3, 2023

sonichi self-assigned this Jun 3, 2023

qingyun-wu mentioned this issue Jun 6, 2023

How to use interaction log to improve the performance of an agent across sessions #1068

Closed

sonichi changed the title ~~Continual learning agent and teaching agent~~ Continual learning with LLM Jun 6, 2023

sonichi mentioned this issue Jun 7, 2023

Improve logging in autogen.oai #1069

Open

sonichi mentioned this issue Jun 10, 2023

Create a new subclass of Domain to support open-ended tuning #1077

Open

gagb self-assigned this Jun 14, 2023

skzhang1 self-assigned this Jun 18, 2023

weilinear mentioned this issue Jun 21, 2023

Write user code example #1072

Open

sonichi modified the milestones: Release of a new version for an initial autogen.agent package, Upgrade of autogen Jun 23, 2023

qingyun-wu linked a pull request Jun 28, 2023 that will close this issue

Continual learning via LearningAgent and TeachingAgent #1098

Draft

3 tasks

sonichi linked a pull request Jul 18, 2023 that will close this issue

Continual learning via LearningAgent and TeachingAgent #1098

Draft

3 tasks

sonichi mentioned this issue Aug 7, 2023

support async in agents #1178

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Continual learning with LLM #1063

Continual learning with LLM #1063

sonichi commented Jun 3, 2023 •

edited by skzhang1

Loading

Tasks

qingyun-wu commented Jun 6, 2023

qingyun-wu commented Jun 8, 2023 •

edited

Loading

Continual learning with LLM #1063

Continual learning with LLM #1063

Comments

sonichi commented Jun 3, 2023 • edited by skzhang1 Loading

Tasks

qingyun-wu commented Jun 6, 2023

qingyun-wu commented Jun 8, 2023 • edited Loading

sonichi commented Jun 3, 2023 •

edited by skzhang1

Loading

qingyun-wu commented Jun 8, 2023 •

edited

Loading