Skip to content

Commit

Permalink
Merge pull request #8 from callanwu/main
Browse files Browse the repository at this point in the history
add webwalker
  • Loading branch information
boyugou authored Jan 17, 2025
2 parents 3131bdc + b6666e1 commit 2f671a4
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions update_template_or_data/update_paper_list.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,13 @@
- [WebWalker: Benchmarking LLMs in Web Traversal](https://arxiv.org/abs/2501.07572)
- Jialong Wu, Wenbiao Yin, Yong Jiang, Zhenglin Wang, Zekun Xi, Runnan Fang, Deyu Zhou, Pengjun Xie, Fei Huang
- 🏛️ Institutions: Tongyi Lab, Alibaba NLP
- 📅 Date: January 13, 2025
- 📑 Publisher: arXiv
- 💻 Env: [Web]
- 🔑 Key: [benchmark], [framework], [RAG], [WebWalker], [WebWalkerQA]
- 📖 TLDR: This paper presents **WebWalker**, a multi-agent framework designed to improve the ability of large language models (LLMs) to traverse websites, addressing the challenges of retrieving complex, multi-layered information. WebWalker integrates an "explore-critic" paradigm, where the explorer agent navigates the web, and the critic agent evaluates the progress. The **WebWalkerQA** benchmark is introduced to assess web traversal tasks, showing how retrieval-augmented generation (RAG) can be enhanced with vertical exploration to solve real-world queries.


- [OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis](https://qiushisun.github.io/OS-Genesis-Home/)
- Qiushi Sun, Kanzhi Cheng, Zichen Ding, Chuanyang Jin, Yian Wang, Fangzhi Xu, Zhenyu Wu, Chengyou Jia, Liheng Chen, Zhoumianze Liu, Ben Kao, Guohao Li, Junxian He, Yu Qiao, Zhiyong Wu
- 🏛️ Institutions: Shanghai AI Lab, HKU, Johns Hopkins University, SJTU, Oxford, HKUST
Expand Down

0 comments on commit 2f671a4

Please sign in to comment.