Skip to content

Commit

Permalink
update papers from Prof. Toby Li
Browse files Browse the repository at this point in the history
  • Loading branch information
boyugou committed Dec 18, 2024
1 parent dff59dd commit c5cb612
Show file tree
Hide file tree
Showing 2 changed files with 12 additions and 2 deletions.
5 changes: 4 additions & 1 deletion .github/workflows/main.yml
Original file line number Diff line number Diff line change
Expand Up @@ -53,7 +53,10 @@ jobs:
run: |
git config --global user.name "github-actions"
git config --global user.email "[email protected]"
git add README.md paper_by_env paper_by_author update_template_or_data/statistics
git add README.md
git add paper_by_env
git add paper_by_author
git add update_template_or_data/statistics
git commit -m "Auto Updates"
git push
env:
Expand Down
9 changes: 8 additions & 1 deletion update_template_or_data/update_paper_list.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,12 @@


- [Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery For Foundation Model Internet Agents](https://arxiv.org/abs/2412.13194)
- Yifei Zhou, Qianlan Yang, Kaixiang Lin, Min Bai, Xiong Zhou, Yu-Xiong Wang, Sergey Levine, Erran Li
- 🏛️ Institutions: MSR, UIUC, Amazon
- 📅 Date: December 17, 2024
- 📑 Publisher: arXiv
- 💻 Env: [Web]
- 🔑 Key: [framework], [reinforcement learning], [skill discovery], [PAE]
- 📖 TLDR: This paper introduces the Proposer-Agent-Evaluator (PAE) system, enabling foundation model agents to autonomously discover and practice skills in real-world web environments. PAE comprises a context-aware task proposer, an agent policy for task execution, and a vision-language model-based success evaluator. Validated on vision-based web navigation tasks, PAE significantly enhances zero-shot generalization capabilities of vision-language model Internet agents, achieving over 30% relative improvement on unseen tasks and websites, and surpassing state-of-the-art open-source agents by more than 10%.

- [UI Layout Generation with LLMs Guided by UI Grammar](https://arxiv.org/abs/2310.15455)
- Yuwen Lu, Ziang Tong, Qinyi Zhao, Chengzhi Zhang, Toby Jia-Jun Li
Expand Down

0 comments on commit c5cb612

Please sign in to comment.