WebAgents

About

With the advancement of web techniques, they have become deeply embedded in people's lives, facilitating the completion of numerous work and daily activities. Despite the importance of the web, many tasks performed on it are repetitive and extremely time-consuming, significantly reducing productivity and negatively impacting overall quality of life. The critical role of the web, combined with the significant time and effort required to complete daily web tasks, naturally raises a question: ‘Can a superintelligent AI assistant be developed to automatically handle these repetitive and time-consuming tasks?’ Recently, Large Foundation Models (LFMs), containing billions of parameters, have exhibited human-like language understanding and reasoning capabilities, offering promising opportunities for the development of superintelligent AI assistants. To fully harness the potential of LFMs, WebAgents have emerged to complete complex web tasks according to user instructions, greatly enhancing the convenience of human daily life.

Our Survey Paper: A Survey of WebAgents: Towards Next-Generation AI Agents for Web Automation with Large Foundation Models

Slides: WebAgent-Slides-Part-1 and WebAgent-Slides-Part-2

TARGET AUDIENCE AND PREREQUISITES FOR THE TUTORIAL

The audience of this tutorial could be college students, researchers in academic institutions, and industrial AI labs who are interested in Large Foundation Models (LFMs) and WebAgents. The audience is expected to have basic knowledge of artificial intelligence, foundation models, and agent techniques. However, this tutorial will be presented at the college junior/senior level so that it can be comfortably followed by academic researchers or industrial practitioners who are interested in this emerging field but not quite familiar with it. After attending this tutorial, the audience is expected to have a comprehensive understanding of WebAgents and obtain some insights about the potential research directions in this field.

Tutorial Syllabus

The topics of this tutorial include (but are not limited to) the following:

WebAgents

Large Foundation Models

Pre-training

Fine-tuning

Reinforcement Learning

Trustworthiness

The tutorial outline is shown below:

Introduction of WebAgents (15 minutes)

Preliminaries of AI Agents and LFM-based WebAgents (30 minutes)

Reinforcement learning-based Agents
Large foundation model-empowered Agents
AI Agents for Web Automation

Architecture of WebAgents and Main Modules (30 minutes)

WebAgents architecture overview
Perception in WebAgents
Planning and Reasoning in WebAgents
Execution in WebAgents

Coffee Break (20 minutes)

Training of WebAgents (30 minutes)

Data Used for Training
Training Strategies in WebAgents

Trustworthy WebAgents (30 minutes)

Safety and Robustness in WebAgents
Privacy in WebAgents
Generalizability in WebAgents

Challenges and Future Directions of WebAgents (15 minutes)

Personalized WebAgents
Domain-Specific WebAgents
Trustworthy WebAgent
Dataset and Benchmark of WebAgent

Q&A (10 minutes)

About

TARGET AUDIENCE AND PREREQUISITES FOR THE TUTORIAL

Event Dates

Tutorial Syllabus

Organization

Tutorial TUTORS

Yujuan Ding

Liangbo Ning

Ziran Liang

Zhuohang Jiang

Haohao Qu

Wenqi Fan

Xiao-yong Wei

Hui Liu

Philip S. Yu

Qing Li