As web technologies have advanced, they have become deeply embedded in people's lives, supporting a wide range of work and daily activities. Yet many tasks performed on the web are repetitive and time-consuming, which reduces productivity and detracts from quality of life. The central role of the web, combined with the time and effort that everyday web tasks demand, naturally raises a question: ‘Can a superintelligent AI assistant be developed to automatically handle these repetitive and time-consuming tasks?’ Recently, Large Foundation Models (LFMs), containing billions of parameters, have exhibited human-like language understanding and reasoning capabilities, offering promising opportunities for developing such assistants. To harness the potential of LFMs, WebAgents have emerged to complete complex web tasks according to user instructions, making daily life considerably more convenient.
Our Survey Paper: A Survey of WebAgents: Towards Next-Generation AI Agents for Web Automation with Large Foundation Models
The topics of this tutorial include (but are not limited to) the following:
The tutorial outline is shown below: