Auto Web Task Agent

Backend
Frontend

Inspiration

Many everyday tasks on the internet are repetitive and time-consuming. Whether it is applying for internships, filling forms, searching products, or booking services, users often spend a lot of time navigating websites and clicking through multiple pages.

As students and developers, we noticed how much time is wasted performing the same manual actions again and again. We wanted to explore whether AI agents could automate these workflows in the same way a human interacts with a website.

This idea inspired us to build Auto Web Task Agent, an intelligent assistant that can understand natural language instructions and automatically perform tasks on websites.

What it does

Auto Web Task Agent allows users to complete web tasks using simple commands.

Instead of manually navigating websites, a user can simply type something like:

"Apply for AI internships on LinkedIn."

The AI agent will then:

Understand the user's intent

Break the task into logical steps

Navigate the website automatically

Fill required forms

Complete the task

The system transforms natural language into automated browser actions, reducing repetitive manual work.

How we built it

Our system combines AI reasoning, automation, and a simple user interface.

Core AI Layer

We used Amazon Nova to power the reasoning and planning capabilities of the agent. The model interprets user instructions and converts them into structured actions.

Automation Layer

Using Amazon Nova Act, the agent interacts with web interfaces, performs clicks, fills forms, and navigates pages.

Backend

The backend manages the agent workflow, task planning, and communication between the AI model and the browser automation system.

Frontend

We built a lightweight dashboard where users can enter commands and observe the task execution steps in real time.

Storage

Files such as resumes or input data are stored using Amazon S3.

The overall pipeline looks like this:

User Command → AI Reasoning → Task Planning → Web Automation → Task Completion

Challenges we ran into

One of the biggest challenges was designing a reliable workflow where the AI could convert human instructions into precise browser actions.

Websites have different layouts and dynamic elements, which makes automation difficult. We had to carefully structure prompts and agent workflows to ensure consistent behavior.

Another challenge was balancing AI reasoning with deterministic automation, ensuring that the agent could adapt to different tasks while still executing reliable steps.

Accomplishments that we're proud of

Successfully built an AI-powered web automation agent that can convert natural language instructions into real web actions.

Integrated Amazon Nova reasoning capabilities to understand user intent and generate step-by-step workflows.

Implemented browser automation using Amazon Nova Act, allowing the agent to interact with websites like a real user.

Designed a simple and intuitive interface where users can give commands and watch tasks being executed in real time.

Built a working prototype within a short hackathon timeframe that demonstrates the real potential of AI agents for productivity.

What we learned

During the development of Auto Web Task Agent, we gained valuable insights into building AI-driven applications:

How agentic AI systems break down complex instructions into smaller executable steps.

Integrating foundation models like Amazon Nova with real-world automation workflows.

Designing prompts and agent logic to ensure reliable task execution.

Handling dynamic web interfaces and ensuring automation works across different websites.

Building scalable cloud-based AI solutions using AWS services.

This project helped us better understand how AI agents can move beyond chatbots and actually perform real-world actions.

What's next for Auto Web Task Agent

We plan to expand Auto Web Task Agent into a more powerful productivity platform. Future improvements include:

Supporting more complex multi-step workflows across multiple websites.

Adding voice commands so users can speak tasks naturally.

Building personalized agents that remember user preferences.

Integrating additional AI capabilities like multimodal understanding for documents and screenshots.

Expanding the system for use cases such as job applications, travel booking, research automation, and business workflows.

Our long-term vision is to create a fully autonomous digital assistant that can perform everyday online tasks with minimal human effort.

Built With

act
agent
ai
amazon
amazon-web-services
api
automation
backend
bedrock
browser
cloud
code
control
css
data
express.js
file
foundationdb
framework
frameworks
frontend
github
interface
javascript
lambda
langchain
languages
libraries
logic
models
node.js
nova
orchestration
planning
playwright
python
react
reasoning
repository
s3
server
serverless
services
storage
tailwind
task
ui
user
version
web

Updates

Rajan Sharma started this project — Mar 16, 2026 03:24 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.