smart_toyAI/PROMPT ENGINEERING

Agentic manual testing

sourceSimon Willison

calendar_todayMarch 6, 2026

schedule2 min read

lightbulb

EXECUTIVE SUMMARY

Enhancing Code Reliability with Agentic Manual Testing Techniques

Summary

The article discusses the importance of agentic manual testing in software development, emphasizing the need for coding agents to execute and verify the code they generate. It highlights various strategies for manual testing, particularly in conjunction with automated tests, to ensure code quality and functionality.

Key Points

Coding agents can execute the code they write, unlike traditional large language models (LLMs).
Automated tests may pass while the code still fails in practical scenarios, necessitating manual testing.
Effective manual testing methods include using `python -c` for Python libraries and `curl` for JSON APIs.
Playwright, developed by Microsoft, is a powerful tool for automating web browser testing.
The agent-browser by Vercel serves as a CLI wrapper around Playwright, enhancing usability for coding agents.
Showboat is a tool designed to document the manual testing process, capturing commands and outputs for verification.
Manual testing can reveal issues not caught by automated tests and is essential for interactive web UIs.

Analysis

The significance of agentic manual testing lies in its ability to complement automated testing, ensuring that code not only passes tests but also functions correctly in real-world scenarios. This approach helps maintain high-quality software and improves the reliability of coding agents in development workflows.

Conclusion

IT professionals should integrate agentic manual testing techniques into their development processes to enhance code quality and reliability. Utilizing tools like Playwright and Showboat can streamline testing and documentation efforts, ultimately leading to more robust software solutions.