Creating An Automated Ui Test Of Your Web App In Seconds, With

Creating An Automated Ui Test Of Your Web App In Seconds, With

Posted on Jan 14

• Originally published at Medium

The pace of change in generative AI is incredible, and I’m constantly amazed with what we can do now. But today I discovered something that turned my amazement up to 11.

Jack Wotherspoon had shared his most recent weekly bulletin describing the latest advancements in Gemini CLI. He mentioned “Multi-Modal MCP Tool Responses” as one of the new features, meaning that Gemini CLI can now interpret and act on MCP responses of various modalities, not just text. He happened to mention BrowserMCP which I’ve never before looked at.

15 minutes later I had Gemini CLI performing UI tests of one of my web apps, simply by providing a single natural language instruction to Gemini CLI.

This was a doddle. First, I took the MCP configuration described here and added it to my .gemini/settings.json. I.e.

Then I installed the BrowserMCP extension into Chrome. This took about 5 seconds, following the instructions here. Could it really be this easy?

Time to test with Rickbot, of course! For those of you not aware, Rickbot is a multi-personality agentic solution I built recently using Google Gemini, and which I have deployed to Google Cloud Run. (Check the article for the walkthrough of how I did this.)

My goal was to create a UI test that would change to each of the personalities available in the dropdown, interact with each personality, get all the responses, and then present them back in a table.

I launched the Rickbot Streamlit web app locally in my development environment, like this:

Then I connect BrowserMCP by clicking on the BrowserMCP button in Chrome:

Source: Dev.to