Connect with us

Boston, MA

Tools for Your To Do List with Spot and Gemini Robotics | Boston Dynamics

Published

on

Tools for Your To Do List with Spot and Gemini Robotics | Boston Dynamics


For an industrial robot built for the rigors of factories and power plants, tidying up a living room may seem like a light day at the office for Spot. Yet, a recent video of the robot picking up shoes and soda cans in a residential home represents the promise of AI models in robotics. In this case, Google’s visual-language model (VLM) Gemini Robotics-ER 1.5 was empowering Spot with embodied reasoning.

This particular demo grew out of a 2025 hackathon at Boston Dynamics that built on prior projects using Large Language Models (LLMs) and Visual Foundation Models (VFMs) to enable Spot to contextualize its environment and engage in more complex autonomous actions than a typical Autowalk mission. Rather than write formal software logic or a “state machine” program that defines each step of a given task, we interacted with Gemini Robotics using conversational language. In turn, it communicated with Spot on our behalf.

A Robust SDK and Natural Language Prompts Save Time

Using Spot’s SDK, we developed a layer that facilitated interaction between Gemini Robotics and Spot’s application programming interface (API). The API normally gives developers access to the robot’s capabilities to create custom applications or behaviors. For example, researchers at Meta have used Spot to test how an AI system could locate and retrieve objects it had never seen before.

Advertisement

Our ability to engage Gemini Robotics using natural language prompts was a huge timesaver, compared to traditional programming. We told Gemini Robotics it had access to a mobile robot equipped with cameras and a robotic arm. It also had a finite set of tools it could use to control the robot. A tool is a lightweight script that performs some internal logic and translates inputs from Gemini Robotics to actual API calls. We limited the actions to navigating between locations, capturing images, identifying objects, grasping them, and placing them somewhere else. 

The extent of our SDK means there are great examples one could leverage to add more access to the API with minimal development.

Giving Gemini Robotics a Baseline

To start we needed to explain to Gemini Robotics what we wanted it to do. We did experience a learning curve when writing these baseline prompts. Simple instructions like “put down an object” or “take a picture” weren’t detailed enough to produce expected behavior. We had to add context in our descriptions as we refined each tool. 

A good example is the detailed prompt for the “TakePicture” tool:

This command will cause the robot to take a picture with the specified camera. There is some nuance to choosing the correct camera. Once arriving at a location using GoTo, you should always start by taking a picture with the gripper camera, because it's the most informative.
If the robot has arrived at location and is already holding an object, you can do one of two things:
1. Immediately call PutDown
2. Search the area with either of the front cameras. The front cameras are low to the ground, so if you're trying to put things on an elevated surface, they won't give you useful information.

In this example, we gave Gemini Robotics no detailed description of the robot’s chassis or arm. Instead, we simply explained that Spot’s front cameras would be too low to photograph objects on elevated surfaces. We were able to iterate rapidly, as small changes in wording produced noticeably better results. Once it had this set of basic tools through the API, Gemini Robotics could sequence Spot’s actions and follow the handwritten instructions on a whiteboard on the day of the demonstration.

Advertisement

How Gemini Robotics and Spot Collaborate

Until the robot powers on, Gemini Robotics has no context for what specific tasks we might ask it to perform in a given demo. We only provided simple written instructions, such as, “Make sure all of the shoes at the front door are on the shoe rack.” Gemini Robotics evaluated images from Spot’s cameras and identified objects in the scene that matched the instructions. These objects became the reference points for Spot’s navigational and manipulation systems.

In many respects, Gemini Robotics was identical to an operator manually driving Spot using its tablet controller. For example, to pick up an object with Spot, an operator positions the robot near the object and then uses a grasp wizard to identify the target object. The operator provides high-level direction and Spot figures out the exact details. In this demonstration, Gemini Robotics functioned as both the operator and the tablet sending commands to the robot. This freed us up to act more like a team lead, providing a high-level to-do list and trusting Spot and Gemini Robotics do the rest.

Call and Response

When Gemini Robotics engages a given tool, the tool responds with results and context, such as, “I picked up the object,” or “I can’t pick up something while my hand is full.” Gemini Robotics then makes adjustments on the fly based on this feedback from Spot. For example, to pick up shoes, Gemini Robotics requests an image, identifies the shoes in that image, and calls the “pickup” command. By creating fundamental tools that semantically flow in conversation,  Gemini Robotics can manage the sequence of tasks required to clean up the room. Spot’s existing software stack manages the locomotion, navigation, and manipulation of the robot itself.

It’s important to note Gemini Robotics has strict boundaries in this scenario. It can’t invent new capabilities or control Spot beyond what is available through the API. This keeps Spot’s behavior predictable, while still allowing Gemini Robotics to adapt to different situations.

A Force Multiplier for Developers

For developers already working with Spot, this research has tremendous potential. Through Spot’s SDK, they have access to a robust toolkit of capabilities. Companies use these tools today to build applications for inspection, research, and industrial data analysis, among others.

An AI model like Gemini Robotics offers a way to expand those applications more rapidly. Rather than write extensive task logic on top of Spot’s APIs, developers can experiment with having AI systems interpret natural language instructions and dynamically choose to engage the robot. As a result, models like Gemini Robotics can act as force multipliers, amplifying the reliable toolkit and robust performance that is already delivering value for Boston Dynamics customers.

Advertisement

Our Next-Token Prediction for Spot and Gemini Robotics

Although this is still an experimental step and not a hardened application, it illustrates a compelling direction for robotics and physical AI. Robots like Spot are already extremely capable of navigating complex and changeable environments, collecting data and sensor readings, and manipulating objects. Rather than reinventing the wheel, AI foundation models offer a new way to expand these capabilities in new settings and to new applications.

Physical AI is a rapidly evolving field and our team is leading the way in the lab and in real applications of AI empowered robots. While we are early in our formal partnership with Google Deepmind, we’re excited for what the future holds with Atlas and we’ve already rolled out practical enhancements for Spot and Orbit, with AIVI-Learning powered by Google Gemini Robotics ER 1.6. This next evolution of our AI Visual Inspection tool unlocks a new level of visual intelligence, as users benefit from shared expertise bringing a deeper level of contextual intelligence to Spot and Orbit. Model improvements automatically happen behind the scenes, adding more capabilities to the same software and hardware.

Today, this demo points to a future where users can rely more on natural language to guide Spot’s actions, rather than complex code. The engineer’s role shifts toward setting goals and objectives. The multi-modal robot foundation model interprets the instructions to form complex and adaptive plans and Spot executes the action.

This article was contributed by Issac Ross and Nikhil Devraj, engineers on the Spot team.

Advertisement



Source link

Boston, MA

Portion of Storrow Drive, Soldiers Field Road will close nightly through August – The Boston Globe

Published

on

Portion of Storrow Drive, Soldiers Field Road will close nightly through August – The Boston Globe


An inbound stretch of Storrow Drive and Soldiers Field Road will be closed each night through August for tunnel repairs, officials announced.

Starting Monday, the closures will begin at 8 p.m. and last until 5 a.m., state officials said.

Road closures begin at North Harvard Street in Allston and stretch along the Charles River Esplanade to Mugar Way in Boston, near the Hatch Memorial Shell, officials said.

Traffic will be detoured into Cambridge over the Anderson Bridge, along Memorial Drive, and then be routed into Boston over the Longfellow Bridge.

Advertisement

The closures will allow ongoing repairs to the Storrow Drive Tunnel in the Back Bay. The work is the first phase of a two-stage project to extend the lifespan of the tunnel, which carries roughly 50,000 drivers to and from downtown Boston daily.

The outbound portion of the tunnel and accompanying roadways will not be affected.

State transportation officials said changes to the work schedule will be made when necessary to minimize impacts during major local events at TD Garden, Fenway Park, or during the FIFA World Cup and 250th anniversary celebrations scheduled for this summer.

Additional changes may be made without notice due to weather.

Transportation officials have not specified when the closures will end.

Advertisement

Bryan Hecht can be reached at bryan.hecht@globe.com. Follow him on Instagram @bhechtjournalism.





Source link

Continue Reading

Boston, MA

Ole Miss softball to play Boston in NCAA tournament Lubbock Regional

Published

on

Ole Miss softball to play Boston in NCAA tournament Lubbock Regional


This story has been updated with new information

OXFORD — Ole Miss softball is back in the NCAA Tournament after making the Women’s College World Series a season ago.

The Rebels (34-24) will play Boston (46-13) on May 15 (1 p.m. CT, ESPNU) in the Lubbock Regional. Ole Miss is the No. 2 seed in the regional, and Boston is the No. 3.

Advertisement

Texas Tech (52-6), the No. 11 overall seed and regional host, will face No. 4 Marist (37-19).

The Rebels went 6-18 in SEC play this season, and have a largely new-look roster from the team that made the WCWS last season.

Ole Miss beat South Carolina and Tennessee in the SEC Tournament to improve its seed.

Freshman Madi George has burst onto the scene in the SEC. The first-year infielder leads Ole Miss with a .385 batting average. She has a team-high 21 home runs and 58 RBIs.

Seniors Emilee Boyer (3.86 ERA), Kyra Aycock (3.97 ERA) and junior Lily Whitten (3.04 ERA) are the primary options in the circle for coach Jamie Trachsel.

Advertisement

Trachsel is in her sixth season leading the Ole Miss program. She led the Rebels to their first WCWS appearance in program history in 2025.

What to know about Boston, Texas Tech and Marist in Lubbock Regional

Boston entered the Patriot League Tournament as the top seed and the Terriers delivered. Boston beat No. 2 Colgate 12-1, becoming the second team in Patriot League history to four-peat as conference champions. Boston is on a 12-game winning streak. Kylie Doherty leads the team with a .396 batting average and 26 home runs.

Texas Tech made the 2025 WCWS championship series, losing to Texas in three games.

Texas Tech lost just three Big 12 games this season but lost in the first round of the Big 12 Tournament. The Red Raiders are a strong threat to get to the WCWS again. There are four Texas Tech batters hitting over .400. Star pitcher NiJaree Canady leads the Red Raiders with a 1.24 ERA. She has 209 strikeouts.

Advertisement

Marist plays in the MAAC and won the conference tournament. Marist split a two-game series against South Carolina early in the season. Ava Metzger (12-3, 2.51 ERA) and Peyton Pusey (.404 batting average) lead the team.

Sam Hutchens covers Ole Miss for the Clarion Ledger. Email him at Shutchens@gannett.com or reach him on X at @Sam_Hutchens_



Source link

Continue Reading

Boston, MA

‘This is really just the start of it all’: Mojo Boston makes splashy debut at City Hall Plaza – The Boston Globe

Published

on

‘This is really just the start of it all’: Mojo Boston makes splashy debut at City Hall Plaza – The Boston Globe


Attendees held umbrellas as The Bends performed at Mojo Boston on Saturday.Andrew Burke-Stevenson/for The Boston Globe

Mojo, a music brand and concert organizer, was founded in 2021 by Charley Blacker, Alex Parker, and Emily Donovan while they were students at the University of Massachusetts Amherst. The trio of friends decided to create Mojo out of their shared love for music and house shows.

“We saw there were so many local musicians that were so talented, but they didn’t have the platform we thought they really deserved,” Blacker told the Globe during Saturday’s festivities. “So we thought if we could do the behind-the-scenes work of organizing photographers and [provide] a social media platform, we could give these musicians the platform they deserve.”

Attendees danced as The Bends performed at Mojo Boston on Saturday.Andrew Burke-Stevenson/for The Boston Globe

Five years later, the team behind Mojo is sticking to their mission, tackling their biggest venue yet with this weekend’s event at City Hall Plaza, which previously served as the original location for Boston Calling before it moved to the Harvard Athletic Complex in Allston. Boston Calling announced last year that it is taking a one-year hiatus in 2026, with plans to return in 2027.

In addition to getting the chance to work on such a big event with his best friends, Blacker hopes Mojo Boston can help “lead to a lot more opportunities for local music.”

People browsed food vendors at Mojo Boston on Saturday.Andrew Burke-Stevenson/for The Boston Globe

There was a wide range of genres represented at Saturday’s event, from the pop-rock stylings of The Bends to house and garage music from DJ AC Slater. Prior to the Boston debut, Mojo brought a festival to Pennsylvania’s Happy Valley in April and returned to Amherst later that month to host another event.

Mojo Boston attendees and former UMass Amherst students Emily Bowler and Max Debeau have been familiar with Mojo since its inception, watching the organization go from hosting basement shows to full scale music festivals. Debeau noted how many of the acts at Saturday’s event have worked with Mojo in the past, performing at UMass and other shows around the Bay State.

“To see it all come together has been great,” Debeau said. “This is the stage that they all deserve.”

“It’s crazy how quickly they were able to erect something so amazing,” Bowler added.

Amanda Giroux danced away from the stage while The Bends performed at Mojo Boston on Saturday.Andrew Burke-Stevenson/for The Boston Globe

Formed in New Bedford, the band Autumn Drive was one of 18 acts that performed at Mojo Boston, and they are no strangers to a Mojo show.

“We’ve done, I think, every single Mojo that there is, so we’re very tight with them,” said guitarist and singer Charlie Gamache. “When we found out they were doing a big festival [in Boston], I was like, ‘We want in no matter what.’”

Advertisement

The band emphasized how much their relationship with Mojo has meant to them over the years, with Autumn Drive drummer Joe Gauvin praising the organization for “always putting us in front of a crowd that’s there to see music and hear us.”

Michael Asulin, of Stoughton, and Jackie Ludicke, of Miami, Fla., talked while wearing ponchos at Mojo Boston on Saturday.Andrew Burke-Stevenson/for The Boston Globe

From a makeshift stage out of wooden pallets in his basement to Boston’s City Hall Plaza, Blacker is is proud of Mojo’s success and is already looking ahead at what’s to come.

“This is really just the start of it all,” said Blacker. “We have very lofty ambitions and goals, and we have nothing but confidence in our ability to accomplish everything we set out to do.”


Gitana Savage can be reached at gitana.savage@globe.com. Follow her on X @gitana_savage.





Source link

Advertisement
Continue Reading
Advertisement

Trending