Invalidity dossier

US 12236947

Flexible-format voice command

Current assignee: Cerence Operating Co

Added 5/5/2026, 12:00:13 PM

Active provider: Google · gemini-2.5-flash

Patent summary

Title, assignee, inventors, filing/issue dates, abstract, and a plain-language overview of the claims.

U.S. Patent 12,236,947 describes a system that processes voice commands more flexibly than the rigid formats many existing voice-activated assistants require.

Title: Flexible-format voice command

Assignee: Cerence Operating Company

Inventors: Bart D'hoore, Christoph Halboth, Holger Quast, Dino Seppi, Markus Funk, Tom Claes, Christophe Ris

Filing Date: July 10, 2023

Issue Date: February 25, 2025

Abstract:
The patent describes a voice-based system designed to process commands in a flexible format. This system allows a "wake word" to be positioned at various points within an utterance, not just at the beginning. The abstract suggests that, similar to natural speech, the system can be addressed by name within or at the end of a spoken command, or in some contexts, not at all.
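
The flexible wake-word placement described in the abstract can be illustrated with a minimal sketch. It is not the patented method: it works on a text transcript rather than audio, and the wake word "assistant" and the function name locate_wake_word are hypothetical choices made for illustration only.

```python
from typing import Optional

WAKE_WORD = "assistant"  # hypothetical wake word, not taken from the patent


def locate_wake_word(transcript: str, wake_word: str = WAKE_WORD) -> Optional[str]:
    """Return 'start', 'middle', or 'end' for the wake word's position, or None if absent."""
    # Normalize: split on whitespace, drop surrounding punctuation, lowercase.
    tokens = [t.strip(".,!?").lower() for t in transcript.split()]
    tokens = [t for t in tokens if t]
    if wake_word not in tokens:
        return None
    index = tokens.index(wake_word)  # position of the first occurrence
    if index == 0:
        return "start"
    if index == len(tokens) - 1:
        return "end"
    return "middle"
```

A rigid-format assistant would accept only the "start" case. The abstract describes a system that would also accept "middle" and "end" and, in some contexts, an utterance with no wake word at all.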

Overview of Independent Claims

The independent claims are summarized below in plain language for clarity.

Independent Claim 1:
The first independent claim outlines a method for processing voice commands. This method involves receiving a first audio input from a user's utterance and a corresponding video input of the user. A key aspect of this claim is the determination that the utterance contains a command directed to the system. This determination is based on processing both the audio and video inputs, with the video processing identifying a visual characteristic of the user as they speak. Once a command is identified, the system is caused to act on it. In essence, this claim covers a multi-modal approach to voice command recognition, using both audio and visual cues to ascertain the user's intent.

Independent Claim 17:
This claim also describes a method for processing voice commands using both audio and video inputs, similar to the first claim. However, it adds a crucial element: the state of a dialog between the user and the system is also used in determining if an utterance is a command. This means the system considers the ongoing conversation's context. For instance, if the system has just asked the user a question, a subsequent utterance is more likely to be interpreted as a command.
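
The way audio, visual, and dialog-state evidence might combine in such a determination can be sketched as follows. This is an illustration under stated assumptions, not the claimed method: the Observation fields, the additive weights, and the 0.5 threshold are all hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class Observation:
    audio_score: float     # hypothetical: audio-based likelihood that the speech is a command (0 to 1)
    facing_system: bool    # hypothetical visual characteristic: user faces the device while speaking
    awaiting_reply: bool   # hypothetical dialog state: the system has just asked the user a question


def is_directed_command(obs: Observation, threshold: float = 0.5) -> bool:
    """Decide whether an utterance is a command directed to the system."""
    score = obs.audio_score
    if obs.facing_system:
        score += 0.2  # illustrative weight for the visual cue
    if obs.awaiting_reply:
        score += 0.2  # illustrative weight for the dialog state
    return score >= threshold
```

With these illustrative numbers, an utterance with an audio score of 0.4 is rejected on audio alone but accepted when the user is facing the device or when the system has just asked a question.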

Independent Claim 18:
This claim describes the physical system that carries out the methods outlined in the other claims. It specifies a voice-based system that includes an audio input device (like a microphone), a video input device (like a camera), and a computing device. The computing device is configured to receive and process both audio and video inputs to identify a user's command, based on the audio content and visual cues from the user, and then to execute that command.

Uncertainty Note:
A search of the U.S. Court of Appeals for the Federal Circuit (CAFC) dockets for 2026 returned no results for patent number 12,236,947. This suggests that, as of the generation date of this dossier, no appeals concerning this patent have been publicly docketed. It does not rule out litigation, since a case may be pending at an earlier stage or not yet docketed at the appellate level. The information presented here is based on publicly available data and should not be considered a complete legal analysis.

Generated 5/5/2026, 12:03:59 PM