The Commons


Patent Title: Automatic indexing and aligning of audio and text using speech recognition

Assignee: IBM
Patent Number: US5649060
Issue Date: 07-15-1997
Application Number:
File Date: 10-23-1995

Abstract: A method of automatically aligning a written transcript with speech in video and audio clips. The basic component of the disclosed technique is an automatic speech recognizer. The automatic speech recognizer decodes speech (recorded on a tape) and produces a file of decoded text. This decoded text is then matched with the original written transcript by identifying similar words or clusters of words. The result of this matching is an alignment of the speech with the original transcript. The method can be used (a) to index video clips, (b) for "teleprompting" (i.e., showing the next portion of text when someone is reading from a television screen), or (c) to enhance editing of a text that was dictated to a stenographer or recorded on a tape for subsequent transcription by a typist.
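
The matching step described in the abstract can be illustrated with a minimal sketch. This is not the patented method: it substitutes Python's standard `difflib.SequenceMatcher` for the patent's word and word-cluster matching, and it assumes a hypothetical recognizer output in the form of `(word, start_seconds)` pairs. The function name `align_transcript` and the sample data are invented for illustration.

```python
from difflib import SequenceMatcher


def normalize(word):
    """Lowercase a word and strip surrounding punctuation before comparison."""
    return word.lower().strip(".,;:!?\"'")


def align_transcript(transcript_words, decoded):
    """Map transcript word indices to timestamps from decoded speech.

    transcript_words: list of words from the written transcript.
    decoded: list of (word, start_seconds) pairs, an assumed format for the
             output of a speech recognizer (hypothetical, not from the patent).
    Returns a dict {transcript_index: start_seconds} covering only the words
    that fall inside a run of words shared by both sequences.
    """
    ref = [normalize(w) for w in transcript_words]
    hyp = [normalize(w) for w, _ in decoded]
    # autojunk=False disables difflib's popularity heuristic for long inputs.
    matcher = SequenceMatcher(a=ref, b=hyp, autojunk=False)
    anchors = {}
    for block in matcher.get_matching_blocks():
        # Each block is a run of identical words; the final block has size 0.
        for k in range(block.size):
            anchors[block.a + k] = decoded[block.b + k][1]
    return anchors


# Illustrative data: the recognizer misheard "fox" as "box" and "the" as "a".
transcript = "The quick brown fox jumps over the lazy dog.".split()
decoded = [("the", 0.0), ("quick", 0.4), ("brown", 0.7), ("box", 1.0),
           ("jumps", 1.3), ("over", 1.7), ("a", 2.0), ("lazy", 2.1),
           ("dog", 2.5)]
anchors = align_transcript(transcript, decoded)
for i, word in enumerate(transcript):
    print(word, anchors.get(i))
```

Misrecognized words receive no timestamp of their own in this sketch; an index built on the result could estimate their times from the nearest aligned neighbors on either side.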



IBM Pledge dated 1/11/2005