Skip to content

[speech-to-text] Integrate ASR Post-Processor Speech Transcript Enrichment #1255

@JessBozorg

Description

@JessBozorg

Task

A new feature has been added to Watson Speech-to-Text service which needs to be added to the Java SDK. To enable enrichment in API calls, add the parameter:

enrichments("punctuation") (enables punctuation and capitalization enrichment)

Response looks like this:

{
   "speaker_labels": [
      {
         "from": 52.52,
         "to": 52.7,
         "speaker": 0,
         "confidence": 1.0,
         "final": false
      }
   ],
   "enriched_results": {
      "transcript": {
         "text": "Oh là, et bon, il faudrait me payer ma complation, là. Et vous avez ajouté à le fournir, me payer ma complatigne.",
         "timestamp": {
            "from": 0.22,
            "to": 59.68
         }
      },
      "status": "success"
   }
}

Supported languages

  • English (US, UK, Australia, India)
  • French (France, Canada)
  • German
  • Italian
  • Portuguese (Brazil, Portugal)
  • Spanish (Spain, Latin America, Argentina, Chile, Colombia, Mexico, Peru)
  • Japanese

More details on this feature can be found here: IBM Cloud Docs – Speech Transcript Enrichment

Related: #1252

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions