Microsoft’s Word writing tool will soon be able to record and transcribe audio, marking an evolution long requested by everyone from students to reporters to Microsoft executives. However, it has strikingly limited features when compared to competitors.
The new transcription technology, which will be made available showed it worked well recording output from a computer’s speakers to its internal microphone (so, no headphones plugged in). People can also upload prerecorded audio to the service.writing with Word via a web browser, allows people to both record and upload audio files to be transcribed often within moments. In demonstrations with reporters on Monday, Microsoft
But that’s where its features matching competitors ends and where the list of tasks it can’t perform start to pile up.
The transcription feature only works on the web version of Word, not on its desktop Windows or Mac apps and not on its mobile companions. Microsoft said it hopes to have the technology available for phones and tablets by the end of the year but wouldn’t commit to offering the technology for the desktop apps.
Competitors such asfor software can work with more languages, or work offline. And , for example, offer easier search, markup and sharing.
Microsoft said what it offers against competitors is the simplicity of recording, storing and accessing transcripts within its suite of apps.
“We’re really uniquely positioned to help provide a one-stop shop, where your audio, recording transcript, notes, and ultimately your story can all live together inside a single familiar secure tool,” said Dan Parish, Microsoft’s group program manager who worked on this new feature. He said the technology grew out of Microsoft’s effort to help people “spend less time and energy creating their best work, and really focus on what matters most.”
Microsoft’s move to offer transcription technology marks a change that even the company acknowledged was a long time coming. People are increasingly relying on voice-enabled technology for many aspects of their lives, whether it’s to turn up the music while they’re cooking in the kitchen, send a text message while driving, or find a help keep records of some of the president’s phone calls.. Even the US government relies on automated voice transcription to
As people increasingly adjust to working away from their office, Microsoft said its transcription software can help — both to keep notes and to act as a third hand if we’re suddenly interrupted by a child or pet during a meeting or brainstorming session.
Microsoft acknowledged the technology has limitations that the company hopes to make better.
For example, Microsoft said it will allow people to record unlimited audio if they use a web browser, but limits them to 300 minutes (five hours) per month if they record and upload later, such as if they’re in a classroom with poor internet. Microsoft also said each audio file people upload has to be 200MB, or about 75 minutes of low-quality, mono MP3 recording. Like other services, people can upload MP3, WAV, MP4 and M4A files, though other services such as Otter.ai, offer support various movie files too such as AVI, MOV and MPG.
Microsoft also said that while transcription of a recording made in Word will happen within moments of pressing stop, in part because Microsoft’s actually transcribing behind the scenes, an uploaded audio file could take as long as the recording to transcribe.
But Microsoft said it sees itself as “definitely right at the top of the industry” in terms of how accurate its service is. That’s in part thanks to its connections to the Azure Cognitive Services technology,.
“in general, obviously, we feel quite confident in the quality that our that we are producing here,” Parish said.