transpixel.gif (43 bytes)
You're at Voice Recognition Breaking News. Click to return home.
Breaking News
Industry Trends
Breaking News
Products
Information
User Profiles
Demos
Order



 

Originating publication
August 01, 1998, Issue: 908
Section: Reviews

Voice Recognition Makes Itself Heard
Owen Linderholm

Ever since 2001: A Space Odyssey's HAL made talking to your computer a popular idea, voice recognition has been a goal for several software companies.

A few years ago, Dragon Systems, IBM and Kurzweil introduced PC voice-recognition programs that take dictation. To put it mildly, these programs weren't very good. But last year, releases from Dragon and IBM took a huge step forward. Although they still required you to spend long hours training the programs to understand your voice inflections, the dictation actually worked.

The latest releases of these programs take more than another step forward; they're completely new and advanced products. We looked at beta versions of Dragon Systems' new NaturallySpeaking Preferred Edition 3.0 and IBM's new ViaVoice 98 Executive Edition, as well as VoiceXpress Plus from Lernout & Hauspie (which purchased Kurzweil). All three let you dictate and correct, control and format your document purely through verbal commands. In addition, both NaturallySpeaking and ViaVoice allow even greater control over your computer by permitting you to dictate into almost any application. Though ViaVoice also gives you control over much of Windows itself, NaturallySpeaking proved the best, with superior voice-recognition accuracy and speed, and acceptable voice control of applications and Windows.

NaturallySpeaking Preferred Edition 3.0

NaturallySpeaking is the least obtrusive of the three programs I tested-once it is up and running, it waits as a small microphone icon on your taskbar. However, the process of getting the program going isn't easy. As with other products, you must perform enrollment, a training process that teaches the software to adapt to your voice and inflections. In NaturallySpeaking's case, you need to choose one of several passages to train with; each takes about half an hour to complete. NaturallySpeaking, like the other programs, also adjusts audio and microphone volume levels and measures the sound level in the background.

Once you've finished reading the enrollment passage, NaturallySpeaking spends an additional quarter of an hour processing your input and adapting to the way you speak. Finally, the setup wizard asks you to load a representative sample of your existing documents into its Vocabulary Builder. It analyzes these to identify your typical sentence structures and to notice the kinds of words you use. At the end, it asks you to train the system on all those words it doesn't already recognize.

Finally, NaturallySpeaking runs a short multimedia presentation on how to make best use of the program before calling up the application itself. NaturallySpeaking automatically launches its main screen, which is a simple word processor for taking dictation, with an extended menu system to control voice recognition operations and options.

I found NaturallySpeaking remarkably accurate right after enrollment and a quick practice: I dictated my 495-word test document straight into the program, correcting errors as I went. NaturallySpeaking's big advantage over its competitors is a winning combination of superior initial recognition and the ability to select phrases as well as single words for correction.

Why is that important? Voice recognition programs don't typically recognize single words in context. For example, the sentence "It was too much to have to eat two pies" has four occurrences of a word pronounced the same way, but with three meanings and spellings. That means you have to say the whole sentence at once rather than pausing after each word. Only programs that can make use of context could get this right. NaturallySpeaking actually heard this sentence correctly the first time.

The point is, when these programs make mistakes (as they often do), they typically make mistakes over whole phrases rather than single words. Consequently, you want to be able to correct the whole phrase in one shot, and NaturallySpeaking is the only program that lets you rapidly select and correct an entire phrase.

Because making corrections was so easy, I was able to dictate and correct my test document (with business text, a normal prose passage and a short snippet of poetry) in only 11 minutes, 28 seconds, at a speed of 43 corrected words per minute. I'm not a lightning-fast typist, nor terribly slow, but I was able to retype the same passage-again, going back and completely correcting it-at 37.4 corrected words per minute.

We chose to measure corrected words per minute for this test since most users are looking to create final, accurate documents. Most importantly, I was able to dictate and correct this document without touching the mouse or keyboard, making NaturallySpeaking truly beneficial for those unable to type because of physical handicaps, ailments or illness.

NaturallySpeaking falls short in one area, however. Although it can control individual applications, it cannot effectively control the operation of Windows itself. An application called MouseGrid lets you move the cursor around the screen and select and click items. You can also select most items in menus and dialog boxes using only your voice, but it's too much of a struggle to be truly worthwhile. IBM's ViaVoice 98 Executive Edition does a better job of controlling the Windows environment easily, even though it lacks a function to directly control the mouse cursor's position.

Overall, NaturallySpeaking Preferred Edition is a breakthrough in dictation and voice recognition. At $159 it's the most expensive of these three programs, but it's still relatively cheap. Dragon Systems also sells a less capable version (the Personal Edition) for $99 and a simple dictation-only program called Point & Speak for $59. NaturallySpeaking Preferred 3.0 is the first voice recognition program we've seen that has real utility, and it easily earns a spot on the WinList.

ViaVoice 98 Executive Edition

ViaVoice 98 actually comes in three flavors: Home, Office and Executive versions. The Home Edition lets you dictate into its own word processor, called SpeakPad, and into Microsoft's Word 97; you can control and format documents in both. The Office Edition adds the ability to control the Windows Desktop and to launch, control and close other applications. It also includes an extended vocabulary for business and finance. The Executive Edition adds the ability to dictate text into any application at all, with correction and voice command functions in all applications.

I tested the Executive Edition. It proved easy to install and set up, and took about half an hour to enroll.

ViaVoice 98 uses a VoiceCenter command bar across the top of the screen that provides access to voice recognition commands and options. Very extensive help is always available and can be opened and searched using simple voice commands.

Dictation was decent, but not as good as in NaturallySpeaking Preferred Edition 3.0. Although you can correct as you go along in ViaVoice, I eventually found it faster to dictate the entire document and then return to fix errors. Using this technique, I was able to dictate and correct my test document in 14 minutes, 20 seconds, for a speed of 34.5 corrected words per minute. However, I did have to use the mouse a couple of times. As an example of the top speed you can expect to get with these programs, I also measured how long it took to dictate and correct the document, making full use of the mouse and keyboard where expedient. Using this technique, I finished the test document in 10 minutes, 43 seconds for a speed of 46 corrected words per minute-close to NaturallySpeaking's speed without the keyboard and mouse.

I found ViaVoice 98 to be good at recognition and very easy to use, but corrections can be tedious since the program can only handle a single word at a time. Making corrections using voice-only was a struggle. In particular, I had trouble spelling words out using letters and had to resort to using the alpha-bravo-charlie phonetic alphabet-a useful tool to guarantee accuracy, but a time-consuming process.

Launching applications is a snap with ViaVoice 98. You simply say something like "Open program Calculator" to launch Calculator. You can then say what you want the Calculator to do: For example, you'd say "two thousand and seventeen divided by twenty-three equals" to produce an answer. You can also give commands like "Move Notepad window two inches right." All in all, command and control of Windows and applications in ViaVoice 98 was a snap.

ViaVoice 98 includes a lot of voice tools, as well as a vocabulary extender that analyzes your existing documents for writing style and vocabulary to improve recognition. IBM also provides very good documentation on how to use the program most effectively-far more than any of the other programs.

I would recommend ViaVoice 98 Executive Edition to anyone who needs or wants to use voice to control the overall operation of a PC. It isn't as good for dictation as NaturallySpeaking Preferred, so it doesn't make the WinList. But at $149, ViaVoice 98 is a very good value and a worthwhile consideration.

VoiceXpress Plus

Lernout & Hauspie's VoiceXpress Plus is the oldest of these programs. It's also the only one I looked at in a final shipping version. Installation and enrollment was harder for VoiceXpress Plus than it was for the other programs. It takes a very long time to load whenever it's used, and since it has the worst recognition of the three (but still far better than previous-generation products), it also takes longer to enroll.

I found myself repeating sections a couple of times to get them right. Enrollment takes well over an hour.

In most other respects, VoiceXpress Plus is remarkably similar to ViaVoice 98. Both programs use a command bar at the top of the screen to control voice recognition commands and options, but VoiceXpress Plus only works within its own VoicePad application and Word 97.

VoiceXpress Plus has good initial recognition-about as good as ViaVoice 98. However, it's much harder to correct errors in VoiceXpress. You can select and correct single words by saying "correct" followed by the incorrect word to bring up a correction dialog box. But, unlike the other programs, VoiceXpress Plus doesn't offer a list of potential matches. And you must spell out your correction into the dialog box. While this is sometimes the method you have to fall back on with the other two programs, it's the only option in VoiceXpress Plus, and it makes correction slow and difficult. It took 20 minutes, 26 seconds to dictate and correct the test document with VoiceXpress Plus, and I was forced to use the keyboard frequently. This translates to 24.2 corrected words per minute-not terrible, but not nearly as good as NaturallySpeaking Preferred or ViaVoice.

VoiceXpress is easy to use and gives you a few novel features, such as the ability to set it for female or male speakers in various age ranges before you begin enrollment. It has a very good vocabulary extender, a lot like that of NaturallySpeaking Preferred. But it doesn't recognize speech well enough or work for anything other than dictation into its own word processor or Word 97. It is, however, the cheapest of the three programs and is perhaps more fairly compared to the lower-end versions of the other programs. Even so, they still have better basic recognition accuracy. We recommend you look at NaturallySpeaking Preferred or ViaVoice 98 instead.

Coming in loud and clear

There is more in common among these programs than there are differences. They all ship with headset microphones to assure good audio quality. They all do an adequate or better job of dictating into word processors, and they all allow you to format, correct and control your document while dictating. They are also all susceptible to abrupt changes in background noise and require extensive enrollment to be effective. Paradoxically, all work best if you can keep up a decent clip while speaking, since they use context to recognize words and differentiate commands from dictation.

NaturallySpeaking Preferred Edition 3.0 and ViaVoice 98 Executive Edition stand out because of their ability to work with other applications, control the Desktop and, in particular, produce higher recognition quality. ViaVoice 98 gets the nod if you need more control over the Windows environment. But if pure dictation is your primary concern, get NaturallySpeaking Preferred Edition 3.0-we're adding it to the WinList.

--Quick View--

NaturallySpeaking Preferred Edition 3.0

Bottom Line: Best program for pure dictation into any application

Price: $159

Platforms: 95, NT

Pros: Takes dictation and lets you correct without mouse or keyboard; very accurate

Cons: Hard to control Windows; less formatting and fewer control commands for documents

Strongest Rival: ViaVoice 98 Executive Edition

Dragon Systems, 800-4-DRAGON, 617-965-5200. Winfo #671

--

ViaVoice 98 Executive Edition

Bottom Line: Good program for dictation that can also control all applications and Windows by voice

Price: $149

Platforms: 95, NT

Pros: Good control of Windows interface; can take dictation into any application; decent accuracy

Cons: A bit harder to correct unrecognized words

Strongest Rival: NaturallySpeaking Preferred Edition 3.0

IBM Corp., 800-825-5263, 914-642-3000.

--

VoiceXpress Plus

Bottom Line: Decent accuracy, but can format and control only in Word

Price: $99.99

Platforms: 95, NT

Pros: Accuracy; decent format and control in Word

Cons: Can't control Windows or applications other than Word; difficult to make corrections

Strongest Rival: NaturallySpeaking Preferred Edition 3.0

Lernout & Hauspie, 800-380-1234, 781-203-5000. Winfo #769

SIDEBAR: Ten Tips to Help Your PC Hear You Better

1. Use the headset microphone that comes with the program, not the crummy microphone that came with your PC.

2. Work in a consistent environment; keep background noise as low as you can make it.

3. Do all the training and enrollment the program wants.

4. Always go back and correct dictation mistakes properly using the program, rather than just retyping them. This helps the program to learn.

5. Use the vocabulary extender with your existing documents.

6. Be sure different users enroll with their own settings.

7. Make different enrollments and speaker settings for different environments: notebook, in office, at home, traveling on an airplane and so forth.

8. Adjust your microphone carefully and reset its volume level every time you put it on.

9. Speak clearly and distinctly, but without pausing.

10. Give the program time to learn the way you speak. It really does get better and better.

Copyright ® 1998 CMP Media Inc.



|
Speech Recognition Software, Digital Recorder and Microphone Industry Trends | News in the Computer Software Industry as it relates to Speech Recognition Technology | Product Index: Dragon NaturallySpeaking IBM Via Voice Philips SpeechMike SONY Digital Recorders Olympus ScanSoft Software iCommunicator Sign Language Program VXI and Sign Language Software | Speech Recognition Technology Information | User Profiles of Doctors, Lawyers, Students and Schools, Educators and Disabled or Assistive Technology Users | Free Demonstration of New Speech Recognition Technology | Order Voice Recognition Computing Software |

© 1997-1998 21st Century Eloquence. All rights reserved.
Voice Recognition Computer Software