IPB

Welcome Guest ( Log In | Register )


 
Reply to this topicStart new topic
> speaker-independent system for notes
speaker-independ...
post Feb 8 2005, 03:09 AM
Post #1


Junior Member
**

Group: Members
Posts: 22
Joined: 7-February 05
Member No.: 1,747



Hi all,

I am interested in applying DNS to a system for speaker-independent VR for rough notes, similar to a project of a group at UC Berkeley; see their project description at this website:
http://www-inst.eecs.berkeley.edu/~assist/hearing.html

I don't need near-perfect accuracy but just tolerable accuracy to have rough notes (with some errors) of what has been said-- these notes never become a document but serve as a transient reference during consecutive interpretation (verbal translation) work.

Does anyone have any ideas on how to get this working? Specifically, does anyone know of a unified pitch device or software that can unify pitches before the sound is entered into Dragon?

Thanks!!!!!
Go to the top of the page
 
+Quote Post
John Wickett
post Feb 8 2005, 01:57 PM
Post #2


Moderator
****

Group: Super Moderators
Posts: 4,266
Joined: 9-January 01
From: wickworks@xplornet.com
Member No.: 51
VR User?: about 18 years
Which Program?: DragonDictate then DNS
Why do you Use VR?: Ease, speed, and to stay current to assist disabled clients
Where are you Located: Villsge Green, PEI, Canada



Just do a search on Google for "voice processors" and you'll see a number of devices that are originally designed for music but certainly work with voice. Either that, or talk to a sound engineer at a radio station or TV station and they can easily demonstrate how you can, for example, make everyone sound like Donald Duck.

John
Dictated with Dragon Naturally Speaking Version 8 Professional
John Wickett
PEI Canada
902-368-0141
902-651-2152
jwickett@auracom.com
john.wickett@vac-acc.gc.ca
Go to the top of the page
 
+Quote Post
speaker-independ...
post Feb 8 2005, 05:03 PM
Post #3


Junior Member
**

Group: Members
Posts: 22
Joined: 7-February 05
Member No.: 1,747



Thanks, John. As I am interested in a minimalist setting that doesn't look so big during, say, a conference (when I do interpretation), I don't wish to use a big machine with all those fancy features that the music industry uses. Would you or anyone know about software or a very small device that can unify pitches? The Berkeley project report says it might be possible to use a DVP (digital voice processing) "chip" by itself, but I haven't a clue as to how to get that chip or how to set it up. I've written to the professor but haven't got a response yet.


QUOTE
Originally posted by John Wickett:
Just do a search on Google for \"voice processors\" and you'll see a number of devices that are originally designed for music but certainly work with voice.  Either that, or talk to a sound engineer at a radio station or TV station and they can easily demonstrate how you can, for example, make everyone sound like Donald Duck.

John
Dictated with Dragon Naturally Speaking Version 8 Professional
John Wickett
PEI Canada
902-368-0141
902-651-2152
jwickett@auracom.com
john.wickett@vac-acc.gc.ca
Go to the top of the page
 
+Quote Post
John Wickett
post Feb 9 2005, 01:28 PM
Post #4


Moderator
****

Group: Super Moderators
Posts: 4,266
Joined: 9-January 01
From: wickworks@xplornet.com
Member No.: 51
VR User?: about 18 years
Which Program?: DragonDictate then DNS
Why do you Use VR?: Ease, speed, and to stay current to assist disabled clients
Where are you Located: Villsge Green, PEI, Canada



Speaker,

In my experience the DVP chip does not work as it does not apply the same voice modelling to each user. I think that your choice here is to use a voice processor (they aren't that big) or spend over $10,000.00 to get custom made soundcard with the inherently possible software conflicts.

John
Go to the top of the page
 
+Quote Post
speaker-independ...
post Feb 10 2005, 06:27 AM
Post #5


Junior Member
**

Group: Members
Posts: 22
Joined: 7-February 05
Member No.: 1,747



John,

Thanks. Sorry, I meant "DSP" chip, not "DVP" chip. What do you think about a DSP chip?

In my experience the DVP chip does not work as it does not apply the same voice modelling to each user. John[/B][/QUOTE]
Go to the top of the page
 
+Quote Post
speaker-independ...
post Feb 10 2005, 07:12 AM
Post #6


Junior Member
**

Group: Members
Posts: 22
Joined: 7-February 05
Member No.: 1,747



John, I did a search on Google. What do you think about the ZSP voice processor? from the picture it looks like a small square that I guess can be installed inside a computer, but I'm not sure if it can unify pitches. The big blue machines with all the dials are too big and visually intrusive for what I need to do.

Is there any kind of software or algorithm I can install on Windows XP to automatically change all incoming voice into a voice with the same pitch? Even if there is still variation in other properties of the voices, it's ok, as I am not looking for great accuracy and would be willing to experiment with only pitch at this time? And then route that unified-pitch voice to Dragon? Any ideas about such software or who could customize it?

Thanks again for all your patient indulgence.

QUOTE
Originally posted by John Wickett:
Speaker,

In my experience the DVP chip does not work as it does not apply the same voice modelling to each user. I think that your choice here is to use a voice processor (they aren't that big) or spend over $10,000.00 to get custom made soundcard with the inherently possible software conflicts.

John
Go to the top of the page
 
+Quote Post
John Wickett
post Feb 10 2005, 01:47 PM
Post #7


Moderator
****

Group: Super Moderators
Posts: 4,266
Joined: 9-January 01
From: wickworks@xplornet.com
Member No.: 51
VR User?: about 18 years
Which Program?: DragonDictate then DNS
Why do you Use VR?: Ease, speed, and to stay current to assist disabled clients
Where are you Located: Villsge Green, PEI, Canada



>>What do you think about the ZSP voice processor

I haven't tried it, but after reading about it I see it is designed for VoIP and telephony - totally different applications than voice recognition.

I tend to go with the Symetrix voice processors and I don't find them to be very intrusive.

John
Dictated with Dragon Naturally Speaking Version 8 Professional
John Wickett
PEI Canada
902-368-0141
902-651-2152
jwickett@auracom.com
john.wickett@vac-acc.gc.ca
Go to the top of the page
 
+Quote Post
speaker-independ...
post Feb 10 2005, 09:38 PM
Post #8


Junior Member
**

Group: Members
Posts: 22
Joined: 7-February 05
Member No.: 1,747



John,

That's great to hear that you have experience with this! What do you use the Symetrix voice processors for? For a system approximating speaker-independent VR? If so, I'd love to hear more of the details of the system you have, to learn from your experience.

I am really tech illiterate; what do you think about the DSP (digital signal processor) chip? is it basically the same thing as a DVP chip?

Thanks a million again!


QUOTE
Originally posted by John Wickett:
>>What do you think about the ZSP voice processor

I haven't tried it, but after reading about it I see it is designed for VoIP and telephony - totally different applications than voice recognition.

I tend to go with the Symetrix voice processors and I don't find them to be very intrusive.

John
Dictated with Dragon Naturally Speaking Version 8 Professional
John Wickett
PEI Canada
902-368-0141
902-651-2152
jwickett@auracom.com
john.wickett@vac-acc.gc.ca  
Go to the top of the page
 
+Quote Post
speaker-independ...
post Feb 10 2005, 09:58 PM
Post #9


Junior Member
**

Group: Members
Posts: 22
Joined: 7-February 05
Member No.: 1,747



I just went to the Symetrix website; about how big (length and width) are the 528E and the 628? They look kind-of big; I think it might look weird if I put something like that on a conference table.

I discovered recently that my Toshiba Portege 3505 PC has a great built-in internal microphone or mic array inside the monitor; when I use the "microphone boost" feature, it can can pick up a voice clearly and crisply from a couple yards away, even if the voice is speaking to the back side of the monitor. And this is ideal, so I wouldn't have to have a visually unpleasing external microphone stuck to the top of my monitor during a conference and I wouldn't need to bother someone to wear a lapel mic. So now what I am directing all my attention to is finding some kind of software that I can install directly on my operating system, or some kind of very small hardware that I can install inside my computer. Any ideas here? Thanks again!
Go to the top of the page
 
+Quote Post
John Wickett
post Feb 11 2005, 01:39 PM
Post #10


Moderator
****

Group: Super Moderators
Posts: 4,266
Joined: 9-January 01
From: wickworks@xplornet.com
Member No.: 51
VR User?: about 18 years
Which Program?: DragonDictate then DNS
Why do you Use VR?: Ease, speed, and to stay current to assist disabled clients
Where are you Located: Villsge Green, PEI, Canada



speaker,

>>what do you think about the DSP (digital signal processor) chip? is it basically the same thing as a DVP chip?

DSP is basically for telephony while DVP is intended for video.

>>What do you use the Symetrix voice processors for?

Many of my clients are disabled and have dysarthric or aphasic speech, so it is necessary to alter the audio input to the soundcard so their voices are recognizable by DNS.

The Symetrix 528e is about two thirds the size of a keyboard and is one of my favorites. However, I am not using this for multiple voice input and, as far as I'm aware, there is no product on the market that will deliver what you are looking for. At this point in time you will basically have to settle for a piece of hardware that will shrink all voices so that they are as similar as possible and then deal with the high error rate. The recognition of multiple voices is not supported at this time, but my three sons can use my voice file without any problems. If my daughter or grandchildren try to use my voice file the results can be almost incomprehensible.

We are well aware that a lot of people are looking for this technology and hopefully it will be available in the near future. In the meantime, all you can do is try to alter voices prior to input to the computer.

John
Dictated with Dragon Naturally Speaking Version 8 Professional
John Wickett
PEI Canada
902-368-0141
902-651-2152
jwickett@auracom.com
john.wickett@vac-acc.gc.ca
Go to the top of the page
 
+Quote Post
speaker-independ...
post Feb 11 2005, 07:48 PM
Post #11


Junior Member
**

Group: Members
Posts: 22
Joined: 7-February 05
Member No.: 1,747



John,

>>> but my three sons can use my voice file without any problems. If my daughter or grandchildren try to use my voice file the results can be almost incomprehensible.


This is good to hear! This confirms the results of a similar experiment with men and women's voice files described on the ASSIST webpage at UC Berkeley. Just to confirm-- are you referring to your voice file after it has been processed by the Symetrix? Or without the Symetrix?

I am thinking of making different sample voice files, definitely at least one for adult men and another for adult women; I won't be working with children.

Thanks again, I am learning a lot.
Go to the top of the page
 
+Quote Post
speaker-independ...
post Feb 11 2005, 07:51 PM
Post #12


Junior Member
**

Group: Members
Posts: 22
Joined: 7-February 05
Member No.: 1,747



>>> Just to confirm-- are you referring to your voice file after it has been processed by the Symetrix? Or without the Symetrix?


Sorry, I meant to say "...after the SOUND of your voice has been processed..."
Go to the top of the page
 
+Quote Post
speaker-independ...
post Feb 11 2005, 07:59 PM
Post #13


Junior Member
**

Group: Members
Posts: 22
Joined: 7-February 05
Member No.: 1,747



John,

>>>> a piece of hardware that will shrink all voices so that they are as similar as possible


Is there way to set up a piece of hardware, for example the 528e, so that it automatically skrinks all incoming voices to the SAME PREDETERMINED level of "shrinkness" ? So that I wouldn't have to adjust the "shrinkness" every time a different speaker speaks? By the way, what does it mean to shrink a voice?

And can the 528e automatically change the pitch of a voice to a predetermined pitch value?
Go to the top of the page
 
+Quote Post
speaker-independ...
post Feb 12 2005, 07:00 AM
Post #14


Junior Member
**

Group: Members
Posts: 22
Joined: 7-February 05
Member No.: 1,747



John,

Do you think the "Voice Changer Software" (see webpage link below), which can change voices "in real time", could work in conjunction with Dragon? Would be there be a way to take the digital output of such a program and feed it in as the input sound of Dragon? The webpage for the software is:
http://www.softwarevault.com/Download/Voic...ftware-Download

Thanks again for your experienced views.
Go to the top of the page
 
+Quote Post
John Wickett
post Feb 12 2005, 01:42 PM
Post #15


Moderator
****

Group: Super Moderators
Posts: 4,266
Joined: 9-January 01
From: wickworks@xplornet.com
Member No.: 51
VR User?: about 18 years
Which Program?: DragonDictate then DNS
Why do you Use VR?: Ease, speed, and to stay current to assist disabled clients
Where are you Located: Villsge Green, PEI, Canada



speaker,

>>are you referring to your voice file after it has been processed by the Symetrix?

When my sons are using my voice file (I don't save it after they use it) there is no need for the Symetrix. Of course, having been raised by me, their vocabulary, phraseology and pronunciation are very similar to mine.

>>so that it automatically skrinks all incoming voices to the SAME PREDETERMINED level of "shrinkness" ? So that I wouldn't have to adjust the "shrinkness" every time a different speaker speaks? By the way, what does it mean to shrink a voice?

Once the Symetrix is adjusted, the output to the soundcard is very similar for all voices. "Shrinking" refers to modulating all audio input so that the output to the soundcard is almost a monotone; no highs or lows as you adjust once for the decibel and frequency range that is desired.

>>Do you think the "Voice Changer Software" (see webpage link below), which can change voices "in real time", could work in conjunction with Dragon?

I have never heard about it, but it sounds like it alters the output rather than the input.

I have to tell you that this is an imperfect solution to what you want, but it is probably the best you are going to get right now. You have said that you are willing to accept a lot of errors and that is what you are going to get - the hardware will just minimize them as much as possible.

John
Go to the top of the page
 
+Quote Post
speaker-independ...
post Feb 12 2005, 07:30 PM
Post #16


Junior Member
**

Group: Members
Posts: 22
Joined: 7-February 05
Member No.: 1,747



John,

>>> I have never heard about it, but it sounds like it alters the output rather than the input.


That is the concept I'm trying to get my head around; how can I make the voice output of a software package to be the voice input for DNS? Can ScanSoft customize DNS for me to do that? Or can I have something inside my computer rewired?
Go to the top of the page
 
+Quote Post
John Wickett
post Feb 13 2005, 01:51 PM
Post #17


Moderator
****

Group: Super Moderators
Posts: 4,266
Joined: 9-January 01
From: wickworks@xplornet.com
Member No.: 51
VR User?: about 18 years
Which Program?: DragonDictate then DNS
Why do you Use VR?: Ease, speed, and to stay current to assist disabled clients
Where are you Located: Villsge Green, PEI, Canada



speaker,

>>how can I make the voice output of a software package to be the voice input for DNS?

I am not sure what you mean by "software package", but in my experience you have to alter the voices before they get to your soundcard and that is why I am suggesting an external hardware device.

John
Dictated with Dragon Naturally Speaking Version 8 Professional
John Wickett
PEI Canada
902-368-0141
902-651-2152
jwickett@auracom.com
john.wickett@vac-acc.gc.ca
Go to the top of the page
 
+Quote Post
speaker-independ...
post Feb 16 2005, 08:47 PM
Post #18


Junior Member
**

Group: Members
Posts: 22
Joined: 7-February 05
Member No.: 1,747



>>>I am not sure what you mean by "software package", but in my experience you have to alter the voices before they get to your soundcard and that is why I am suggesting an external hardware device.


Hi John,

I experimented with a couple voice-changing software products, and good news! the AV Voice Changer 3.1 Gold software can change the microphone voice input in real time BEFORE the sound goes into DNS. On playback DNS remembers the voice that is outputted by the Voice Changer software. I guess the better Diamond version of their software can also do it, but I haven't tried yet.
Go to the top of the page
 
+Quote Post

Reply to this topicStart new topic
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:

 



RSS Lo-Fi Version Time is now: 19th June 2013 - 02:58 AM

We Recommend Using Dragon Medical and Dragon NaturallySpeaking Speech Recognition Software

Physicians Using Dragon Medical - Looking for the Best Electronic Health Record (EHR / EMR)?