Hi, Thanks a lot. I am going to use multiple microphones to track a
human speaker. However the microphones might be very close to the
person, such as cell phone case. Also, I don't have any prior knowledge
about the position of the person, and just hope to find it using some
algorithms. Is there any avaliable methods to do so? Thanks.
> You need to describe the physical situation in more detail and exactly
> what you want to determine. When you say ,"track" do you want to know
[quoted text clipped - 14 lines]
> However, you will need to perform some sort of signal processing in
> order to know that you are tracking the right signal.
sweaterman - 28 Jul 2005 21:32 GMT
Are there going to be many stationary microphones that the person will
be walking by?
asteel@bigpond.net.au - 28 Jul 2005 23:08 GMT
I think this group did something like what you describe -
http://www.bee.qut.edu.au/projects/saivt/speechaudiolab/
Andrew
> Are there going to be many stationary microphones that the person will
> be walking by?
asteel@bigpond.net.au - 28 Jul 2005 23:10 GMT
Hi,
I think this group did something similar
http://www.bee.qut.edu.au/projects/saivt/speechaudiolab/
Andrew
Angelo Campanella - 31 Jul 2005 14:59 GMT
> Hi, Thanks a lot. I am going to use multiple microphones to track a
> human speaker. However the microphones might be very close to the
> person, such as cell phone case. Also, I don't have any prior knowledge
> about the position of the person, and just hope to find it using some
> algorithms. Is there any avaliable methods to do so? Thanks.
If there are two microphones on a horizontal line, and the task is to
determine the azimuth of the talker, then the "zero crossings" technique
will work. Here, the signal from a microphone is amplified to a very
high level, clipping occurs, but such ac waves, no matter what
frequency, always cross zero twice per cycle. A zero crossing detector
(a common DSP algorithm) then signals that fact every time.
A coincidence error detector is then applied, whose output of course is
the time delay of one signal vs the other. When the talker is on the
normal to the ,line between mics, this error is zero. If it is not zero,
it's value is proportional the time delay and the microphone spacing.
Now you can do one of two things: you can either compute the azimuth,
or you can feed the error signal to a motor that drives a platform to
position the microphone normal line at the talker.
For voice tracking in a noisy environment, you have to select a narrow
band of sound... perhaps the voice fundamental, perhaps the vowel area
around 500 Hz, I'm not sure.
That narrow band signal is then the signal to which the above mentioned
DSP is applied.
For multiple microphones, A more sophisticated matrix approach with
regard to the subsequent handling of the error signals between many
microphone pairs is required; invention left up to you.
Angelo Campanella