Figure 11: Hough-transformed AI of vowel “i” by male
speaker B with maximum points shown (bottom), and the
corresponding delay trajectories with curves drawn back
based on maximum point information (top). Please note
that despite the similarity to Figure 8, r=2 in this case.
8 RESULTS
It has been shown that after the Hough-transfor-
mation of the auditory image, vowels can be recog-
nized even with very simple processing methods.
Despite the simplicity of the algorithm, recognition
is speaker-independent for selected vowels (a, o, u).
We insist that a competent (neural) system could do
a more extensive and yet robust recognition based
on H
τ
and ρ.
9 CONCLUSIONS
The application of the Hough-transform to the
neurotransmitter vesicle release distribution yields
good results, especially in procuring invariant
parameter settings for vowel descriptions for
different speakers. According to these findings, the
authors will try to model several computational
maps in the brain structured to execute Hough-
transforms. Furthermore, more sophisticated post-
processing methods are being investigated to yield a
more robust and possibly automated vowel
recognition.
ACKNOWLEDGEMENTS
We acknowledge the help of and would like to thank
Johannes Katzmann for his efficient Hubel-Wiesel
learning method (Katzmann, 2005), Andreas
Brückmann for the Hubel-Wiesel network
configuration program, and Gero Szepannek for the
stochastic modelling of the IHCs.
REFERENCES
Shapiro, S.D., 1978. Feature Space Transforms for Curve
Detection. Pattern Recognition, 10, pp 129–143.
Hough, P.V.C., 1959. Machine analysis of bubble
chamber pictures. Proceedings of the International
Conference on High-Energy Accelerators and
Instrumentation, L. Kowarski (Editor), CERN, pp
554–556.
Hough, P.V.C., 1962. Method and means for recognizing
complex patterns. US Patent 3069654.
Duda, E.O., Hart, P.E., 1972. Use of the Hough transform
to detect lines and curves in pictures. Comm. ACM, pp
11–15.
Ballard, D.H., 1981. Generalizing the Hough transform to
detect arbitrary shapes. Pattern Recognition 13, pp
111–122.
Ohlsson, M., Peterson, C., Yuille, A.L., 1992. Track
finding with deformable templates - the elastic arms
approach. Comp. Phys. Comm. vol. 71, pp 77–98.
Davis, E.R., 1992. Modelling peak shapes obtained by
Hough transform, IEEE proceedings vol. 139, 1.
Swaaij, M. Van, Catthoor, F., De Man, H., 1990. Deriving
ASIC architectures for the Hough transform. Parallel
Computing 16, pp 113–121.
Hubel, D.H., Wiesel, T.N., Stryker, M.P., 1978.
Anatomical demonstration of orientation columns in
macaque monkey. Journal of Comparative Neurology
177, pp 361–380.
Epstein, A., Paul, G.U., Vettermann, B., Boulin, C.,
Klefenz., F., 2001. A parallel systolic array ASIC for
real time execution of the Hough-transform. E. S.
Peris, A. F. Soria, V. G. Millan (Editors). Proceedings
of the 12th IEEE International Congress on Real Time
for Nuclear and Plasma Sciences, Valencia, pp 68–72.
Röver, Ch., Klefenz, F., Weihs, C., 2004. Identification of
musical Instruments by Means of the Hough
Transform. C. Weihs, W. Gaul (Editors). Proceedings
of the 28
th
Annual Conference of the Gesellschaft für
Klassifikation e. V., University of Dortmund, pp 608–
615.
Brückmann, A., Klefenz, F., Wünsche, A., 2004. A
neural net for 2d-slope and sinusoidal shape detection.
Int. Scient. J. of Computing 3 (1), pp 21–26.
Greenberg, S., Poeppel, D., Roberts, T., 1997. A Space-
Time Theory of Pitch and Timbre Based on Cortical
Expansion of the Cochlear Traveling Wave Delay.
Proceedings of the XI
th
Int. Symp. on Hearing,
Grantham.
Baumgarte, F., 2000. Ein psychophysiologisches Gehör-
modell zur Nachbildung von Wahrnehmungs-
schwellen für die Audiocodierung. Doctor Thesis,
University of Hannover, Germany.
Sumner, C.J., O'Mard, L.P., Lopez-Poveda, E.A. and
Meddis, R., 2002. A revised model of the inner-hair
cell and auditory nerve complex. Journal of the
Acoustical Society of America, 111, pp 2178–2189.
Katzmann, J., 2005. Echtzeitfähige, auf der Hough-
Transformation basierende Methoden der
Bildverarbeitung zur Detektion von Ellipsen. Diploma
Thesis, University of Ilmenau, Germany.
VISAPP 2006 - IMAGE ANALYSIS
256