Bibliography

Next: Dynamic time warping Up: A Time Varying Appearance Previous: Final conclusions Index

Bibliography

1: F. J. Aherne, N. A. Thacker, and P. I. Rockett.
The Bhattacharyya metric as an absolute similarity measure for frequency coded data.
Kybernetika, 32(4):001-007, 1997.
2: Okan Arikan, David A. Forsyth, and James F. O'Brien.
Motion synthesis from annotations.
In Proceedings of SIGGRAPH, 2002.
3: G. Bailly.
Audiovisual speech synthesis.
In ETRW on Speech Synthesis, Perthshire, Scotland, 2001.
4: A. M. Baumberg and D. C. Hogg.
An efficient method for contour tracking using active shape models.
Technical Report 94.11, School of Computer Studies, University of Leeds, April 1994.
5: F. Bettinger and T. F. Cootes.
A model of facial behaviour.
In Proceedings of the $6^{th}$ International Conference on Automatic Face and Gesture Recognition, pages 123-128, Seoul, Korea, May 2004.
6: F. Bettinger, T. F. Cootes, and C. J. Taylor.
Modelling facial behaviours.
In Paul L. Rosin and David Marshall, editors, British Machine Vision Conference, pages 797-806, Cardiff, UK, September 2002.
7: M. Black and A. Jepson.
Recognizing temporal trajectories using the condensation algorithm.
In Proceedings of the 3rd IEEE International Conference on Automatic Face and Gesture Recognition, April 1998.
8: Andrew Blake and Michael Isard.
Active Contours.
Springer, 1998.
9: A. Bobick and J. Davis.
An appearance-based representation of action.
In 13th International Conference on Pattern Recognition, Vienna, Austria, August 1996.
10: A. Bobick and J. Davis.
The representation and recognition of action using temporal templates.
IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(3):257-267, March 2001.
11: Richard Bowden.
Learning statistical models of human motion.
In IEEE Workshop on Human Modelling, Analysis and Synthesis, CVPR, Hilton Head Island, July 2000.
12: M. Brand.
Coupled hidden Markov models for modeling interacting processes.
Technical Report TR-405, MIT Media lab Perceptual Computing / Learning and Common Sense, Cambridge, November 1996.
13: M. Brand and V. Kettnaker.
Discovery and segmentation of activities in video.
IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):844-851, August 2000.
14: M. Brand, N. Oliver, and A. Pentland.
Coupled hidden Markov models for complex action recognition.
In IEEE CVPR97, 1997.
15: Christoph Bregler.
Learning and recognizing human dynamics in video sequences.
In IEEE Conference on Computer Vision and Pattern Recognition, June 1997.
16: Christoph Bregler, Michele Covell, and Malcom Slaney.
Video rewrite: Driving visual speech with audio.
In Proceedings of SIGGRAPH, 1997.
17: Neill W. Campbell, Colin Dalton, David Gibson, and Barry Thomas.
Practical generation of video textures using the auto-regressive process.
In Paul L. Rosin and David Marshall, editors, British Machive Vision Conference, pages 434-443, September 2002.
18: Robert T. Collins and Yanxi Liu.
On-line selection of discriminative tracking features.
In Proceedings of the $9^{th}$ International Conference on Computer Vision (ICCV-03), pages 346-352, Nice, France, 2003.
19: D. Comaniciu and P. Meer.
Mean shift analysis and applications.
In Seventh International Conference on Computer Vision, pages 1197-1203, 1999.
20: Dorin Comanicui and Peter Meer.
Mean shift: A robust approach toward feature space analysis.
IEEE transactions on Pattern Analysis and Machine Intelligence, 24(5):603-619, May 2002.
21: T. Cootes and C. Taylor.
Statistical models of appearance for computer vision.
Technical report, University of Manchester, 1999.
22: T. F. Cootes, G. J. Edwards, and C. J. Taylor.
Active appearance models.
In H.Burkhardt & B. Neumann Ed's, editor, European Conference on Computer Vision, volume 2, pages 484-498. Springer, 1998.
23: T. F. Cootes and C. J. Taylor.
A mixture model for representing shape variation.
In BMVA Press, editor, Proceedings of British Machine Vision Conference, pages 110-119, 1997.
24: T. H. Cormen, C. E. Leiserson, and R. L. Rivest.
Introduction to Algorithms.
The MIT Press, 1997.
25: Darren Cosker, David Marshall, Paul Rosin, and Yulia Hicks.
Video realistic talking heads using hierarchical non-linear speech-appearance models.
In Proceedings of Mirage, France, March 2003.
26: J. Davis.
Hierarchical motion history images for recognizing human motion.
In IEEE Workshop on Detection and Recognition of Events in Video, pages 39-46, Vancouver, Canada, July 2001.
27: A. P. Dempster, N. M. Laird, and D. B. Rubin.
Maximum likelihood from incomplete data via the EM algorithm (with discussion).
Journal of the Royal Statistical Society series B, 39:1-38, 1977.
28: Vincent E. Devin and David C. Hogg.
Reactive memories: An interactive talking-head.
In British Machine Vision Conference, pages 603-612, September 2001.
29: Ian L. Dryen and Kanti V. Mardia.
Statistical Shape Analysis.
John Wiley & Sons, 1998.
30: G. Edwards, C. Taylor, and T. Cootes.
Learning to identify and track faces in image sequences.
In 8 th British Machine Vison Conference, pages 130-139, Colchester, UK, 1997.
31: P. Eisert, S. Chaudhuri, and B. Girod.
Speech driven synthesis of talking head sequences.
In 3D Image Analysis and Synthesis, pages 51-56, Erlangen, November 1997.
32: D. J. Fleet, M. J. Black, Y. Yacoob, and A. D. Jepson.
Design and use of linear models for image motion analysis.
Int. Journal of Computer Vision, 36(3):171-193, 2000.
33: Aphrodite Galata, Neil Johnson, and David Hogg.
Learning structured behaviour models using variable length markov models.
Technical Report 1999.10, University of Leeds, May 1999.
34: Aphrodite Galata, Neil Johnson, and David Hogg.
Learning variable-length Markov models of behavior.
Computer Vision and Image Understanding: CVIU, 81(3):398-413, March 2001.
35: B. Le Goff and C. Benoit.
A text-to-audiovisual-speech synthesizer for french.
In Proceedings of the International Conference on Spoken Language Processing (ICSLP), Philadelphia, USA, October 1996.
36: Gene H. Golub and Charles F. Van Loan.
Matrix computations.
John Hopkins Press, 1989.
37: Hans Peter Graf and Eric Cosatto.
Sample-based synthesis of talking heads.
In ICCV-RATFG-RTS, pages 3-7, 2001.
38: Lisa Gralewski, Neill Campbell, Barry Thomas, Colin Dalton, and David Gibson.
Statistical synthesis of facial expressions for the portrayal of emotion.
In Proceedings of the International Conference on Computer Graphics and Interactive Techniques in Austalasia and Southeast Asia (GRAPHITE 2004), Singapore, June 2004.
39: I. Guyon and F. Pereira.
Design of a linguistic postprocessor using variable memory length markov models.
In 3rd International Conference on Document Analysis and Recognition, pages 454-457, 1995.
40: C. A. Hack and C. J. Taylor.
Modelling 'talking head' behaviour.
In British Machine Vision Conference (BMVC03), 2003.
41: Craig Hack.
Modelling 'Talking Head' Behaviour.
PhD thesis, University of Manchester, 2004.
42: Pengyu Hong, Matthew Turk, and Thomas S. Huang.
Gesture modeling and recognition using finite state machines.
In Proceedings of the Fourth International Conference on Automatic Face and Gesture Recognition, pages 410-415, March 2000.
43: Pengyu Hong, Zhen Wen, and Thomas S. Huang.
An integrated framework for face modeling, facial motion analysis and synthesis.
In Proceedings of the 9th ACM International Conference on Multimedia, pages 495-498, Ottawa, Ontario, Canada, September 2001.
44: Ying Huang, Xiaoqing Ding, Baining Guo, and Heung-Yeung Shum.
Real-time face synthesis driven by voice.
In Proceedings of CAD/Graphics, 2001.
45: Peter Ti ino Georg Dorffner.
Building predictive models from fractal representations of symbolic sequences.
In Advances in Neural Information Processing Systems, 2000.
46: E. T. Jaynes.
Monkeys, kangaroos and n.
In J. H. Justice, editor, Maximum Entropy and Bayesian Methods in Applied Statistics, page 26. Cambridge University Press, 1986.
47: T. Jebara and A. Pentland.
Parametrized structure from motion for 3d adaptive feedback tracking of faces.
In Proceedings of Computer Vision and Pattern Recognition, 1997.
48: T. Jebara and A. Pentland.
Action reaction learning: Automatic visual analysis and synthesis of interactive behaviour.
Lecture Notes in Computer Science, 1542:273-292, 1999.
49: T. Jebara and A. Pentland.
The generalized CEM algorithm.
Advances in Neural Information Processing Systems, 12, 1999.
50: F. Jelinek.
Statistical Methods for Speech Recognition.
MIT Press, 1997.
51: N. Johnson, A. Galata, and D. Hogg.
The acquisition and use of interaction behaviour models.
In IEEE Computer Society Press., editor, CVPR, pages 866-871, 1998.
52: N. Johnson, A. Galata, and D. Hogg.
The acquisition and use of interaction behaviour models.
In IEEE Computer Society Press., editor, Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition - CVPR'98, pages 866-871, 1998.
53: F. Jurie and M. Dhome.
Real time robust template matching.
In British Machine Vision Conference, pages 123-132, September 2002.
54: V. Kettnaker and M. Brand.
Minimum-entropy models of scene activity.
In Proceedings of the IEEE Computer Science Conference on Computer Vision and Pattern Recognition (CVPR-99), pages 281-286, Los Alamitos, June 23-25 1999. IEEE.
55: Lucas Kovar, Michael Gleicher, and Frédéric Pighin.
Motion graphs.
In Proceedings of SIGGRAPH, pages 473-482, San Antonio, Texas, 2002.
56: Takeshi Kurata, Takashi Okuma, Masakatsu Kourogi, and Katsuhiko Sakaue.
The hand mouse: Gmm hand-color classication and mean shift tracking.
In Proceedings of the IEEE ICCV Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems (RATFG-RTS'01), pages 119-124, July 2001.
57: A. Lanitis, C. J. Taylor, and T. F. Cootes.
An automatic face identification system using flexible appearance models.
In BMVA Press, editor, Proceedings of the British Machine Vision Conference, pages 66-75, 1994.
58: Yan Li, Tianshu Wang, and Heung-Yeung Shum.
Motion texture: A two-level statistical model for character motion synthesis.
In Proceedings of SIGGRAPH, 2002.
59: D. Magee and R. Boyle.
Feature tracking in real world scenes (or how to track a cow).
In IEE Colloquium on Motion Analysis and Tracking, May 1999.
60: D. R. Magee.
Machine Vision Techniques for the Evaluation of Animal Behaviour.
PhD thesis, The University of Leeds, School of Computing, October 2000.
61: D. R. Magee and R. D. Boyle.
Detecting lameness in livestock using 're-sampling condensation' and 'multi-stream cyclic hidden Markov models'.
In BMVC2000 11th British Machine Vision Conference Volume 1, Bristol, September 2000.
62: D. R. Magee and R. D. Boyle.
Spatio-temporal modeling in the farmyard domain.
In Proceedings IAPR International Workshop on Articulated Motion and Deformable Objects, pages 83-95, 2000.
63: Jan R. Magnus and Heinz Neudecker.
Matrix Differential Calculus with Applications in Statistics and Econometrics.
Wiley series in probability and statistics. John Wiley & Sons, 1999.
64: Dimitrios Makris and Tim Ellis.
Spatial and probabilistic modelling of pedestrian behaviour.
In Paul L. Rosin and David Marshall, editors, British Machine Vision Conference, pages 557-566, Cardiff, UK, September 2002.
65: Duncan Marsh.
Applied Geometry for Computer Graphics and CAD.
Springer-Verlag, 1999.
66: Richard J. Martin.
A metric for ARMA processes.
IEEE Transactions on Signal Processing, 48(4):1164-1170, April 2000.
67: D. W. Massaro, J. Beskow, M. M. Cohen, C. L. Fry, and T. Rodriguez.
Picture my voice: Audio to visual speech synthesis using artificial neural networks.
In D. W. Massaro, editor, Proceedings of AVSP'99, Internation Conference on Audditory-Visual Speech Processing, pages 133-138, Santa Cruz, CA., August 1999.
68: Iain Matthews and Simon Baker.
Active appearance models revisited.
Technical Report CMU-RI-TR-03-02, The Robotics Institue, Carnegie Mellon University, 2002.
69: Iain Matthews, Takahiro Ishikawa, and Simon Baker.
The template update problem.
In Proceedings of the British Machine Vision Conference, September 2003.
70: Tom M. Mitchell.
Machine Learning.
Computer Science. McGraw-Hill, 1997.
71: Jeffrey Ng and Shaogang Gong.
Learning intrinsic video content using Levenshtein distance in graph partitioning.
Lecture Notes in Computer Science, 2353:670-684, 2002.
72: Jun-Yong Noh and Ulrich Neumann.
Talking faces.
In IEEE International Conference on Multimedia and Expo, volume 2, pages 627-630, 2000.
73: S. Ouni, D. W. Massaro, M. M. Cohen, K. Young, and A. Jesse.
Internationalization of a talking head.
In $15^{th}$ International Congress of Phonetic Sciences (ICPhS'03), Barcelona, Spain, August 2003.
74: David Oziem, Lisa Gralewski, Neill Campbell, David Gibson, and Barry Thomas.
Synthesising facial emotions.
In Proceedings of Theory and Practice of Computer Graphics (TPCG'04), pages 120-127, Bournemouth, United Kingdom, June 2004.
75: M. Pantic and L. J. M. Rothkrantz.
Automatic analysis of facial expressions: The state of the art.
IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12):1424-1445, December 2000.
76: L. Rabiner and B. Juang.
Fundamentals of speech recognition.
Prentice Hall, 1993.
77: L. R. Rabiner.
A tutorial on hidden Markov models and selected applications in speech recoginition.
Proceedings of the IEEE, 77:257-286, 1989.
78: E. Raudsepp.
Body language speaks louder than words.
Machine Design, 65(19):85-89, September 1993.
79: David Reynard, Andrew Wildenberg, Andrew Blake, and John A. Marchant.
Learning dynamics of complex motions from image sequences.
In ECCV (1), pages 357-368, 1996.
80: Eric Sven Ristad.
A natural law of succession.
Technical Report TR-495-95, Princeton University, Computer Science Department, July 1995.
81: Dana Ron, Yoram Singer, and Naftali Tishby.
The power of amnesia.
In Jack D. Cowan, Gerald Tesauro, and Joshua Alspector, editors, Advances in Neural Information Processing Systems, volume 6, pages 176-183. Morgan Kaufmann Publishers, Inc., 1994.
82: Dana Ron, Yoram Singer, and Naftali Tishby.
The power of amnesia: Learning probabilistic automata with variable memory length.
Machine Learning, 25:117-149, 1996.
83: S. T. Roweis and L. K. Saul.
Nonlinear dimensionality reduction by locally linear embedding.
SCIENCE, 290:2323-2326, 2000.
84: Payam Saisan, Gianfranco Doretto, Ying Nian Wu, and Stefano Soatto.
Dynamic texture recognition.
In Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pages 58-63, 2001.
85: Lawrence K. Saul and Michael I. Jordan.
Boltzmann chains and Hidden Markov Models.
In G. Tesauro, D. Touretzky, and T. Leen, editors, Advances in Neural Information Processing Systems, volume 7, pages 435-442. The MIT Press, 1995.
86: Arno Schödl, Richard Szeliski, David H. Salesin, and Irfan Essa.
Video textures.
In Kurt Akeley, editor, Siggraph 2000, Computer Graphics Proceedings, pages 489-498. ACM Press / ACM SIGGRAPH / Addison Wesley Longman, 2000.
87: J. Shi and J. Malik.
Normalized cuts and image segmentation.
IEEE Conf. Computer Vision and Pattern Recognition, June 1997.
88: Jianbo Shi and Jitendra Malik.
Self inducing relational distance and its application to image segmentation.
In Proceedings of the $5^{th}$ European Conference on Computer Vision (ECCV'98), pages 528-543, Freiburg, Germany, June 1998.
89: Jianbo Shi and Jitendra Malik.
Normalized cuts and image segmentation.
IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):888-905, August 2000.
90: B. W. Silverman.
Density Estimation for Statistics and Data Analysis.
Chapman and Hall, 1986.
91: M. B. Stegmann.
Object tracking using active appearance models.
In Søren I. Olsen, editor, 10th Danish Conference on Pattern Recognition and Image Analysis, volume 1, pages 54-60, Copenhagen, Denmark, July 2001. DIKU.
92: N. Sumpter and A. Bulpitt.
Learning spatio-temporal patterns for predicting object behaviour.
In Proc. British Machine Vision Conference, pages 649-658, 1998.
93: Barry J. Theobald, Andrew Bangham, Iain Matthews, J. R. W. Glauert, and G. C. Cawley.
2.5d visual speech synthesis using appearance models.
In Richard Harvey and Andrew Bangham, editors, Proceedings of the British Machine Vision Conference (BMVC), 2003.
94: Barry J. Theobald, J. Andrew Bangham, Iain Matthews, and Gavin C. Cawley.
Visual speech synthesis using statistical models of shape and appearance.
In Proceedings of Auditory-Visual Speech Processing, 2001.
95: M. Walter, A. Psarrou, and S. Gong.
Auto clustering for unsupervised learning of atomic gesture components using minimum description length.
In ICCV-RATFG-RTS, pages 157-162, 2001.
96: Michael Walter, Alexandra Psarrou, and Shaogang Gong.
Data driven gesture model acquisition using minimum description length.
In British Machine Vision Conference, pages 673-683, September 2001.
97: M. P. Wand.
Data-based choice of histogram bin width.
The American Statistician, 51(1):59, February 1997.
98: Tianshu Wang, Yan Li, Ying-Qing Xu, and Heung-Yeung Shum.
Learning kernel-based hmms for dynamic sequence synthesis.
Graphical Models, 65(4):206 - 221, July 2003.
99: J. Yang, R. Stiefelhagen, U. Meier, and A. Waibel.
Visual tracking for multimodal human computer interaction.
In Proceedings of ACM CHI 98 Conference on Human Factors in Computing Systems, volume 1 of About Faces, pages 140-147, 1998.
100: Bo Yu.
Recognition of freehand sketches using mean shift.
In Proceedings of the 2003 International Conference on Intelligent User Interfaces, pages 204-210, Miami, Florida , USA, January 2003.

franck 2006-10-01