Optical music recognition : feature identification

Type of content
Discussion / Working Papers
Publisher's DOI/URI
Thesis discipline
Degree name
Publisher
University of Canterbury
Journal Title
Journal ISSN
Volume Title
Language
Date
1995
Authors
Bainbridge, D.
Abstract

Although it has been less than a year since the last progress report, work has reached a natural 'breakpoint' offering an opportunity to describe what has been accomplished as well as gathering thoughts on the future. For a broader picture of how the implemented work fits into the general design, the reader is directed towards [Bai94a], which describes a complete Optical Music Recognition system. Since it is possible for a piece of music to include arbitrary graphics1 , it is impossible to design an OMR system that can process all music. The key idea, therefore, expressed in [Bai94a] is to provide a versatile foundation that can be built upon by individual users to generate particular instances of the system capable of recognising a particular class of music notation. Such a philosophy is reminiscent of Computer Aided Design (CAD). Similar to this area, the user should be encouraged to utilise sound software engineering principles. The proposed system could be thought of as a Computer Aided Music Recognition (CAMR). The main body of this report describes the implemented work. The topics: staff separation; primitive identification; primitive data tabulation; drawing package development; prototype2 musical feature classifier extensions; and miscellaneous items are discussed in turn. The report concludes by reflecting on the completed work as well as contemplating how the remaining problems may be solved. Having the right development environment to study OMR is considered an important facet of work. Time throughout the project has been devoted to this. In addition to the topics listed in (Bai94b], time has recently been invested learning Perl ( a Practical Extraction and Report Language), Gofer ( a functional language), and Lime ( a music file format for graphical reconstruction). Developing Internet searching skills using 'whois', 'archie', 'gofer', and 'mosaic' has also proved invaluable. Finally, in lieu of the forth coming design of a musical knowledge expression language, time has been spent investigating the suitability of Lisp, Prolog and various public domain knowledge based systems. The excerpt of music shown in Figure 1 is a recurring example used, throughout this report, to illustrate various points.

Description
Citation
Keywords
Ngā upoko tukutuku/Māori subject headings
ANZSRC fields of research
Fields of Research::46 - Information and computing sciences::4603 - Computer vision and multimedia computation::460306 - Image processing
Field of Research::08 - Information and Computing Sciences::0801 - Artificial Intelligence and Image Processing::080104 - Computer Vision
Rights
All Rights Reserved