[go: up one dir, main page]

CN101409021A - Sound music reading material and pick-up device - Google Patents

Sound music reading material and pick-up device Download PDF

Info

Publication number
CN101409021A
CN101409021A CNA2007102019784A CN200710201978A CN101409021A CN 101409021 A CN101409021 A CN 101409021A CN A2007102019784 A CNA2007102019784 A CN A2007102019784A CN 200710201978 A CN200710201978 A CN 200710201978A CN 101409021 A CN101409021 A CN 101409021A
Authority
CN
China
Prior art keywords
phoneme
value
information
pronunciation
code word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CNA2007102019784A
Other languages
Chinese (zh)
Inventor
游旭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CNA2007102019784A priority Critical patent/CN101409021A/en
Publication of CN101409021A publication Critical patent/CN101409021A/en
Withdrawn legal-status Critical Current

Links

Images

Landscapes

  • Electrically Operated Instructional Devices (AREA)

Abstract

The invention relates to a sound music reading material and a sound pick-up related to a splicing mode, wherein a code word is arranged on a page of the sound reading material, one code word is generally printed beside each or a plurality of printed music sections, the value of the code word is generally related to the value of splicing information of a plurality of music sections, and the splicing of the music sections is completed by phonemes. The microphone reads this code word and decodes the play-out musical piece pronunciation. The code reading head of the sound pick-up can use a bar code reading head or a conductive code reading head, and the corresponding code word is a bar code or a conductive code and the like.

Description

涉及拼合方式的有声音乐读物及拾音器 Audible music books and pickups involving splicing

技术领域: Technical field:

本发明涉及一种涉及拼合方式的有声音乐读物及拾音器的解决方案。The invention relates to a solution to audio music books and sound pickups in a splicing mode.

背景技术: Background technique:

在现有技术中,有对乐谱发音的方案,如中国专利号为:96100915.2的专利,In the prior art, there is a scheme for pronouncing music score, as the Chinese patent number is: the patent of 96100915.2,

其解决方案是把乐谱按类似五线谱的格式,用打孔,涂黑等方式记录在一张纸上作为专用乐谱页,而其读取装置是按与其专用乐谱页的形式对应的做有一些感应器件来感应专用乐谱页上五线谱的记录信息,使用时用马达卷动专用乐谱页,顺序读取页上记录的乐谱信息,并将其发出音乐声来。此方式不需另外加存储器来记录谱数据,读取装置读取的就是谱数据。其缺点是读取装置大,读取操作麻烦,成本高,使用不方便,专用乐谱页与常规乐谱页需做成不同的纸,而且不能按位置任意发音。The solution is to record the score on a piece of paper in a format similar to staves by punching, blackening, etc. The device senses the record information of the stave on the special score page, and uses the motor to scroll the special score page, sequentially reads the score information recorded on the page, and makes a musical sound. In this way, no additional memory is needed to record the spectral data, and what the reading device reads is the spectral data. The disadvantages are that the reading device is large, the reading operation is cumbersome, the cost is high, and the use is inconvenient. The special score page and the conventional score page need to be made of different papers, and it cannot be pronounced arbitrarily according to the position.

发明内容: Invention content:

本发明的任务是提供一种结构简单,成本低,使用方便的有声音乐读物系统,包括有声音乐读物及拾音器。The task of the present invention is to provide a simple structure, low cost, and easy-to-use audio music reading system, including audio music books and pickups.

本发明涉及两个方面:The present invention involves two aspects:

一是其整页面上印有码字的有声音乐读物:在有声音乐读物的整页面上,以印刷或其他方式形成有码字;每个码字可被相应拾音器读取以用于拾音器播放音乐,每个码字的值所表示的信息是与拼合信息的值有关的信息;码字一般可以是导电码或条形码等。One is audio music books with codes printed on the entire page: on the entire page of audio music books, code words are formed by printing or other means; each code word can be read by the corresponding pickup for the pickup to play music , the information represented by the value of each codeword is information related to the value of the combined information; the codeword can generally be a conductive code or a barcode.

二是用于读取有声音乐读物整页面上诸个码字中的部诸码字中的码字并播放音乐的拾音器:其一般包括一个读码头,一个单片机芯片,一个发音器件,存储器,按键模块等。发音器件用于播放音乐。拾音器与有声音乐读物配合使用,使用时,使读码头接近或接触有声音乐读物整页面上的码字,单片机芯片利用读码头读入码值,单片机芯片根据读得的一个或多个码字的码值得到拼合信息的值并按得到的拼合信息的值解码使发音器件发出音乐声音。The 2nd, be used for reading the code word in the code word in all code words in all code words on the entire page of audio music book and play music: it generally comprises a reading terminal, a single-chip microcomputer chip, a pronunciation device, memory, button modules etc. The sounding device is used to play music. The pickup is used in conjunction with the audio music book. When in use, the reading dock is made to approach or touch the code words on the entire page of the audio music book. The single-chip microcomputer chip uses the reading dock to read the code value. The code value obtains the value of the combined information and decodes the obtained value of the combined information to make the sounding device emit music sounds.

有声音乐读物整页面的一个码字的值可以是或含有或可转换得到数个乐段拼合信息的值,或是其它情况。The value of a codeword of the entire page of the audio music book can be or contain or can be converted to obtain the value of stitching information of several music sections, or other situations.

有声音乐读物整页面上的码字可采用导电码,相应的拾音器的读码头可采用机电式导电码读码头。有声音乐读物整页面上的码字也可采用条形码等,相应的拾音器的读码头可采用光电式读码头或其它形式读码头。The code word on the entire page of the audio music book can adopt the conductive code, and the reading terminal of the corresponding pickup can adopt the electromechanical conductive code reading terminal. The code word on the entire page of audio music book also can adopt bar code etc., and the reading dock of corresponding pickup can adopt photoelectric type to read dock or other forms to read dock.

作为有声音乐读物主体的读物可以是音乐教学教材,音乐欣赏读物,学生用乐谱书及乐队使用的乐谱书等。The reading materials as the main body of audio music reading materials can be music teaching materials, music appreciation reading materials, music score books for students and music score books for bands, etc.

附图说明: Description of drawings:

图1:其整页面上印有导电码的一种有声音乐读物的实例图Figure 1: Illustration of an example of an audiobook with conductive codes printed on the entire page

图2:使用光电式读码头的拾音器的一个电路图Figure 2: A circuit diagram of a pickup using a photoelectric reading dock

图3:使用MCU外接的音乐芯片的拾音器的电路图Figure 3: The circuit diagram of the pickup using the music chip connected to the MCU

图4:大机体连读码头形式的拾音器(机体的外型适于让有声音乐读物放入机体里)外型实例和有声音乐读物的实例Figure 4: A pickup in the form of a large-body continuous reading dock (the shape of the body is suitable for putting audio music books into the body) appearance examples and examples of audio music books

图5:大机体连读码头形式的拾音器(机体的外型适于让有声音乐读物放在机体上)外型实例和有声音乐读物的实例Figure 5: The pickup in the form of a large body continuous reading dock (the shape of the body is suitable for allowing audio music books to be placed on the body) appearance examples and examples of audio music books

图6:手持式拾音器外型实例和有声音乐读物的实例Figure 6: Example of a handheld pickup and an example of an audio book

图7:小壳体连读码头形式的拾音器外型实例和有声音乐读物的实例Figure 7: An example of the appearance of a pickup in the form of a small shell continuous reading dock and an example of an audio book

图8:一段重复音可被定义为一个音素(此例音素含3个音)Figure 8: A repeated sound can be defined as a phoneme (this example phoneme contains 3 tones)

图9:一段音阶音可被定义为一个音素(此例音素含4个音)Figure 9: A section of scale tone can be defined as a phoneme (this example phoneme contains 4 tones)

图10:一段和弦分解可被定义为一个音素(此例音素含3个音)Figure 10: A chord decomposition can be defined as a phoneme (this example phoneme contains 3 tones)

图11:带八度冠音的两个示例音Figure 11: Two example tones with octave crowns

图12:把一个被定位的音素重复2次的示意图Figure 12: Schematic diagram of repeating a localized phoneme twice

图13:拼合编码解码的一个乐谱实例(钢琴谱)Figure 13: An example of a music score (piano score) of the combined encoding and decoding

图14:拼合编码解码的另一个乐谱实例(钢琴谱)Figure 14: Another musical score example (piano score) of the coded and decoded pieces

图15:交织拼合的两个音素示例Figure 15: Example of two phonemes interleaved

图16:一个音素示例Figure 16: An example phoneme

1:机电式导电码读码头1: Electromechanical conductive code reading terminal

2:电平通断检测电路2: Level on-off detection circuit

3:MCU3: MCU

4:按键模块4: Button module

6:外接数据存储器6: External data memory

10:导电码10: conduction code

44:有声音乐读物的整页面上印刷的乐谱44: Sheet music printed on a full page of an audiobook

45:光笔45: light pen

55:程序存储器55: Program memory

56:乐曲的音素56: Phonemes of music

57:条形码57: barcode

具体实施方式: Detailed ways:

本申请文件各处涉及的读物,一般是常见的音乐读物,比如乐谱书。可在读物的整页面上形成有码字,码字可有多个,位置可任意,码字可以是导电码,条形码或其它码种的码字,每个码字可被相应拾音器读取以用于拾音器播放音乐,码字的值所表示的信息是与拼合信息的值有关的信息,码字可以是处在读物页面上需播放此音乐的位置,码字在页面上所处位置一般是在印刷着码字对应播放的音乐的乐谱的旁边;读物整页面上的所有与拼合信息的值有关的码字与读物就一起形成有声音乐读物。The reading materials involved in this application document are generally common music reading materials, such as music score books. Code words can be formed on the entire page of the reading material. There can be multiple code words, and the positions can be arbitrary. The code words can be conductive codes, bar codes or other code words. Each code word can be read by the corresponding pickup. It is used for the pickup to play music. The information represented by the value of the code word is information related to the value of the combined information. The code word can be at the position where the music needs to be played on the page of the reading material. The position of the code word on the page is generally Next to the music scores of the music played corresponding to the codewords printed; all codewords related to the value of the combined information on the entire page of the reading material and the reading material together form an audio music reading material.

有声音乐读物整页面上的诸个码字中的全部或部分码字中每个码字的码值可以是或含有一个乐段拼合信息的值,由拾音器读有声音乐读物整页面上诸个码字中的部诸码字中的码字并对码字的值中乐段拼合信息的值解码并播乐段发音;当然处于不同页面或不同位置的不同码字,其码值表达或含有的一个乐段拼合信息的值一般是不同的(这里的词‘表达’与‘是或含有’中的‘是’涵义相同)。The code value of each code word in all or part of the code words in the code words on the whole page of the audio music book can be or contain the value of a music segment splicing information, read all the code words on the whole page of the audio music book by the pickup Codewords in all codewords in the word and decode the value of the music segment combination information in the value of the codeword and play the music segment pronunciation; of course, the different codewords in different pages or different positions, the code value expression or contained The value of a piece of combination information is generally different (here the word 'expression' has the same meaning as 'is' in 'is or contains').

在本申请文件中各处提及的‘多个’中的‘多’是指等于或大于2的一个数值。在本申请文件中各处提到的‘多本’中的‘多’也是指等于或大于2的一个数值。在本申请文件中各处提到的‘多种’中的‘多’也是指等于或大于2的一个数值。在本申请文件中其它各处提到的作为数词的‘多’也是指等于或大于2的一个数值。在本申请文件中其它各处提及的作为数词的‘几’是指大于1且小于十的数值中的一个数值,这是常识。在本申请文件中的各处提及的‘数个’中的‘数’是指等于或大于1的一个数值。在本申请文件中的各处提及的‘数种’中的‘数’是指等于或大于1的一个数值。在本申请文件中的其它各处提及的作为数词的‘数’也是指等于或大于1的一个数值。The 'many' mentioned in the 'multiple' throughout this application refers to a numerical value equal to or greater than 2. The "many" in the "multiple copies" mentioned in this application document also refers to a numerical value equal to or greater than 2. The "multiple" in the "multiple" mentioned throughout the application documents also refers to a numerical value equal to or greater than 2. 'Multiple' mentioned elsewhere in this application document as a numeral also refers to a numerical value equal to or greater than 2. The 'several' mentioned as a numeral elsewhere in this application document refers to a numerical value greater than 1 and less than ten, which is common sense. The 'number' in 'several' mentioned in various places in this application document refers to a numerical value equal to or greater than 1. The 'number' in 'several' mentioned in various places in this application document refers to a numerical value equal to or greater than 1. References to 'number' as a numeral elsewhere in this application document also refer to a numerical value equal to or greater than 1.

本申请文件各处涉及的读物,可以是书籍,图文卡片,地图,乐谱书或报纸杂志等,也可以是各级学校音乐教材(属于书籍)。乐谱书里的乐谱可以是五线谱的,简谱的,吉他谱的,电子琴谱(有些电子琴谱与五线谱有不同)的等。The reading materials involved in this application document can be books, graphic cards, maps, music books or newspapers and magazines, etc., and can also be music teaching materials (belonging to books) in schools at all levels. The music scores in the music score book can be staves, numbered notation, guitar notation, electronic piano notation (some electronic notation is different from stave notation) etc.

如提及时无特别的指明,本申请文件中各处提及的读物,是指一个计量单位的读物,比如一本书,一本乐谱书,一本杂志,这是显然的,也是常识。如提及时无特别的指明,本申请文件中各处提及的有声音乐读物,是指一个计量单位的有声音乐读物,比如一本乐谱书(此乐谱书本身不包括码字,此乐谱书是读物)与其整页面上所有其码值是与拼合信息的值有关的码字一起形成的一本有声音乐读物,这是显然的,也是常识。Unless otherwise specified, the reading materials mentioned in this application document refer to the reading materials of a unit of measurement, such as a book, a music score book, and a magazine. This is obvious and common sense. If there is no special indication when mentioned, the audio music books mentioned everywhere in this application document refer to the audio music books of a unit of measurement, such as a music score book (the music score book itself does not include code words, and the music score book is Reading material) and all its code value on its whole page is to form an audio music reading material together with the code word relevant with the value of splicing information, this is obvious, also is common sense.

本申请文件各处提及的有声音乐读物,其主体为读物,读物整页面上有码字,码字的值表示的信息是与拼合信息的值有关的信息。本申请文件各处提及的有声音乐读物,一般是把作为有声音乐读物主体的读物的页作为码字载体。The audio music readings mentioned in this application document are mainly reading materials, and there are code words on the entire page of the reading materials, and the information represented by the value of the code words is information related to the value of the spliced information. The audio music reading material that this application document mentions everywhere generally is to use the page as the code word carrier as the reading material of audio music reading material main body.

码字的位置是在读物(比如乐谱书)的整页面上的。读物的任一个页面上可没有码字,或有一个或多个处于任意不同位置的码字;拾音器可以是读取读物整页面上的一个码字,并根据码值来处理发出音乐声,可以是把读物页面上此码字旁边印的乐谱内容用声音演奏出来。这里演奏的乐谱可以是此码字旁边印的一段乐谱,或那一整页印的乐谱,或本页及后面几页印的乐谱等;拾音器也可以是读取读物整页面上的多个码字,并根据这多个码字的码值来处理发出音乐声,可以是把读物页面上这多个码字旁边印的乐谱内容用声音演奏出来。这里演奏的乐谱可以是这多个码字旁边印的一段乐谱,或那一整页印的乐谱,或本页及后面几页印的乐谱等。The position of code word is on the whole page of reading material (such as music score book). There may be no codeword on any page of the reading material, or one or more codewords in any different positions; the pickup can read a codeword on the entire page of the reading material, and process and emit music sounds according to the code value, which can be It is to perform the music score content printed next to the code word on the reading material page with sound. The music score played here can be a section of music score printed next to this code word, or the music score printed on that whole page, or the music score printed on this page and the following pages, etc.; Word, and according to the code value of these a plurality of code words, process and send out musical sound, can be that the musical notation content printed next to these a plurality of code words on the reading material page is played out with sound. The music score played here can be a section of music score printed next to these multiple code words, or the music score printed on that whole page, or the music score printed on this page and the following pages.

本申请文件各处所指作为有声音乐读物主体的读物(读物可以是书籍,乐谱书,地图,报纸杂志或印有图文的卡片等),在读物的整页面上有诸个码字,一般一本读物是含有多个页的(也即一本读物含有多个页面),而任一个页面可没有或有一个或有多个处在不同位置的码字,当然各页面的码字的码值一般是各异的,如其单个码字的值是或含有一个乐段拼合信息的值,那么各码字的值上的乐段拼合信息的值一般是不同的;诸个码字中每个码字的值所表示的信息是与拼合信息的值有关的信息,由拾音器根据读得的诸个码字中的部诸码字中的码字的码值来播放音乐发音,甚至每页面都可有码字,有的页面如不需发音可没码字;对于一本含多个页面的读物,可能在多个页面上都有码字(这句话提到的‘多’小于或等于上句话提到的‘多’)且每个页面有数个码字,也可能只有其中一个页面有码字,此页面的码字数量可以有一个或多个;显然,本申请文件各处所说的‘在读物的整页面上有诸个码字’,可能是读物的每个页面上都有码字,也可能是只有读物的部分页面上有码字;甚至一本读物本身只有一个页面(只有一个页且只使用单面),在这一个页面上有一个或多个码字;可把一张卡片当成一本只有一个页面或有正反两个页面的读物,在一个或两个页面上有码字;对于一叠卡片,可把一叠卡片当成一本读物,也可把一叠卡片中的每张卡片分别当成一本只有一个页面或有正反两个页面的读物;可由拾音器读取各页面上的数个码字并处理播音乐。一本有声音乐读物整页面上的诸个码字中的各码字可能都属于同一码种(比如都是条形码)或有多个码种(比如诸个码字中既有条形码也有导电码)的,如诸个码字中的各码字有多个码种,那各码种的码字一般应使用带不同读码头的不同的拾音器来读码播音乐。The readings (readings can be books, music books, maps, newspapers, magazines or cards printed with pictures, etc.) as the main body of audio music readings are referred to in this application document. There are code words on the entire page of the readings, generally A reading material contains multiple pages (that is, a reading material contains multiple pages), and any page may not have or have one or a plurality of code words in different positions, and of course the code words of the code words of each page Values are generally different, such as the value of its single codeword is or contains the value of a music section splicing information, then the value of the music section splicing information on the value of each codeword is generally different; The information represented by the value of the codeword is the information relevant to the value of the combined information, and the music pronunciation is played by the pick-up according to the code value of the codewords in the codewords of the part codewords in the codewords read, even every page There may be codewords, and some pages may not have codewords if they do not need to be pronounced; for a book containing multiple pages, there may be codewords on multiple pages (the "many" mentioned in this sentence is less than or equal to The 'many' mentioned in the previous sentence) and each page has several codewords, or only one of the pages has codewords, and the number of codewords on this page can be one or more; "There are various codewords on the whole page of the reading material" may be that there are codewords on every page of the reading material, or there may be codewords only on some pages of the reading material; even a reading material itself has only one page ( There is only one page and only one side is used), there are one or more code words on this one page; a card can be regarded as a reading material with only one page or two pages on the front and back, on one or two pages There are code words on it; for a stack of cards, a stack of cards can be regarded as a reading material, and each card in a stack of cards can also be regarded as a reading material with only one page or two pages; Read several codewords on each page and process playing music. Each code word among the code words on the entire page of an audio music book may all belong to the same code type (for example, they are all bar codes) or have multiple code types (for example, there are both bar codes and conductive codes in the code words) Yes, as each code word in the code words has a plurality of code types, the code words of each code type should generally use different pickups with different reading docks to read codes and play music.

诸个码字中各个码字的值一般是数个乐段拼合信息的值。The value of each codeword among the codewords is generally the value of several pieces of musical segment splicing information.

关于可使用的码字及读码头的形式:Regarding the available codewords and the form of reading terminals:

一:使用一维条形码,一般一个码字至多可表示15个左右十进制位,转换成二进制约有50个二进制位,位数较少;此种码字的识读可使用光笔或ccd读码头等。One: Use one-dimensional barcodes. Generally, a codeword can represent at most about 15 decimal digits, and there are about 50 binary digits when converted into binary, with fewer digits; this kind of codeword can be read using a light pen or a ccd reading dock, etc. .

二:使用二维条形码,一般可使用行排式二维条形码,如果是两排的话,就可表示到100个二进制位以上;此种码字的识读一般可使用线性ccd读码头,使用线性ccd读码头时需沿垂直方向拖动读码头。Two: Use two-dimensional barcodes. Generally, row-type two-dimensional barcodes can be used. If there are two rows, it can be expressed to more than 100 binary digits; this kind of codewords can generally be read using linear ccd reading docks, using linear When the ccd reads the dock, it needs to drag along the vertical direction to read the dock.

三:使用本说明书阐述的导电码,一般每个码字能表示100个以上的二进制位;此种码字的识读可使用本申请文件阐述的机电式导电码读码头。Three: Using the conductive code described in this manual, generally each code word can represent more than 100 binary digits; this kind of code word can be read using the electromechanical conductive code reading terminal described in this application document.

四:使用图形码或磁码,码字的识读应使用相应的读码头。Four: Use graphic codes or magnetic codes, and the corresponding reading terminals should be used for code word recognition.

五:使用条形码,导电码等,并用照相式读码头来读这些种码字。Five: Use barcodes, conductive codes, etc., and use photographic reading docks to read these codes.

六:使用隐形条形码,其码字的识读应使用相应的隐形条形码读码头。Six: Use invisible barcodes, and the corresponding invisible barcodes should be used to read the code words.

七:使用其它码种的码字,码字的识读应使用相应的读码头。Seven: Use code words of other code types, and the reading of the code words should use the corresponding reading terminal.

所使用的码字的特点:Characteristics of the codewords used:

码字可被相应拾音器读取以用于拾音器播放音乐,码字的值所表示的信息是与拼合信息的值有关的信息。The code word can be read by the corresponding pickup for the pickup to play music, and the information represented by the value of the code word is information related to the value of the spliced information.

一个码字的值可以是或含有或可转换得到一个乐段拼合信息的值,一个码字的值也可以是其它情况,乐段拼合信息的值可以是按后面所述的乐谱拼合的编码规则来形成的。The value of a codeword can be or contain or can be converted to obtain the value of a piece of musical composition information, and the value of a codeword can also be other situations, and the value of the musical composition information can be according to the coding rules of the score combination described later to form.

导电码表示码值的方式可参考2006102010785号申请里对导电码技术的阐述,导电码一般可直接或通过通断代名码数来表示二进制码值,而乐段拼合信息的值或乐段拼合信息的值的变换值或乐段拼合信息的值的部分等一般是二进制格式,所以导电码直接或通过通断代名码数来表示的二进制码值可以是数个二进制格式的乐段拼合信息的值或二进制格式的数个乐段拼合信息的值的变换值或一个二进制格式的乐段拼合信息的值的部分等。For the method of expressing the code value of the conductive code, please refer to the explanation of the conductive code technology in the application No. 2006102010785. The conductive code can generally represent the binary code value directly or through the number of on-off code names, and the value of the segment combination information or the segment combination information The conversion value of the value or the part of the value of the segment combination information is generally in binary format, so the binary code value represented by the conductive code directly or through the on-off code name code number can be the value of several segment combination information in binary format Or the transformation value of the value of several piece combination information in binary format or a part of the value of piece combination information in binary format, etc.

一般条形码的码制表示的字符集是10个十进制值(0-9),或者表示的字符集是几十个ascii字符,可以建表将10个十进制值(0-9)或几十个ascii字符按顺序赋与二进制编号,即建立了条形码与二进制值的对应关系;这样就可把数个二进制格式的乐段拼合信息的值或二进制格式的数个乐段拼合信息的值的变换值或一个二进制格式的乐段拼合信息的值的部分等转换成条形码格式的值,并放入条形码里。比如如条形码的码制表示的字符集是10个十进制值(09),那就可用条形码表示的字符集中的十进制的07来对应表示二进制的000-111,如乐段拼合信息的值为011101001101110010(这里的值仅为例子),那每3个BIT一组对应转换成为十进制值351562,这个351562就是放入条形码的值,与此对应的,如在读码时得到条形码里的十进制值为351562,可将其每个十进制数对应转换成3BIT的二进制值,即得011101001101110010,这个011101001101110010就是乐段拼合信息的值。本段所述也是技术常识。The character set represented by the code system of the general barcode is 10 decimal values (0-9), or the character set represented is dozens of ascii characters, you can build a table to store 10 decimal values (0-9) or dozens of ascii Characters are assigned binary numbers in order, that is, the corresponding relationship between the barcode and the binary value is established; in this way, the value of several pieces of music information in binary format or the transformation value of the value of several pieces of information in binary format or A part of the value of the music piece combination information in binary format, etc. is converted into a value in barcode format, and put into the barcode. For example, if the character set represented by the code system of the barcode is 10 decimal values (09), then the decimal 07 in the character set represented by the barcode can be used to correspond to the binary 000-111, such as the value of the music piece combination information is 011101001101110010 ( The value here is just an example), then each group of 3 BITs is converted into a decimal value 351562, and this 351562 is the value put into the barcode. Correspondingly, if the decimal value in the barcode is 351562 when reading the code, you can Convert each of its decimal numbers into a 3BIT binary value, that is, 011101001101110010, which is the value of the segment combination information. This paragraph is also technical common sense.

显然数个乐段拼合信息的值,一个乐段拼合信息的值的部分,数个乐段拼合信息的值的变换值,数个乐段拼合信息的值的变换值的部分等都可以是二进制格式的。Apparently, the value of several pieces of combination information, the part of the value of one piece of combination information, the transformation value of the value of several pieces of combination information, the transformation value of the value of several pieces of combination information, etc. can all be binary formatted.

如图1所示,是一种有声音乐读物的实例,作为此有声音乐读物的主体的读物是乐谱书,乐谱书的页面上印有的码字是条形码57,当前两页面每页面各有2个条形码(当然如是其它情况比如左边页面上有3个条形码而右边页面上有2个条形码,都是可以的),图1中的44显然是有声音乐读物的页面上印刷的乐谱,此有声音乐读物页面上的每个条形码可被相应拾音器读取以用于拾音器播放音乐,此有声音乐读物页面上的每个条形码的值所表示的信息是与拼合信息的值有关的信息,此有声音乐读物页面上的每个条形码的值是此有声音乐读物的某一乐段的拼合信息的值的一部分。As shown in Figure 1, be a kind of example of audiobook, the reading material as the main body of this audiobook is music book, the code word that is printed on the page of music book is bar code 57, each page of current two pages has 2 barcode (certainly in other cases such as 3 barcodes on the left page and 2 barcodes on the right page, all possible), 44 in Fig. 1 is obviously the score printed on the page of the audio music book, this audio music Each barcode on the page of the reading material can be read by the corresponding pickup for the pickup to play music. The information represented by the value of each barcode on the page of the audio music book is information related to the value of the combined information. This audio music book The value of each barcode on the page is part of the value of the stitching information for a certain passage of this audiobook.

把有声音乐读物所有页面上的所有其码值表示的信息是与拼合信息的值有关的信息的码字都集合起来,就是本申请文件各处提及的有声音乐读物的整页面上的诸个码字,有声音乐读物的整页面上的诸个码字与读物的整页面上的诸个码字是同一概念(此读物是作为有声音乐读物的主体的),即把读物所有页面上的所有其码值表示的信息是与拼合信息的值有关的信息的码字都集合起来,就是本申请文件各处提及的读物的整页面上的诸个码字,诸个码字中每个码字的码值表示的信息是与拼合信息的值有关的信息,诸个码字可以是一个或多个码字,比如某个有声音乐读物共有4个页面,各页面分别有2,3,0,1个码字,如这6个(2+3+1)码字的码值都是与拼合信息的值有关的,那此有声音乐读物的整页面(整页面只包括那4个页面)上的诸个码字就是这6个码字;一本有声音乐读物的整页面上的诸个码字中的各码字,一般是同一码种的码字,比如都是条形码,但如在一本有声音乐读物的整页面上存在多种码种的码字也可(即诸个码字中含有多种码种的码字),比如既有条形码也有图形码,一般可以用带有条形码读码头的拾音器和带有光电式图形码读码头的拾音器分别读取这两种码字;在本申请文件中使用‘诸个码字’的概念,是为了方便区别于本申请文件各处提及的单个码字。All the information represented by the code value on all the pages of the audio music reading is the code words of the information related to the value of the combined information, which is exactly all the code words on the entire page of the audio music reading mentioned in this application document. Codewords, the codewords on the entire page of the audiobook and the codewords on the entire page of the reading material are the same concept (this reading material is the main body of the audiobook), that is, all the codewords on all pages of the reading material The information represented by its code value is that the code words of the information related to the value of the combined information are all assembled, which is exactly the code words on the entire page of the reading material mentioned in this application document everywhere, and each code word in the various code words The information represented by the code value of the word is the information related to the value of the combined information, and each code word can be one or more code words, such as a certain audio music book has 4 pages, and each page has 2, 3, 0 pages respectively. , 1 codeword, if the code values of these 6 (2+3+1) codewords are all related to the value of the combined information, then the whole page of the audio music book (the whole page only includes those 4 pages) The code words on the above are exactly these 6 code words; each code word in the code words on the whole page of an audio music book is generally the code words of the same code type, such as all bar codes, but as in It is also possible to have code words of multiple code types on the entire page of an audio book (that is, code words of multiple code types in the code words), for example, there are both bar codes and graphic codes. The pickup of the reading dock and the pickup with the photoelectric graphic code reading dock read these two codewords respectively; the concept of "codewords" is used in this application document for the purpose of making it easy to distinguish from the codewords mentioned throughout the application document. and a single codeword.

诸个码字显然可以是一个或多个码字。诸个码字中的各码字显然可以是全部都处于有声音乐读物的一个页面上或分散处于有声音乐读物的多个页面上;而且诸个码字中的各码字显然可以是分散处于有声音乐读物的全部页面上或部分页面上。The codewords can obviously be one or more codewords. Each code word among all code words can obviously be all on one page of audiobook or be scattered on a plurality of pages of audiobook; On all or part of the pages of a music book.

在权利要求书和本说明书中各处提及的‘部诸码字’,是一种称谓,是指有声音乐读物的整页面上的诸个码字中的全部码字或有声音乐读物的整页面上的诸个码字中的部分码字,部诸码字中的每个码字是都能被同一种拾音器读取以用于此拾音器播放音乐发音的(本句说的拾音器是指某一种拾音器);在权利要求书和本说明书中各处提及有关拾音器读取有声音乐读物的整页面上的部诸码字中的码字,此部诸码字是指此有声音乐读物的整页面上的诸个码字中能被此拾音器读取以用于此拾音器播放音乐发音的所有码字中的全部码字或部分码字(至于把诸个码字中能被此拾音器读取以用于此拾音器播放音乐发音的所有码字中的哪些码字划到此部诸码字集合里,并无特别规定,划多划少随便;而且一遇到诸个码字中几个可被某拾音器读取以用于此拾音器播放音乐发音的码字,就可把这几个码字集合起来定义成此拾音器能读的部诸码字来进行讨论)。本申请文件引入部诸码字概念的原因是,因为一本有声音乐读物的整页面上的诸个码字中可能只有一个码种的码字,或有多个码种的码字(比如既有图形码也有导电码也有条形码等),一般情况,一种拾音器只有一种读码头只能读取一个码种的码字(比如机电式导电码读码头可读导电码)或可读两个码种的码字(比如光电式读码头可读导电码和条形码),而且诸个码字中能被某拾音器读码的码字,其码值不一定可用于此拾音器播放音乐(比如码字的值是数个乐段拼合信息的值,而此拾音器不是专门做来读这种形式的码字的而且不知道这种形式的码字里各个BIT是怎么表示数据的,所以此拾音器读此码字后虽然读出了码字里的各二进制BIT但不知这些BIT何意,不能解码播音乐,这里这种形式的码字是专门用于被另外的拾音器读取以解码播音乐的),即有声音乐读物的整页面上的诸个码字中的全部或部分码字中的每个码字能被同一种拾音器读取以用于此拾音器播放音乐发音,所以为撰些权利要求书的方便,就引入部诸码字的概念。部诸码字中的各码字可能全是属于一个码种的码字,或是分属于多个码种的码字(比如部诸码字中有些码字是导电码,有些码字是条形码,而拾音器的读码头是光电式读码头,可以分别读这两种码字)。本段各处提到的拾音器读码播音乐这类说法,都是指拾音器读一个(或多个)码字后根据这一个(或多个)码字里的拼合信息的值解码播音乐。The "all codewords" mentioned in the claims and this specification are a kind of appellation, referring to all the codewords in the codewords on the entire page of the audio music book or the entire audio music book A part of the codewords in the codewords on the page, each codeword in the codewords of the part can be read by the same pickup for the pronunciation of the pickup playing music (the pickup in this sentence refers to a certain A kind of pickup); In the claims and this description, mention the codewords in the codewords in the codewords on the whole page of the whole page of the audio music book about which the relevant pickup reads, and these codewords refer to the codewords of this audio music book All codewords on the whole page can be read by this pickup for all codewords or part of codewords in all codewords for this pickup to play music pronunciation (as for all codewords can be read by this pickup in all codewords) Which codewords in all codewords for this pickup to play music pronunciation are drawn in the collection of all codewords of this part, there is no special regulation, and it is casual to draw more and less casually; Read by a pickup to be used for the codeword of this pickup to play the music pronunciation, just these several codewords can be assembled and defined as the codewords of this pickup can be read and discussed). The reason why this application document introduces the concept of all codewords is because there may be only one codeword among the codewords on the entire page of an audio music book, or there may be multiple codewords (such as both There are graphic codes, conductive codes, and barcodes, etc.), in general, a pickup has only one reading terminal and can only read one type of code (such as electromechanical conductive code reading terminals can read conductive codes) or can read two The code word of code type (such as photoelectric reading terminal can read conductive code and bar code), and the code word that can be read by a certain pickup in various code words, its code value may not be used for this pickup to play music (such as code word The value of is the value of several pieces of information combined, and this pickup is not specially designed to read this form of codeword and does not know how each BIT in this form of codeword represents data, so this pickup reads this After the code word, although the binary BITs in the code word are read out, I don’t know what these BITs mean, and I can’t decode the broadcast music. Here, this form of code word is specially used to be read by another pickup to decode the broadcast music), That is, each code word in all or part of the code words on the entire page of the audio music book can be read by the same pickup for the playback of music by the pickup, so for the purposes of these claims Conveniently, just introduce the concept of all codewords. Each codeword in all codewords may all be codewords belonging to one codeword, or codewords belonging to multiple codewords (for example, some codewords in all codewords are conductive codes, and some codewords are barcodes. , and the reading dock of the pickup is a photoelectric reading dock, which can read these two codewords respectively). The phrases such as pickup reading code and playing music mentioned in this paragraph all refer to decoding and playing music according to the value of the spliced information in the one (or multiple) codewords after the pickup reads one (or more) codewords.

部诸码字显然可以是一个或多个码字。部诸码字中的各码字显然可以是全部都处于有声音乐读物的一个页面上或分散处于有声音乐读物的多个页面上;而且部诸码字中的各码字显然可以是分散处于有声音乐读物的全部页面上或部分页面上。All codewords can obviously be one or more codewords. Each code word in the code words of the part can obviously be all on a page of the audio music book or be scattered on the multiple pages of the audio music book; and each code word in the code words of the part can obviously be scattered in the audio music book On all or part of the pages of a music book.

显然‘诸个码字’的概念和‘部诸码字’的概念都是显而易见的概念。Obviously the concept of ' codewords ' and the concept of ' all codewords ' are all obvious concepts.

部诸码字可能有这几种情况(还可能有除这3种情况外的其它情况):1是对于部诸码字中的每个码字,都有此特征:拾音器读此一个码字后就可播音乐(比如部诸码字中每个码字的值是一个乐段拼合信息的值);2是对于部诸码字中的每个码字,都有此特征:拾音器读此一个码字后还需读此部诸码字中的其它码字由这多个码字得到乐段拼合信息的值才可播音乐(比如部诸码字中只含有3个码字,一个乐段拼合信息的值的BIT被分散放在这3个码字的值里,这3个码字中每个码字的值是这个乐段拼合信息的值的部分);3是对于部诸码字中的部分码字中的每个码字,都有此特征:拾音器读此一个码字后就可播音乐(比如这部分码字中每个码字的值是一个乐段拼合信息的值),而对于部诸码字中的另外部分码字中的每个码字,都有此特征:拾音器读此一个码字后还需读部诸码字中的其它码字由这多个码字得到乐段拼合信息的值才可播音乐(比如这部分码字中每个码字的值是一个乐段拼合信息的值的部分),这里第3种情况可举一例:如部诸码字中含有3个码字(码字1,码字2,码字3),乐段拼合信息的值1的BIT分开放在这3个码字里,乐段拼合信息的值2的BIT也放在码字1里,那码字1可看成部诸码字中的一个部分,码字2和码字3可看成部诸码字中的另一部分,读前一部分中码字1的值后即可根据码值里的乐段拼合信息的值2解码播音乐,读后一部分中的任一码字(码字2或码字3)后,都需读另外两个码字才能由这3个码字的值组合起来得到乐段拼合信息的值1来解码播音乐(即如果先读码字2就还需读码字3和码字1,如果先读码字3就还需读码字2和码字1;一般在码字2和码字3里有BIT标志表示需读多个码字),而且因为把码字1也读进来了,所以播的音乐还可能包括码字1里的拼合信息的值2对应的乐段发音(这要看具体需要)。All code words of part may have these several situations (also may have other situations except these 3 kinds of situations): 1 is for each code word in all code words of part, has this feature: pickup reads this code word Just can broadcast music (such as the value of each codeword in all codewords of all parts is the value of a music section combination information); After a code word, also need to read other code words in all code words of this part, obtain the value of the music segment combination information by these many code words and just play music (for example, only contain 3 code words in all code words of the part, one music The BIT of the value of the segment combination information is dispersed in the value of these 3 code words, and the value of each code word in these 3 code words is the part of the value of this music segment combination information); 3 is for all codes of the part Each code word in the part code word in the word has this characteristic: just can broadcast music (such as the value of each code word in this part code word is the value of a music section splicing information) after the pickup reads this code word ), and for each code word in other part code words in all code words, this characteristic is arranged: after the pickup reads this code word, also need to read other code words in all code words by these multiple code words Only when the value of the music segment combination information is obtained can the music be played (for example, the value of each code word in this part of the code word is a part of the value of the music segment combination information), the 3rd case here can give an example: such as all codes Contains 3 codewords (codeword 1, codeword 2, codeword 3) in the word, the BIT of the value 1 of the music section splicing information is placed separately in these 3 codewords, and the BIT of the value 2 of the music section splicing information is also Put in codeword 1, then codeword 1 can be regarded as a part in all codewords of department, codeword 2 and codeword 3 can be regarded as another part in all codewords of department, read codeword 1 in the previous part After reading the value, you can decode and play the music according to the value 2 of the music segment in the code value. After reading any code word (code word 2 or code word 3) in the latter part, you need to read the other two code words to be able to play music. The values of these 3 codewords are combined to obtain the value 1 of the segment combination information to decode and play music (that is, if you read codeword 2 first, you need to read codeword 3 and codeword 1, if you read codeword 3 first, you need to read Read codeword 2 and codeword 1; generally there are BIT signs in codeword 2 and codeword 3 to indicate that multiple codewords need to be read), and because codeword 1 is also read in, the music played may also include codewords The pronunciation of the phrase corresponding to the value 2 of the combined information in word 1 (this depends on specific needs).

上段所述也是对权利要求书中多处提到的‘所述拾音器在被使用时,所述拾音器的所述单片机芯片读得所述部诸码字中的任一码字后,都不需再读所述部诸码字中的其它码字即可播音乐,或者,所述拾音器在被使用时,所述拾音器的所述单片机芯片读得所述部诸码字中的任一码字后,都需再读所述部诸码字中的其它数个码字才可播音乐,或者,所述拾音器在被使用时,所述拾音器的所述单片机芯片读得所述部诸码字中的部分码字中的任一码字后都不需再读所述部诸码字中的其它码字即可播音乐,所述拾音器的所述单片机芯片读得所述部诸码字中另外部分码字中的任一码字后都需再读所述部诸码字中的其它数个码字才可播音乐;’的解释。What is said in the last paragraph is also to the claims mentioned in many places: 'when the pickup is in use, after the single-chip microcomputer chip of the pickup reads any codeword in the codewords of the parts, it does not need to Read other codewords in the all codewords of the described part again to play music, or, when the pickup was used, the single-chip microcomputer chip of the pickup read any codeword in the all codewords of the described part Afterwards, it is necessary to read other several code words in the code words to play music again, or, when the pickup is used, the single-chip microcomputer chip of the pickup reads the code words of the pickup No need to read any other code words in the code words in the part of the code words to play music, and the single-chip microcomputer chip of the sound pick-up can read the code words in the code words of the parts. All need to read other several codewords in the codewords of the said part and just play music after any codeword in the codewords in addition;

某拾音器能读的部诸码字可以是只包括一个码字,此拾音器读这一个码字就可播音乐,此部诸码字中的一个码字就是指这仅有的一个码字。All codewords of a part that a certain pickup can read can only comprise a codeword, and this pickup reads this codeword and just can play music, and a codeword in this part of all codewords refers to this unique codeword.

所使用的拾音器的结构,特点(有关拾音器结构特点的叙述从下一段起至第14页止):The structure and characteristics of the pickup used (the description of the structural characteristics of the pickup starts from the next paragraph and ends on page 14):

拾音器的电路实现:拾音器一般包括一个读码头,一个单片机芯片(拾音器的主处理芯片),一个发音器件,存储器,按键模块等。(按键模块一般由一个或多个按键组成,如由超过两个按键组成,那一般是接成按键矩阵形式的,这是常识,并且对于拾音器来说,按键模块不是必需的;这里的存储器是指单片机芯片外拾音器内的存储器,如单片机芯片内无存储器,那单片机芯片外拾音器内的存储器就是必需的,如单片机芯片内有存储器,那单片机芯片外拾音器内的存储器是可根据情况可有可无的,这里有关单片机芯片外拾音器内的存储器取舍的处置也属于技术常识)。The circuit realization of the pickup: the pickup generally includes a reading dock, a single-chip microcomputer chip (the main processing chip of the pickup), a pronunciation device, a memory, a button module, and the like. (The button module is generally composed of one or more buttons. If it is composed of more than two buttons, it is generally connected in the form of a button matrix. This is common sense, and for the pickup, the button module is not necessary; the memory here is Refers to the memory in the pick-up outside the single-chip microcomputer chip. If there is no memory in the single-chip microcomputer chip, the memory in the pick-up outside the single-chip microcomputer chip is necessary. If there is memory in the single-chip microcomputer chip, the memory in the pickup outside the single-chip microcomputer chip is optional according to the situation. No, the handling of the memory selection in the pickup on the chip of the single-chip microcomputer here also belongs to technical common sense).

本申请文件各处所指单片机芯片(即拾音器的主处理芯片),可以是MCU,CPU等,一般是指MCU,所以本说明书各处在技术阐述中以MCU作为单片机芯片的实施例。本申请文件各处有提及的MCU(采用大写字母的,其前面没‘主’字的)或主MCU(其中的字母是采用大写字母的)或主MCU芯片(其中的字母是采用大写字母的)都是代指作为拾音器的主处理芯片的单片机芯片,而采用小写字母的mcu,也是指单片机芯片但不是专指作为拾音器的主处理芯片的单片机芯片的;除非提及时有特别指明有他义,本申请文件各处提及的单片机芯片默认是指作为拾音器的主处理芯片的单片机芯片。The single-chip microcomputer chip (i.e. the main processing chip of the pickup) referred to in this application document everywhere can be MCU, CPU etc., generally refers to MCU, so this description everywhere uses MCU as the embodiment of single-chip microcomputer chip in technical elaboration. The MCU (in capital letters, without the word 'main' in front of it) or the main MCU (in which the letters are in capital letters) or the main MCU chip (in which the letters are in capital letters) mentioned throughout this application document ) all refer to the single-chip microcomputer chip as the main processing chip of the pickup, and the mcu in lowercase letters also refers to the single-chip microcomputer chip but not specifically refers to the single-chip microcomputer chip as the main processing chip of the pickup; unless it is specifically specified when it is mentioned Meaning, the single-chip microcomputer chip mentioned everywhere in this application document refers to the single-chip microcomputer chip as the main processing chip of the pickup by default.

单片机芯片一般应有按音乐合成方式(一般是按播谱数据方式)来播放音乐(接喇叭发音)的功能,如没有此功能那一般应可外接播放音乐的芯片;单片机需有足够多的IO脚完成键扫描,读码等功能。The MCU chip should generally have the function of playing music (connected to the speaker to pronounce) according to the method of music synthesis (usually according to the method of broadcasting spectrum data). The foot completes key scanning, code reading and other functions.

很显然,本申请文件各处所指存储器,是指其存储的数据在掉电后不消失的存储器,比如ROM,也可以是EPROM,EEPROM或FLASH等,而且本申请文件各处所指一个存储器,一般是指一个与外界相对独立的一定数量存储单元的集合,比如一个存储器芯片;按常识,一个单片机工作系统中可有一个存储器或可有处于不同位置的多个存储器,比如单片机芯片内部和外部各有存储器,或单片机芯片内无存储器需外挂存储器,有的单片机芯片内存储器足够大,不需外挂存储器;闪存(FLASH)也属于本申请文件各处所指存储器的概念。而象RAM这样的暂存器,按常识默认包含在单片机芯片里或作为单片机芯片周边的惯例附加芯片,本申请文件不特别单独提及。本段所述都是技术常识。Obviously, the memory referred to in this application document refers to the memory whose stored data does not disappear after power failure, such as ROM, which can also be EPROM, EEPROM or FLASH, etc., and the application document refers to a memory everywhere. , generally refers to a collection of a certain number of storage units that are relatively independent from the outside world, such as a memory chip; according to common sense, there can be one memory or multiple memories in different locations in a single-chip microcomputer operating system, such as inside the single-chip microcomputer chip and There are external memories, or there is no memory in the single-chip microcomputer chip and an external memory is required. The memory in the single-chip microcomputer chip that has is large enough and does not need external memory; However, temporary registers such as RAM are included in the single-chip microcomputer chip by default according to common sense or as a conventional additional chip around the single-chip microcomputer chip, and this application document does not specifically mention it separately. This paragraph is all technical knowledge.

在权利要求2,6中提及的‘此存储器集里的各存储器可被前述拾音器的前述单片机芯片读取使用’,是指,既可以是在单片机芯片外的存储器(一般是存储器芯片),可通过串行连线或并行总线等方式与单片机芯片连接并被单片机芯片读取使用,也可以是位于单片机芯片内的存储器,可被单片机芯片本身读取使用,还可以是其它情况,这涉及的本来也是技术常识。In claim 2, mentioned in claim 6, 'each memory in this memory set can be read and used by the aforementioned single-chip microcomputer chip of the aforementioned sound pick-up' means that it can be a memory (generally a memory chip) outside the single-chip microcomputer chip, It can be connected to the single-chip microcomputer chip through serial connection or parallel bus, and can be read and used by the single-chip microcomputer chip. It is also technical common sense.

本申请文件有关存储器集的概念属于显而易见的概念。The concept of memory sets in this document is an obvious one.

对于拾音器,本申请文件各处所说的与其有关的存储器集,是指作为拾音器的主处理芯片的单片机芯片能读取的所有存储器;存储器集可以是单片机芯片内的数个存储器,或可以是单片机芯片外拾音器内的数个存储器(一般是存储器芯片),或可以是单片机芯片内的数个存储器加上单片机芯片外拾音器内的数个存储器;拾音器涉及的程序及拾音器涉及的数据存储在此存储器集里;对于在单片机芯片外拾音器内的有些存储器,这些存储器能被单片机芯片读取,这些存储器属于存储器集,但可能没存储有用信息。For the pickup, the memory sets related to it mentioned throughout the application documents refer to all memory that can be read by the single-chip microcomputer chip as the main processing chip of the pickup; the memory collection can be several memories in the single-chip microcomputer chip, or can be single-chip microcomputer Several memories (usually memory chips) in the pickup outside the chip, or several memories in the single-chip microcomputer chip plus several memories in the pickup outside the single-chip microcomputer chip; the programs related to the pickup and the data related to the pickup are stored in this memory In the set; For some memories in the pickup outside the single-chip microcomputer chip, these memories can be read by the single-chip microcomputer chip, and these memories belong to the memory set, but may not store useful information.

拾音器在使用时,需用到存储器集,存储器集里存储有拾音器涉及的程序及拾音器涉及的数据,由单片机芯片执行存储器集里的程序来运行及读码等,并可由单片机芯片调存储器集里的解码程序及解码数据来解码播音乐;这也是技术常识。When the pickup is in use, it needs to use a memory set. The program and data related to the pickup are stored in the memory set. The program in the memory set is executed by the single-chip microcomputer chip to run and read codes, etc., and the memory set can be adjusted by the single-chip microcomputer chip. The decoding program and decoding data to decode and play music; this is also technical common sense.

拾音器涉及的程序(也是指作为拾音器的主处理芯片的单片机芯片涉及使用的程序),一般包括读码,拾音器运行主控,发音等程序,还包括解码程序。拾音器涉及的数据(也是指作为拾音器的主处理芯片的单片机芯片涉及使用的数据),一般包括解码数据,还可以包括其它涉及的数据等。The program involved in the pickup (also refers to the program involved in the use of the single-chip microcomputer chip as the main processing chip of the pickup), generally includes code reading, pickup operation master control, pronunciation and other programs, and also includes a decoding program. The data related to the pickup (also referring to the data related to the use of the single-chip microcomputer chip as the main processing chip of the pickup) generally includes decoding data, and can also include other related data.

如提及时无特别指明含有他义,本申请文件各处提及的‘拾音器内有存储器’,一般可有三种情况:1是MCU片内有存储器,MCU片外拾音器内无存储器,2是MCU片内无存储器,MCU片外拾音器内有存储器,3是MCU片内有存储器,MCU片外拾音器内也有存储器;如提及时无特别指明含有他义,本申请文件各处提及的‘拾音器内无存储器’,是指MCU片内以及MCU片外拾音器内都无存储器。If there is no special indication of other meanings when mentioned, the "storage in the pickup" mentioned in this application document can generally have three situations: 1. There is a storage in the MCU chip, and there is no storage in the pickup outside the MCU. 2. The MCU There is no memory in the chip, and there is a memory in the MCU off-chip pickup. 3 is that there is a memory in the MCU chip, and there is also a memory in the MCU off-chip pickup; if there is no special indication when it is mentioned, it has other meanings, and the "in the pickup" mentioned throughout this application document No memory' means that there is no memory in the MCU chip and in the pickup outside the MCU chip.

与上段对应的,如提及时无特别指明含有他义,本申请文件各处提及的‘拾音器内的存储器’,一般也可有三种情况:1是只指MCU片内的数个存储器(要求MCU片内有存储器),2是只指MCU片外拾音器内的数个存储器(要求MCU片外拾音器内有存储器),3是指MCU片内的数个存储器加上MCU片外拾音器内的数个存储器(要求MCU片外拾音器内有存储器而且MCU片内也有存储器);如拾音器内有存储器,那拾音器内的存储器,显然有一个或多个。Corresponding to the last paragraph, if there is no special indication to contain other meanings when mentioned, the 'memory in the pickup' mentioned in this application document can generally have three situations: 1 only refers to several memories in the MCU chip (required There is memory in the MCU chip), 2 means only several memories in the MCU off-chip pickup (requires that there is a memory in the MCU off-chip pickup), 3 means several memories in the MCU chip plus the number in the MCU off-chip pickup A memory (requires that there is memory in the MCU off-chip pickup and there is also memory in the MCU chip); if there is a memory in the pickup, obviously there are one or more memories in the pickup.

拾音器内须有存储器集,拾音器涉及的程序及涉及的数据是存储在存储器集里的一个或多个存储器里的(至于如何把拾音器涉及的程序及涉及的数据分散存储到多个存储器里是根据需要来定的,这是技术常识)。There must be a memory set in the pickup, and the programs and data involved in the pickup are stored in one or more memories in the memory set (as for how to disperse and store the programs and data involved in the pickup into multiple memories according to Need to decide, this is technical common sense).

在与拾音器有关的存储器集里需存拾音器涉及的程序(一般包括读码,拾音器运行主控等程序,还有发音涉及的解码等程序)及这些程序使用的数据。以上七段所述都是技术常识。In the memory set related to the pickup, the programs related to the pickup (generally including code reading, programs such as the pickup running master control, and programs such as decoding involved in pronunciation) and the data used by these programs need to be stored in the memory set related to the pickup. The above seven paragraphs are all technical common sense.

本申请文件各处述及有关某一拾音器的相关技术时提及的有用信息,显然是指拾音器涉及的程序,或拾音器涉及的数据,或部分拾音器涉及的程序,或部分拾音器涉及的数据,或拾音器涉及的程序加上部分拾音器涉及的数据,或部分拾音器涉及的程序加上拾音器涉及的数据,或部分拾音器涉及的程序加上部分拾音器涉及的数据,或拾音器涉及的程序加上拾音器涉及的数据。(本段内后面提到的拾音器都是指本段内前面提到的拾音器)The useful information mentioned in this application document when referring to the related technology of a certain pickup obviously refers to the program involved in the pickup, or the data involved in the pickup, or the program involved in some pickups, or the data involved in some pickups, or Pickup-related program plus partial pickup-related data, or partial pickup-related program plus pickup-related data, or partial pickup-related program plus partial pickup-related data, or pickup-related program plus pickup-related data . (The pickups mentioned later in this paragraph all refer to the pickups mentioned earlier in this paragraph)

除非提及时有特别指明有他义,本申请文件各处所提及的拾音器涉及的程序默认是指作为拾音器的主处理芯片的单片机芯片涉及使用的所有程序;除非提及时有特别指明有他义,本申请文件各处所提及的拾音器涉及的数据默认是指作为拾音器的主处理芯片的单片机芯片涉及使用的所有数据。Unless otherwise specified when mentioned, the programs involved in the pickup mentioned in this application document refer to all programs involved in the use of the single-chip microcomputer chip as the main processing chip of the pickup by default; unless otherwise specified when mentioned , the data related to the pickup mentioned in various parts of this application document refers to all the data related to the use of the single-chip microcomputer chip as the main processing chip of the pickup by default.

与拾音器有关的存储器集里应存储有对用读码头读码得到的乐段拼合信息的值进行解码用的解码程序及解码数据等。编码规则及编码数据的技术阐述,以及解码规则及解码数据的技术阐述都可参照本说明书后面所述。The memory set related to the pickup should store the decoding program and decoding data etc. used for decoding the value of the piece combination information obtained by reading the code with the reading dock. For the technical description of the encoding rules and encoded data, as well as the technical descriptions of the decoding rules and decoded data, please refer to the description later in this specification.

拾音器的读码头是用于读取有声音乐读物整页面上的码字的值的,读码头可以是光电式读码头(比如光笔或ccd读码头),以读取有声音乐读物整页面上的条形码或导电码(导电码如要被光电式读码头读取,要求其导电物质的颜色是与导电码的载体平面的颜色不同的),也可以是本说明书阐述的机电式导电码读码头,以读取有声音乐读物整页面上的导电码,或其它的读码头。The reading terminal of the pickup is used to read the value of the code word on the entire page of the audio music book, and the reading terminal can be a photoelectric reading terminal (such as a light pen or a ccd reading terminal) to read the barcode on the entire page of the audio music book Or conductive code (if the conductive code is to be read by the photoelectric reading terminal, it is required that the color of its conductive material is different from the color of the carrier plane of the conductive code), it can also be the electromechanical conductive code reading terminal described in this manual, with Reads conductive codes on entire pages of audiobooks, or other reading codes.

单片机芯片(MCU),是拾音器的主处理芯片,是对整机的读码,码值处理,发音等的主控,一般用一个MCU即可。The single-chip microcomputer chip (MCU) is the main processing chip of the pickup, and it is the main control of the whole machine's code reading, code value processing, pronunciation, etc. Generally, one MCU is enough.

单片机芯片(MCU)一般应有按音乐合成方式(一般是按播谱数据方式)来播放音乐的功能(这种情况,MCU一般外接喇叭,发音器件是喇叭,由MCU驱动喇叭发音),如没有此功能那一般应可MCU外接音乐芯片,这种情况,发音器件是MCU外接的带喇叭的音乐芯片,使用时一般是由MCU用通讯线把解码得到的解码结果(一般是乐段的谱数据)顺序传给MCU外接的音乐芯片,由音乐芯片用接受到的解码结果按音乐合成方式自行驱动喇叭发音,这些都属技术常识。The single-chip microcomputer chip (MCU) should generally have the function of playing music according to the method of music synthesis (usually according to the method of broadcasting spectrum data) (in this case, the MCU is generally connected to an external speaker, and the sounding device is a speaker, which is driven by the MCU). This function should generally be able to connect the MCU to an external music chip. In this case, the pronunciation device is a music chip with a speaker connected to the MCU. When using it, the decoding result (usually the spectrum data of the music section) is usually decoded by the MCU with a communication line. ) in sequence to the external music chip connected to the MCU, and the music chip uses the received decoding result to drive the speaker to produce sound according to the music synthesis method. These are all technical common sense.

单片机芯片(MCU)与读码头意向连接,单片机芯片(MCU)与发音器件意向连接。本申请文件使用‘意向连接’词是为了撰写权利要求书的方便,所称意向连接,是指直接连接或间接连接,单片机芯片与读码头意向连接即单片机芯片与读码头直接连接或间接连接,单片机芯片与发音器件意向连接即单片机芯片与发音器件直接连接或间接连接;本申请文件各处所说的两物件之间的直接连接,是指两者之间的连线都是直接相连的,本申请文件各处所说的两物件之间的间接连接,是指两者之间的连线都是通过中继元件来相连的(可以是两者之间的所有连线都是通过同一中继元件来相连的,也可以是两者之间的不同连线是通过不同中继元件来相连的),两者之间通过中继元件仍能传递信息,间接连接可看成是直接连接的等同替换,另如两者之间的有些连线是直接连起来的,有些连线是通过中继元件来连接的,那这种情况与本段前面述的两物件之间的直接连接是等同替换;比如读码头为光笔时,可以直接从读码头连线到MCU,是直接连接;也可以从光笔到MCU是通过总线缓冲器芯片相连的(即从光笔先连线到总线缓冲器芯片,从总线缓冲器芯片再连线到MCU),这时就是间接连接。MCU与发音器件一般是直接连接(比如MCU直接接喇叭,喇叭是发音器件),而如MCU是经过IO扩展芯片(比如类似于8255的芯片)来驱动带喇叭的音乐芯片播放音乐,中间多了个IO扩展芯片所以可称MCU与发音器件是间接连接(这时发音器件应是带喇叭的音乐芯片),是直接连接的等同替换,这也是单片机技术的常规替代技术。另外单片机芯片与读码头之间甚至可以是无线连接比如红外连接,其读码头较复杂,需自己读码,并需有无线通讯电路,这种红外连接可看成是直接连接的等同替换。(本段所说的‘中继元件’比如可以是可编程逻辑器件,总线缓冲器芯片,IO扩展芯片等)The single-chip microcomputer chip (MCU) is intentionally connected with the reading terminal, and the single-chip microcomputer chip (MCU) is intentionally connected with the pronunciation device. The word "intended connection" used in this application document is for the convenience of writing claims. The so-called intentional connection refers to direct connection or indirect connection. The intentional connection between the single-chip microcomputer chip and the pronunciation device means that the single-chip microcomputer chip and the pronunciation device are directly or indirectly connected; the direct connection between the two objects mentioned in this application document means that the connection between the two is directly connected. The indirect connection between two objects mentioned throughout the application documents means that the connection between the two is connected through a relay element (it can be that all the connections between the two are through the same relay element) It can also be that different connections between the two are connected through different relay elements), and information can still be transmitted between the two through relay elements, and the indirect connection can be regarded as an equivalent replacement of the direct connection , and if some connections between the two are directly connected, and some connections are connected through relay components, then this situation is equivalent to the direct connection between the two objects mentioned above in this paragraph; For example, when the reading terminal is a light pen, it can be directly connected from the reading terminal to the MCU; it can also be connected from the light pen to the MCU through the bus buffer chip (that is, from the light pen to the bus buffer chip first, and from the bus The buffer chip is then connected to the MCU), which is an indirect connection at this time. The MCU and the sounding device are generally directly connected (for example, the MCU is directly connected to the speaker, which is the sounding device), and the MCU drives the music chip with the speaker to play music through an IO expansion chip (such as a chip similar to 8255). An IO expansion chip can be said to be an indirect connection between the MCU and the pronunciation device (the pronunciation device should be a music chip with a speaker at this time), which is an equivalent replacement of the direct connection, and this is also a conventional replacement technology for single-chip technology. In addition, there can even be a wireless connection between the single-chip microcomputer chip and the reading terminal, such as an infrared connection. The reading terminal is more complicated. It needs to read the code by itself and needs a wireless communication circuit. This infrared connection can be regarded as an equivalent replacement of the direct connection. (The 'relay element' mentioned in this paragraph can be, for example, a programmable logic device, a bus buffer chip, an IO expansion chip, etc.)

拾音器内的发音器件可以是喇叭(参看本说明书的实施例,本申请文件所说的作为发音器件的喇叭,可由喇叭电路等来作等同替换,喇叭电路是指由喇叭与一些简单的阻容等元件组成的,比如由喇叭和1K电阻和8050组成一个喇叭电路来代替喇叭),必要时也可以是MCU外接的带喇叭的音乐芯片,还可能是其它类型的发音器件;这都是技术常识。在权利要求2,6中述及的‘并由前述拾音器的前述单片机芯片根据读得的前述部诸码字中的码字的码值使前述拾音器的前述发音器件播放音乐发音’,可以是由MCU用解码得到的解码结果(比如乐段的谱数据或其它形式的解码结果)来播放乐段发音,在MCU芯片内有播放音乐的硬件电路,并且MCU芯片外接喇叭,由MCU根据解码得到的全部解码结果来驱动喇叭把音乐发出来(这种情况喇叭是发音器件),这种情况是整步发音方式;也可以是由MCU用解码得到的解码结果(比如乐段的谱数据或其它形式的解码结果)来播放乐段发音,在MCU芯片内有播放音乐的硬件电路,并且MCU芯片外接喇叭,MCU按乐段的拼合信息的值解码时,每得到解码结果的一部分BIT即拿去由MCU驱动喇叭播音,按这样直至解码结束而完成播乐段发音(这种情况喇叭是发音器件),这种情况是分步发音方式;或者是MCU外接带喇叭的音乐芯片,可由MCU把解码得到的解码结果(比如乐段的谱数据或其它形式的解码结果)通过通讯线传给此外接的音乐芯片(此音乐芯片可按解码结果播音乐),由MCU外接的此音乐芯片按MCU传给它的解码得到的全部解码结果驱动此音乐芯片外接的喇叭播乐段发音(这种情况MCU外接的带喇叭的音乐芯片是发音器件),这种情况是整步发音方式;或者是MCU外接带喇叭的音乐芯片,MCU按乐段的拼合信息的值解码时,每得到解码得到的解码结果(比如乐段的谱数据或其它形式的解码结果)的一部分BIT即由MCU把它通过通讯线传给此外接的音乐芯片,由MCU外接的此音乐芯片按MCU传给它的解码结果的一部分BIT驱动此音乐芯片外接的喇叭发音,按这样直至解码结束而完成播乐段发音(这种情况MCU外接的带喇叭的音乐芯片是发音器件),这种情况是分步发音方式;或者可以是用本说明书后面述的组合形式的发音器件来播音乐;或者还可有其它的单片机芯片使发音器件播放音乐的方式(发音器件可以是喇叭,MCU外接的带喇叭的音乐芯片,别的类型的发音器件等);这些种方式都属于技术常识。本段以上所说的播乐段发音,可以是播乐段的全部发音或是播乐段的部分发音。The sounding device in the pickup can be a loudspeaker (referring to the embodiment of this description, the loudspeaker as the sounding device said in this application document can be done equivalent replacement by the loudspeaker circuit etc., and the loudspeaker circuit refers to the loudspeaker and some simple resistance capacitance etc. components, such as a speaker circuit composed of a speaker, 1K resistor and 8050 to replace the speaker), if necessary, it can also be a music chip with a speaker external to the MCU, or other types of pronunciation devices; this is technical common sense. In claim 2, described in 'and make the aforementioned pronunciation device of the aforementioned pickup play the music pronunciation by the aforementioned single-chip microcomputer chip of the aforementioned pickup according to the code value of the codeword in the codewords of the aforementioned parts read', can be made The MCU uses the decoding result obtained by decoding (such as the spectrum data of the music section or other forms of decoding results) to play the pronunciation of the music section. There is a hardware circuit for playing music in the MCU chip, and the MCU chip is connected to a speaker, and the MCU is obtained according to the decoding. All the decoding results are used to drive the speaker to emit music (in this case the speaker is a sounding device), in this case it is a full-step pronunciation method; it can also be the decoding result obtained by decoding by the MCU (such as the spectral data of the music section or other forms) The decoding result of the music section) to play the pronunciation of the music section. There is a hardware circuit for playing music in the MCU chip, and the MCU chip is connected to a speaker. The MCU drives the speaker to broadcast, and then completes the playback of the music section until the end of decoding (in this case, the speaker is a sounding device). In this case, it is a step-by-step pronunciation method; or the MCU is externally connected to a music chip with a speaker, which can be decoded by the MCU. The decoding result (such as the spectrum data of the music section or other forms of decoding results) is transmitted to the external music chip through the communication line (this music chip can play music according to the decoding result), and the music chip connected to the MCU is transmitted to the external music chip according to the MCU. All the decoding results obtained by its decoding drive the sound of the music section by the speaker connected to the music chip (in this case, the music chip with the speaker connected to the MCU is a sound device), in this case, it is a full-step pronunciation method; or the MCU is connected with a belt For the music chip of the speaker, when the MCU decodes the value of the combined information of the music section, every time a part of the BIT of the decoding result (such as the spectrum data of the music section or other forms of decoding results) is obtained, it is transmitted by the MCU through the communication line. For the external music chip, the music chip externally connected to the MCU drives the external speaker of the music chip to pronounce according to a part of the BIT of the decoding result sent by the MCU. The external music chip with the loudspeaker is a sounding device), this situation is a step-by-step sounding method; or it can play music with the combined sounding device described later in this manual; or other single-chip microcomputer chips can also be used to make the sounding device The way of playing music (the sounding device can be a speaker, a music chip with a speaker connected to the MCU, other types of sounding devices, etc.); these methods are all technical common sense. The above-mentioned pronunciation of the playing music section in this paragraph can be the whole pronunciation of the playing music section or the partial pronunciation of the playing music section.

在权利要求6中提及的‘前述拾音器的前述单片机芯片可以利用前述拾音器的前述读码头来读取前述诸个码字中的部诸码字中的每个码字’,是指,既可以是由单片机芯片作主动控制来读码,比如2006102010785号发明申请中反复阐述的用乙方式读导电码而得到导电码的码值,或比如2006102010785号发明申请中反复阐述的甲方式读导电码,由单片机芯片的IO脚输出扫描电流,电流通过通断式机电读码头流到有声音乐读物的页面上的甲导电码的码盘,电流再流到甲导电码各码线上的触区,再流回读码头,再对从甲导电码各码线触区位置的触区流回的电流有无情况进行检测,检测结果由单片机芯片取得以得到导电码的码值(读码时,是要使通断式机电读码头与有声音乐读物的页面上的甲导电码接触的);也可以是读码头在读码时不需单片机芯片作控制,比如光电式读码头(如光笔)读取有声音乐读物的页面上的条形码(或导电码)并输出读码结果或输出读码信息(读码信息是指比如表示条形码上各条空宽度比情况的波形),由单片机芯片取得读码结果(本段各处提及的读码结果即码字的码值)或由单片机芯片取得读码信息并根据读码信息来得到读码结果(读码时,是要使光电式读码头与有声音乐读物的页面上的条形码或导电码接触或接近的);另还可有别的读码方式。本段以上所述对权利要求6中‘前述拾音器的前述单片机芯片可以利用前述拾音器的前述读码头来读取前述诸个码字中的部诸码字中的每个码字’一句的解释,也可看成是对权利要求2中‘前述部诸码字中的每个码字是通过前述拾音器的读码头被前述拾音器读取的’一句的解释。Mentioned in claim 6 that the aforementioned single-chip microcomputer chip of the aforementioned pickup can utilize the aforementioned reading pier of the aforementioned pickup to read each codeword in some of the codewords in the aforementioned codewords', it means that both can The code is read by the active control of the single-chip microcomputer chip, such as the code value of the conductive code obtained by reading the conductive code repeatedly described in the application for invention No. 2006102010785, or the code value of the conductive code obtained by reading the conductive code repeatedly described in the application for invention No. 2006102010785. The scanning current is output by the IO pin of the single-chip microcomputer chip, and the current flows to the code disc of the A conductive code on the page of the audio music book through the on-off electromechanical reading terminal, and then the current flows to the touch area on each code line of the A conductive code, and then Flow back to the reading terminal, and then detect whether the current flowing back from the contact area of each code line of the first conductive code is detected. The detection result is obtained by the single-chip microcomputer chip to obtain the code value of the conductive code (when reading the code, it is necessary to Make the on-off electromechanical reading dock contact with the first conductive code on the page of the audio music book; it can also be that the reading dock does not need a single-chip microcomputer chip to control when reading the code, such as the photoelectric reading dock (such as a light pen) to read the audio music Read the barcode (or conductive code) on the page of the material and output the code reading result or output the code reading information (the code reading information refers to the waveform indicating the ratio of each space on the barcode, for example), and the code reading result is obtained by the single-chip microcomputer chip (this The code reading result mentioned in the paragraph is the code value of the code word) or the code reading information is obtained by the single-chip microcomputer chip and the code reading result is obtained according to the code reading information (when reading the code, it is necessary to make the photoelectric reading terminal and the audio music reading material The barcode or conductive code on the page is contacted or approached); in addition, there are other ways of reading the code. The interpretation of the sentence of 'the aforementioned single-chip microcomputer chip of the aforementioned pickup can utilize the aforementioned reading pier of the aforementioned pickup to read each of the codewords in the aforementioned codewords' in claim 6 described above in this paragraph, It can also be regarded as an interpretation of the sentence of 'each codeword in the aforementioned codewords is read by the aforementioned pickup by the reading terminal of the aforementioned pickup in claim 2.

本申请文件涉及的各电路举例及实施例的供电电压(VCC)选择范围是2.6V至5V(查本申请文件涉及的单片机芯片型号及其它芯片型号的资料也可得到此工作电压选择范围值)。实际拾音器的供电电压可能可在2.6V至5V的范围以外。The supply voltage (VCC) selection range of each circuit example and embodiment involved in this application document is 2.6V to 5V (check the single-chip microcomputer chip model involved in this application document and the data of other chip models can also get this operating voltage selection range value) . Actual pickups may have supply voltages outside the 2.6V to 5V range.

单片机芯片(MCU)利用读码头(比如光笔)来读取码值,再根据从一个或多个码字的码值得到的拼合信息的值来解码,使发音器件发出音乐声。一般情况下,拾音器还可有按键(比如用按键来调节音量),但按键不是必需的。The single-chip microcomputer chip (MCU) uses the reading terminal (such as a light pen) to read the code value, and then decodes according to the value of the combined information obtained from the code value of one or more code words, so that the pronunciation device emits a musical sound. In general, the pickup can also have buttons (such as using buttons to adjust the volume), but the buttons are not necessary.

在有声音乐读物整页面上的诸个码字中的各码字的值所表示的信息是与拼合信息的值有关的信息,比如码字的值是乐段的拼合信息的值。The information represented by the value of each codeword among the codewords on the entire page of the audio music book is information related to the value of the splicing information, for example, the value of the codeword is the value of the splicing information of the music section.

码字的位置是在有声音乐读物的整页面上的。有声音乐读物的任一个页面上可有一个或多个处于任意不同位置的码字,当然有的页面可没有码字。The position of the codeword is on the full page of the audio music book. There can be one or more codewords in any different positions on any page of the audiobook, and of course some pages can have no codewords.

本申请文件中各处提及的兼容概念是属于计算机领域常规概念,对于乐谱拼合方式,比如2.0版解码程序和解码数据,其能解码编码版本是2.0版的乐段拼合信息的值,1.0版解码程序和解码数据,其能解码编码版本是1.0版的乐段拼合信息的值,如2.0版解码程序和解码数据还能用于解码编码版本是1.0的乐段拼合信息的值,那2.0版的解码程序和解码数据就是兼容1.0版的解码程序和解码数据。再比如解码用的音素数据库的内容包括编码用的音素数据库的内容,那解码用的音素数据库与编码用的音素数据库是兼容的。The compatible concept mentioned in this application document is a conventional concept in the field of computer. For the combination of music scores, such as version 2.0 decoding program and decoding data, it can decode the value of the segment combination information whose encoding version is version 2.0, version 1.0 The decoding program and decoding data, which can decode the value of the segment combination information whose encoding version is version 1.0, such as the version 2.0 decoding program and decoding data can also be used to decode the value of the segment combining information whose encoding version is 1.0, then version 2.0 The decoding program and decoding data are compatible with version 1.0 of the decoding program and decoding data. For another example, the content of the phoneme database for decoding includes the content of the phoneme database for encoding, so the phoneme database for decoding is compatible with the phoneme database for encoding.

关于乐段的定义,本申请文件各处所称乐段与常规的乐段概念有点区别,常规的音乐乐段定义,通常是指采取一致的音乐表现手法,大致相同的节奏及调式,表现同一音乐主题的连续的曲调集合,本申请文件各处涉及定义的乐段,是为了发声和数据组织的方便,对乐段的定义并无特别限制,一般是一个码字对应一个乐段,一个码字对应一个乐段即一个码字的值是一个乐段的拼合信息的值,用拾音器读一下此码字,就可解码把此乐段的发音播放出来;所以一个乐段可以是一整首歌曲,这首歌曲分配一个码字;甚至一本有声音乐读物只有一个码字,整本有声音乐读物的整页面上印刷的乐谱就是一个乐段;也可把将形成的有声音乐读物的一页乐谱当成一个乐段;对于将形成的有声音乐读物(其页面上印刷有五线谱乐谱或简谱乐谱或电子琴谱乐谱或吉他谱乐谱等),可以把页面上印刷的每相邻的几行乐谱,定义成一个乐段,每乐段分配一个码字,甚至把页面上印刷的每一行乐谱定义成一个乐段,每乐段分配一个码字都可,甚至还可把页面上印刷的不相邻的几行乐谱定义成一个乐段,分配一个码字(比如把某页印的第1行和第3行乐谱定义为一个乐段),还可把页面上印刷的某行乐谱的部分,定义成一个乐段,等,显然这些对乐段的定义方式都可由常识推出;本申请文件各处指的乐段,也可以是常规音乐的乐段,对每段常规的音乐乐段分配一个码字。Regarding the definition of a section, the concept of a section referred to in this application document is somewhat different from the conventional concept of a section. The definition of a conventional music section usually refers to adopting a consistent musical expression technique, approximately the same rhythm and mode, and expressing the same music. The continuous melody collection of the theme, the definition of the paragraphs involved in this application document is for the convenience of voice and data organization, there is no special restriction on the definition of the music paragraphs, generally a codeword corresponds to a music section, and a codeword Corresponding to a music section, that is, the value of a codeword is the value of the combined information of a music section. Read the codeword with a pickup, and you can decode and play the pronunciation of the music section; so a music section can be a whole song , this song assigns a code word; Even an audio music book has only one code word, and the music score printed on the whole page of the whole audio music book is exactly a musical section; As a music section; For the audio music book that will be formed (staff score or numbered notation score or electronic piano score score or guitar score score, etc. are printed on its page), every adjacent few lines of music scores printed on the page can be defined as Each music section can be assigned a codeword, or even each line of music printed on the page can be defined as a music section, and each music section can be assigned a codeword, and even several non-adjacent printings on the page can be A line of score is defined as a section, and a code word is assigned (for example, the first line and the third line of music printed on a certain page are defined as a section), and the part of a certain line of score printed on the page can also be defined as a Sections, etc. Obviously, these definitions of sections can be deduced by common sense; the sections referred to in this application document can also be sections of conventional music, and a code word is assigned to each section of conventional music sections.

对于将形成的作为音乐教材的有声音乐读物,各页出现的示范谱(如五线谱),一般只有数行,一般可把这数行当成一个乐段分配一个码字即可。For the audio music book that will be formed as the music teaching material, the demonstration spectrum (as stave) that each page occurs generally only has several rows, and generally these several rows can be regarded as a music section to distribute a code word and get final product.

对于一本将形成的有声音乐读物的印刷内容的不同部分,可以采用不同的乐段划分方式,比如它的整页面上印有20首歌曲的曲谱,那可定义前面10首的曲谱是每首歌作为一个乐段,每乐段分配一个码字,后面10首的曲谱是每页作为一个乐段,每乐段分配一个码字。For different parts of the printed content of an audio music book that will be formed, different music section division methods can be adopted. For example, there are scores of 20 songs printed on its entire page, and the scores of the first 10 songs can be defined as each The song is used as a section, each section is assigned a codeword, and the scores of the next 10 songs are each page as a section, and each section is assigned a codeword.

以上3段所述的有关一个乐段分配一个码字,即是指这一个码字的值是这一个乐段的拼合信息的值,这是显而易见的。Allocating a codeword to a music section described in the above three paragraphs means that the value of this codeword is the value of the combined information of this music section, which is obvious.

一般一个乐段分配一个码字;比如有三行乐谱,可把这三行乐谱当成一个乐段,对这三行乐谱编码生成一个乐段拼合信息的值,然后一个码字里正好是放下这个乐段的拼合信息的值,那此码字的值就是这个乐段的拼合信息的值。Generally, a codeword is assigned to a music section; for example, if there are three lines of music scores, these three lines of music scores can be regarded as a music section, and the value of a music section combination information is generated by encoding the three lines of music scores, and then the music score is just placed in a codeword The value of the combination information of the segment, then the value of the codeword is the value of the combination information of the segment.

把乐段拼合信息的值放在码字里还有一些别的方式,比如把多个乐段拼合信息的值放在一个码字里等;本申请文件所说的一个码字的码值表示的信息是与拼合信息的值有关的信息,其一般可以是:一个码字的值是或含有数个乐段拼合信息的值(或数个乐段拼合信息的值的变换值),或其它形式。本申请文件有说一个码字的码值表示的信息是与拼合信息的值有关的信息,实际上对于这样的码字,其码值的形式可能还有本说明书其它地方没提及的形式,比如对数个乐段拼合信息的值合起来作变换得到变换值,把此变换值放入一个码字里或分开放在多个码字里,这一个或多个码字中每个码字的值所表示的信息仍是与拼合信息的值有关的信息,拾音器可读数个这种码字并按从这数个码字中得到的数个乐段拼合信息的值来播音乐。There are some other ways to put the value of the piece combination information in the codeword, such as putting the values of multiple pieces of piece together information in a codeword, etc.; the code value of a codeword mentioned in this application document represents The information is the information related to the value of the combination information, which generally can be: the value of a codeword is or contains the value of several pieces of combination information (or the transformation value of the value of several pieces of combination information), or other form. This application document states that the information represented by the code value of a code word is information related to the value of the combined information. In fact, for such a code word, the form of the code value may have other forms that are not mentioned elsewhere in this specification. For example, the values of several pieces of music information are combined to transform to obtain a transformation value, and the transformation value is put into one codeword or separated into multiple codewords, and each codeword in the one or more codewords The information represented by the value of is still information relevant to the value of the combination information, and the pickup can read several such code words and play music by the value of the combination information of several sections obtained from these several code words.

显然,如有一个码字的值是或含有多个乐段拼合信息的值这种情况,那乐段的定义仍是无特别限制,比如可把将形成的有声音乐读物的页面上印刷的一整首歌曲曲谱当成一个乐段,也可把印刷的一页乐谱当成一个乐段;对于将形成的有声音乐读物(其页面上印刷有五线谱乐谱或简谱乐谱或电子琴谱乐谱或吉他谱乐谱等),可以把页面上印刷的每相邻的几行乐谱,定义成一个乐段,甚至把页面上印刷的每一行乐谱定义成一个乐段,甚至还可把页面上印刷的不相邻的几行乐谱定义成一个乐段,还可把页面上印刷的一行乐谱的部分定义成一个乐段,还可把页面上印刷的一行乐谱的部分和另一行乐谱定义成一个乐段,等。Apparently, if the value of a codeword is or contains the value of a plurality of musical sections, the definition of that musical section is still not particularly limited, such as a printed page of the audio music book that will be formed. The entire song score is regarded as a paragraph, and a printed sheet of music can also be regarded as a paragraph; for the audio music book to be formed (staff score or numbered score or electronic piano score or guitar score, etc. are printed on its page) , you can define every adjacent line of music printed on the page as a music section, even define each line of music printed on the page as a music section, and even define a few non-adjacent lines printed on the page The music score is defined as a music section, and the part of a line of music scores printed on the page can also be defined as a music section, and the part of a line of music scores printed on the page and another line of music scores can also be defined as a music section, etc.

一本有声音乐读物的所有乐段中的任一乐段,可以在作为有声音乐读物主体的读物的整页面上印刷有此乐段的乐谱,或没任一页面印刷此乐段的乐谱。For any one of all the passages in an audio music book, the score of this passage may be printed on the entire page of the book as the main body of the audio music book, or the score of this passage may not be printed on any page.

对于一将形成的有声音乐读物中定义的乐段,在此有声音乐读物被实际做出来(比如生产出来)后,此乐段就是此被做出来的有声音乐读物的乐段,比如对于一将形成的有声音乐读物,将其某页面上印刷的某一行乐谱定义为一个乐段,那在此有声音乐读物被实际做出来(比如生产出来)后,此乐段(前述的某页面上印刷的前述的某一行乐谱)就是此被做出来的有声音乐读物的乐段。For a section defined in an audio music book to be formed, after the audio music book is actually made (such as produced), this section is the section of the audio music book that is made, such as for a will The formed audio music book defines a certain line of music score printed on a certain page as a music section, then after the audio music book is actually made (such as produced), the music section (printed on the aforementioned certain page Aforesaid certain row music score) is exactly the musical section of this made out audio music book.

本申请文件各处所说的一个拾音器读一个或多个码字后播放音乐,可以是此拾音器读一个或多个码字后并把从这一个或多个码字里得到的所有的乐段拼合信息的值对应的各乐段的发音播出来(一般是指一个或多个乐段的发音),另也可以是按约定(比如码字里有BIT标志)在从这一个或多个码字里得到的所有的乐段拼合信息的值中选择部分乐段拼合信息的值并把这部分乐段拼合信息的值对应的各乐段的发音播出来,比如可能是按约定(比如码字里有BIT标志)在从一个或多个码字里得到的多个乐段拼合信息的值中选择部分乐段的拼合信息的值并对他们各自进行解码并把这部分乐段的发音播出来。Playing music after a pickup reads one or more codewords mentioned throughout the application documents, it can be that the pickup reads one or more codewords and puts together all the sections obtained from the one or more codewords The pronunciation of each music section corresponding to the value of the information is played out (generally referring to the pronunciation of one or more music sections), and it can also be in accordance with the agreement (for example, there is a BIT mark in the code word) from the one or more codes Select the value of part of the combination information from all the values of the combination information obtained in the word and broadcast the pronunciation of each section corresponding to the value of this part of combination information, for example, it may be by agreement (such as code There is BIT mark in the word) in the value of the combination information of selecting part of the music from the value of the combination information of a plurality of music sections obtained in one or more codewords and decode them respectively and broadcast the pronunciation of this part of the music come out.

(本段以下所提到的码字都是在同一本有声音乐读物的整页面上的),如一个拾音器可读取一个码字后播放数个乐段发音(比如一个码字的值里有数个乐段拼合信息的值),且此拾音器也可读取多个码字后播放一个或多个乐段发音(这句话里的两个‘多’不一定是相同数值;拾音器读取多个码字后播放多个乐段发音的例子比如:乐段拼合信息的值1的BIT分成三部分分别放在码字1和码字2和码字3里,乐段拼合信息的值2的BIT也放在码字3里,拾音器读码字1和码字2和码字3后,将这三个码字的值合起来得到乐段拼合信息的值1,从码字3又可得到乐段拼合信息的值2,然后拾音器可解码播放乐段拼合信息的值1对应的乐段发音和乐段拼合信息的值2对应的乐段发音),就可推出此拾音器读取一个或多个码字后有播放某一个乐段发音(还可能有播放别的乐段发音);本申请文件各处所说的一个拾音器读一个或多个码字后播放乐段发音,默认是指播放一个乐段发音,是此拾音器读此一个或多个码字后播放一个或多个乐段发音的下位概念,也是此拾音器读此一个或多个码字后播放音乐的下位概念。(the code words mentioned below in this paragraph are all on the whole page of the same audio music book), as a pick-up can read a code word and then play several paragraph pronunciations (such as the value of a code word with number The value of the combination information of each segment), and the pickup can also read multiple codewords and play one or more segment pronunciations (the two 'multiple' in this sentence are not necessarily the same value; the pickup reads multiple codewords) An example of playing the pronunciation of multiple sections after a codeword For example: the BIT of the value 1 of the section combination information is divided into three parts and placed in the codeword 1, codeword 2 and codeword 3 respectively, and the value of the section combination information is 2 BIT is also placed in codeword 3. After the pickup reads codeword 1, codeword 2 and codeword 3, the values of these three codewords are combined to obtain the value 1 of the segment information, which can be obtained from codeword 3 The value 2 of the segment combination information, then the pickup can decode and play the segment pronunciation corresponding to the value 1 of the segment combination information and the segment pronunciation corresponding to the value 2 of the segment combination information), and the pickup can be released to read one or more After the code words, there is a certain music section pronunciation to be played (there may also be played other music section pronunciations); a pickup mentioned in this application document reads one or more code words and then plays the music section pronunciation, which means playing a music section by default. The pronunciation of the phrase is the subordinate concept of playing the pronunciation of one or more phrases after the pickup reads the one or more codewords, and it is also the subordinate concept of playing music after the pickup reads the one or more codewords.

显然根据乐段的定义方式,那拾音器读一个或多个码字后,对应播放的音乐,可以是多首乐曲或一整首乐曲或一首乐曲的部分,也可以是示范音(比如一段音阶)。Obviously, according to the definition mode of the music section, after the pickup reads one or more code words, the music played correspondingly can be multiple pieces of music or a whole piece of music or a part of a piece of music, or it can be a demonstration tone (such as a section of scale ).

如述及时无特别指明有他义,本说明各处述及的拾音器按从码字得到的数个乐段拼合信息的值来播音乐,默认是指只播这数个乐段拼合信息的值对应的数个乐段的发音。If there is no special meaning when it is mentioned, the pickups mentioned in this description play music according to the value of the combination information of several sections obtained from the codeword, and the default is to only play the value of the combination information of these sections The pronunciation of the corresponding several paragraphs.

如提及时无特别指明有他义且不能从提及处的上下文推出有他义,本说明书各处提及的‘码字’(其前面没加‘诸个’也没加‘部诸’的)默认是指有声音乐读物整页面上诸个码字中的码字。If there is no other meaning specified when mentioning and the other meaning cannot be deduced from the context of the mention, the 'codeword' mentioned throughout this specification (without "all" or "parts" in front of it) ) default refers to the code word in the code words on the entire page of audio music book.

有声音乐读物整页面上诸个码字中的各码字的值所表示的信息是与拼合信息的值有关的信息(比如是一个乐段拼合信息的值),对于单个码字,其码字的值的形式一般是:码字的值是或含有一个乐段拼合信息的值,等。这里所说的‘码字的值是或含有一个乐段拼合信息的值’,其中使用的‘是’词的意思是指:码字的值里除了有这一个乐段拼合信息的值的全部BIT外,没有其它的乐段拼合信息的值涉及的BIT的全部或部分,但码字的值里还可以有或没有其它与播放音乐无关的信息,比如相关的显示信息(比如对应不同码字显示不同形式的动画动作,这要求相应拾音器上有显示器如LCD,并且动画数据在拾音器内),还可以有或没有码字的校验BIT位等;所说的‘码字的值是或含有一个乐段拼合信息的值’中使用的‘含有’词的意思是指:码字的值里除了有这一个乐段拼合信息的值的全部BIT外,还有其它乐段拼合信息的值涉及的BIT(比如其它乐段的拼合信息的值或其它乐段的拼合信息的值的变换值)的全部或部分,另码字的值里还可以有或没有其它与播放音乐无关的信息,比如相关的显示信息,还可以有或没有码字的校验BIT位等;本申请文件各处述的‘一个码字的值是或含有一个乐段拼合信息的值’,‘一个码字的值是或含有数个乐段拼合信息的值’,‘一个码字的值是或含有数个乐段拼合信息的值的变换值的部分’,“一个码字的值是或含有一个乐段拼合信息的值的部分’以及其它类似说法中的‘是’和‘含有’也是本段以上述的含义;至于乐段拼合信息的值等是码字一部分时,与其它信息在码字BIT里的共处格式(如谁前谁后,甚至几种信息交叉如何排列),这由编码者自定,MCU在读码后根据定义的格式内容,将乐段拼合信息的值和其它信息的值从码值里各自取出,分别用于播放音乐和其它处理。The information represented by the value of each codeword in the codewords on the entire page of the audio music book is information related to the value of the combined information (for example, the value of a piece of combined information), and for a single codeword, its codeword The form of the value of is generally: the value of the code word is or contains the value of a musical segment splicing information, etc. Here said 'the value of the codeword is or contains the value of a piece of music piece together information', wherein the meaning of the word 'yes' used refers to: all the values of the codeword except the value of this piece of piece piece together information Except for the BIT, there is no other piece of information that involves all or part of the BIT, but the value of the codeword may or may not have other information that is not related to playing music, such as related display information (such as corresponding to different codewords) Display different forms of animation actions, which requires a display such as LCD on the corresponding pickup, and the animation data is in the pickup), and can also have or not check BIT bits of codewords, etc.; the value of the said 'codeword is or contains The word "contains" used in the value of a segment combination information' means that in addition to all the BITs of the value of this segment combination information in the value of the codeword, there are other values of the segment combination information involved All or part of the BIT (such as the value of the combined information of other sections or the transformation value of the combined information of other sections), and the value of the other codeword may or may not have other information that has nothing to do with playing music, such as Relevant display information can also have or not check BIT bit of codeword etc.; Is or contains the value of several music segment splicing information', 'the value of a codeword is or contains the transformed value part of the value of several musical segment splicing information', "the value of a code word is or contains a musical segment splicing "Part of the value of the information" and "is" and "contains" in other similar sayings also have the above-mentioned meanings in this paragraph; when the value of the music segment combination information is a part of the code word, it is related to other information in the code word BIT. Coexistence format (such as who is in front of who is behind, or even how to arrange several kinds of information crossed), which is determined by the coder. After reading the code, the MCU will combine the value of the segment information and the value of other information from the code value according to the defined format content. Take them out and use them for playing music and other processing.

本申请文件述及的有关一个码字的值是或含有其它形式与拼合信息的值有关的值,涉及的‘是’及‘含有’的意思,也是按上段所述类推理解。(本申请文件各处提及的导电码的值是或含有某值也是按上段所述类推理解)The value of a codeword mentioned in this application document is or contains other forms of values related to the value of the combined information, and the meanings of "yes" and "contains" involved are also understood by analogy as described in the previous paragraph. (The value of the conductive code mentioned in this application document is or contains a certain value, which is also understood by analogy as described in the previous paragraph)

至于在一个码字的值里既有数个乐段拼合信息的值又有数个乐段拼合信息的值的部分这类情况,其码值所表示的信息也是与拼合信息的值有关的信息,拾音器可以读数个码字后根据这样数个码字的值得到各乐段拼合信息的值并分别作相应处理播放音乐。显然放在一个码字的值里的多个乐段拼合信息的值,一般是顺序排列,也可按约定交叉排列。在一个码字里怎么放以及放哪些乐段拼合信息的值,是根据需要来定。As for the value of a code word, there are not only the value of several music section combination information but also the part of the value of several music section combination information. After several codewords can be read, the value of each music segment combination information can be obtained according to the values of such several codewords, and corresponding processing can be performed to play music respectively. Apparently, the values of multiple music segment combination information placed in the value of a codeword are generally arranged in order, and can also be arranged in an alternate manner according to the agreement. How to put it in a codeword and which music sections to put together the value of the information are determined according to the needs.

本申请文件有如此限定:本申请文件各处所说的码字的码值表示的信息是与拼合信息的值有关的信息,是指码字的值是共与数个乐段拼合信息的值有关的(不再与这数个乐段拼合信息的值外的乐段拼合信息的值有关)。对于码字的值是共与数个乐段拼合信息的值有关的,一般是指:码字的值是一个乐段的拼合信息的值或含有一个乐段的拼合信息的值或是多个乐段的拼合信息的值或含有多个乐段的拼合信息的值或是一个乐段的拼合信息的值的部分或含有一个乐段的拼合信息的值的部分或是一个乐段的拼合信息的值的变换值或含有一个乐段的拼合信息的值的变换值或是多个乐段的拼合信息的值的变换值或含有多个乐段的拼合信息的值的变换值或是一个乐段的拼合信息的值的变换值的部分或含有一个乐段的拼合信息的值的变换值的部分或是多个乐段的拼合信息的值的变换值的部分或含有多个乐段的拼合信息的值的变换值的部分,或是一个乐段拼合信息的值的部分的变换值或含有一个乐段拼合信息的值的部分的变换值或是多个乐段拼合信息的值的部分的变换值或含有多个乐段拼合信息的值的部分的变换值或是一个乐段拼合信息的值的部分的变换值的部分或含有一个乐段拼合信息的值的部分的变换值的部分或是多个乐段拼合信息的值的部分的变换值的部分或含有多个乐段拼合信息的值的部分的变换值的部分或是对数个乐段拼合信息的值的部分和数个乐段拼合信息的值进行变换得到的变换值或含有对数个乐段拼合信息的值的部分和数个乐段拼合信息的值进行变换得到的变换值或是对数个乐段拼合信息的值的部分和数个乐段拼合信息的值进行变换得到的变换值的部分或含有对数个乐段拼合信息的值的部分和数个乐段拼合信息的值进行变换得到的变换值的部分;如码字的值是共与数个乐段拼合信息的值有关的,那码字的值可能还有除以上这些形式外的一些其它形式,这些其它形式与以上这些形式实质是相同的即都是涉及怎样把拼合信息的值放到码字里。显然不管码字的值里怎么放数据,相应的用于读码字的拾音器应可从码字的值里得出这些数据(比如知道怎么从乐段拼合信息的值的变换值作逆变换得到乐段拼合信息的值)This application document is limited in this way: the information represented by the code value of the code word mentioned throughout the application document is information related to the value of the combined information, which means that the value of the code word is related to the value of the combined information of several music sections (no longer related to the value of the piece combination information other than the value of these pieces of piece combination information). The value of the code word is related to the value of the combination information of several music sections, generally refers to: the value of the code word is the value of the combination information of a music section or the value of the combination information containing a music section or multiple The value of the patch information of a section or the value of the patch information of multiple sections or the part of the value of the patch information of a section or the part of the value of the patch information of a section or the patch information of a section The transformation value of the value of the value or the transformation value of the value of the combination information containing one section or the transformation value of the value of the combination information of multiple sections or the transformation value of the value of the combination information containing multiple sections or a section Part of the transformation value of the value of the patch information of a section or a part containing the transformation value of the value of the patch information of one section or a part of the transformation value of the value of the patch information of multiple sections or a section containing multiple sections The transformation value part of the value of the information, or the transformation value of the value part of the value of a piece of piecewise information, or the transformation value of the part of the value of the piece of piecemeal information containing a piece of music, or the part of the value of the piecewise information of multiple pieces Transformation value or transformation value of part of value containing multiple phrase splicing information or part of transformed value of part of value of one phrase splicing information or part of transformed value of part of value containing one phrase splicing information or It is a part of the conversion value of the value part of the combination information of multiple sections or a part of the conversion value of the part containing the value of the combination information of a plurality of sections, or a part of the value of the combination information of several sections and several sections The transformation value obtained by transforming the value of the segment combination information, or the transformation value obtained by converting the value of several segment combination information and the value of several segment combination information, or the value of several segment combination information The part of the part and the value of several piece combination information are transformed into the transformed value part or the part containing the part of the value of several pieces of combination information and the value of several pieces of combination information is transformed into the part of the value obtained; If the value of the code word is related to the value of the combined information of several music sections, the value of the code word may have some other forms other than the above forms, and these other forms are essentially the same as the above forms. It involves how to put the value of the combined information into the codeword. Obviously, no matter how to put data in the value of the code word, the corresponding pickup used to read the code word should be able to obtain these data from the value of the code word (for example, knowing how to inversely transform the value of the piece information from the music segment) the value of the segment stitching information)

对于把一个乐段拼合信息的值与其他与发音无关的值比如显示信息的值,一起作变换生成的值,可等同看成是此乐段拼合信息的值的变换值,可对此变换值作逆变换得到此乐段拼合信息的值;对于把多个乐段拼合信息的值与其他与发音无关的值比如显示信息的值,一起作变换生成的值,可等同看成是这多个乐段拼合信息的值的变换值,可对此变换值作逆变换得到这多个乐段拼合信息的值。For the value generated by transforming the value of a piece of combination information and other values that have nothing to do with pronunciation, such as the value of display information, it can be regarded as the transformation value of the value of this piece of combination information, and the value can be transformed Perform an inverse transformation to obtain the value of the combined information of this segment; for the values of the combined information of multiple segments and other values that have nothing to do with pronunciation, such as the value of display information, the value generated by transforming together can be equivalent to these multiple The transformation value of the value of the combination information of the music segment, which can be inversely transformed to obtain the values of the combination information of the plurality of music segments.

显然,如在说一个码字的值是共与数个乐段拼合信息的值有关的的前提下,如再说这里‘数个乐段拼合信息的值’是一个乐段拼合信息的值的话,那这一个码字的值是只与这一个乐段拼合信息的值有关的;显然,如在一个码字的值是共与数个乐段拼合信息的值有关的的前提下,如再说这里‘数个乐段拼合信息的值’是多个乐段拼合信息的值的话,那这一个码字的值是只与这多个乐段拼合信息的值有关的。Obviously, under the premise that the value of a codeword is related to the value of several pieces of information combined together, if the value of the combination information of several pieces here is the value of one piece of combination information, Then the value of this codeword is only related to the value of this piece of music piece combination information; obviously, as in the premise that the value of a codeword is related to the value of several pieces of music piece piece together information, as mentioned here If the 'value of combining information of several music segments' is the value of combining information of multiple music segments, then the value of this code word is only related to the value of combining information of these multiple music segments.

本申请文件各处所说的数个乐段拼合信息的值的变换值,这里的‘数个’是修饰‘乐段拼合信息的值’的,本申请文件各处所说的数个乐段谱数据的变换值,等,也是按此类推理解;本申请文件各处所说的多个乐段拼合信息的值的变换值,这里的‘多个’是修饰‘乐段拼合信息的值’的,本申请文件各处所说的多个乐段谱数据的变换值,等,也是按此类推理解。The conversion value of the value of the several pieces of music information mentioned in this application document, where the 'several' is to modify the 'value of the music section combination information', the number of music score data mentioned in this application document The conversion value, etc., are also understood by analogy; the conversion value of the value of the multiple music segment combination information mentioned in this application document, the "multiple" here is to modify the "value of the music segment combination information", this The conversion values of the multiple musical score data mentioned in various parts of the application documents, etc., are also understood by analogy.

对乐段拼合信息的值进行解码播音乐时,是用音乐合成方式来播音乐的,拾音器的MCU或者MCU外接的播音乐的芯片需具备用音乐合成方式播音乐的能力,并且解码时可由MCU把解码结果转成乐段的谱数据,并由拾音器的MCU或者MCU外带的播音乐的芯片根据乐段的谱数据用音乐合成方式来播音乐,至于解码时由MCU把解码结果转成其它形式的数据,并由拾音器MCU或者其外带的播音乐的芯片根据其它形式的数据用音乐合成方式来播音乐,那这里其它形式的数据与乐段的谱数据的实质是相同的;拾音器的MCU或者MCU外带的播音乐的芯片根据乐段的谱数据播放音乐,类似于播放MIDI音乐,下面描述一下用乐段的谱数据播音乐的原理When decoding the value of the segment combination information and playing music, the music is played by music synthesis. The MCU of the pickup or the music playback chip connected to the MCU must have the ability to play music by music synthesis, and the decoding can be performed by the MCU. Convert the decoding result into the spectrum data of the music section, and use the MCU of the pickup or the music playing chip outside the MCU to play the music in the way of music synthesis according to the spectrum data of the music section. As for the decoding, the MCU converts the decoding result into other form of data, and play music by the pickup MCU or its external music playing chip according to other forms of data in a music synthesis mode, then the essence of the other forms of data and the spectral data of the music section is the same here; The MCU or the music playing chip attached to the MCU plays music according to the spectrum data of the section, which is similar to playing MIDI music. The following describes the principle of playing music with the spectrum data of the section

本段以下主要描述的是MCU根据谱数据播放音乐的方式:音乐可以用记谱来代替录音,比如最简单的单声道的一乐段音乐含DO,MI,SO三个音,可顺序记录1,2,3三个数字作为谱数据,播放时就可由拾音器从谱数据取得1,2,3三个数字并按约定来顺序播出DO,MI,SO三个音调,当然这里说的是最简单的情况,实际的谱数据的形式要更复杂,而且一般是多声道的;按谱数据来播放音乐要求拾音器里的MCU有音乐合成的功能,(音乐合成技术是公知成熟技术,这里不多述),谱数据里既记录音高信息,也记录节奏信息(必要时还可能记录音色信息,但这不是必需的),而且一般是记录多声道的谱,可由拾音器里的MCU按谱数据播放出音乐声,谱数据的格式类似于MIDI数据格式(并且可把MIDI数据格式称为谱数据格式的一种),但依不同型号的MCU或不同应用场合,谱数据的具体格式定义及声道数有别;可把与拾音器有关的存储器集里存储的由MCU使用来按此方式根据乐段的谱数据播放音乐的程序叫做谱数据播放程序。这里要求MCU有根据谱数据播放音乐的功能(一般是音乐合成功能),MCU根据谱数据播放音乐,是指MCU按谱数据各字节的内容(一般是把谱数据一个一个字节顺序取出),按音乐合成的方式播放音乐,此属技术常识;另也可用MCU外接的带喇叭的音乐芯片来按谱数据播放音乐,其播放的原理与本段前面述的用MCU播放的原理是类似的,只是需由MCU把乐段的谱数据通过通讯线传给MCU外接的音乐芯片以用于音乐芯片播放音乐。另外,在有些乐谱中,可能会出现力度,滑音等特殊记号,不过一般有根据谱数据播放音乐功能的单片机和音乐芯片不具备处理这些特殊记号的功能,所以谱数据里一般也不记录这些记号,只记录主要乐调的数据信息,播放时只把主要的乐调播出;如拾音器的用于播放音乐的主MCU或用于播放音乐的音乐芯片有处理这些记号的功能,那可以在解码生成乐段的谱数据时,在乐段的谱数据里记录这些记号,播放时由单片机芯片或音乐芯片处理。可把乐段的拼合信息的值放在码字里,拾音器读一个或多个码字后,直接按这一个或多个码字里的数个乐段拼合信息的值各自解码得到乐段谱数据并用得到的乐段谱数据播音乐(拾音器应有本段说的按谱数据播音乐的功能,比如是拾音器的MCU或MCU外接的音乐芯片有本段说的按谱数据播音乐的功能);以上主要是以解码时是按整步发音方式来说的,如按分步发音方式,每步解码得到乐段谱数据的部分(也是解码结果的部分)且可由MCU按乐段谱数据的部分来发音或由MCU把乐段谱数据的部分传给MCU外接的音乐芯片来发音。(本段各处提到的‘这些乐段’就是指本段第2行提到的‘一些乐段’)The following section mainly describes the way the MCU plays music according to the spectrum data: music can be recorded by notation, for example, the simplest monophonic piece of music contains DO, MI, SO three tones, which can be recorded sequentially The three numbers 1, 2, and 3 are used as spectral data. When playing, the pickup can obtain the three numbers 1, 2, and 3 from the spectral data and play the three tones of DO, MI, and SO in order according to the agreement. Of course, what is said here is In the simplest case, the form of the actual spectral data is more complicated, and generally multi-channel; playing music according to the spectral data requires the MCU in the pickup to have the function of music synthesis, (music synthesis technology is a well-known mature technology, here Not to mention), the spectrum data not only records pitch information, but also records rhythm information (if necessary, it may also record timbre information, but this is not necessary), and generally records multi-channel spectrum, which can be pressed by the MCU in the pickup. Spectrum data plays music sound. The format of spectral data is similar to MIDI data format (and MIDI data format can be called a kind of spectral data format), but the specific format of spectral data is defined according to different models of MCU or different applications. And the number of channels is different; the program that can be stored in the memory set related to the pickup and used by the MCU to play music according to the spectrum data of the music section in this way is called the spectrum data playback program. Here, the MCU is required to have the function of playing music according to the spectrum data (usually music synthesis function), and the MCU plays music according to the spectrum data, which means that the MCU uses the content of each byte of the spectrum data (generally, the spectrum data is taken out one by one byte order) , to play music in the way of music synthesis, which is technical common sense; in addition, the music chip with speaker connected to the MCU can also be used to play music according to the spectrum data. The principle of playing music is similar to the principle of playing with MCU mentioned above in this paragraph , it is only necessary for the MCU to transmit the spectrum data of the music section to the music chip external to the MCU through the communication line for the music chip to play music. In addition, in some music scores, there may be special marks such as dynamics and portamento, but generally the single-chip microcomputer and music chip with the function of playing music according to the spectrum data do not have the function to process these special marks, so these marks are generally not recorded in the spectrum data , only record the data information of the main tune, and only broadcast the main tune during playback; for example, the main MCU of the pickup for playing music or the music chip for playing music has the function of processing these marks, which can be decoded When generating the spectrum data of the music section, these marks are recorded in the spectrum data of the music section, and are processed by a single-chip microcomputer chip or a music chip during playback. The value of the combination information of the music section can be placed in the codeword. After the pickup reads one or more codewords, it can be decoded directly according to the values of the combination information of several pieces of music in the one or more codewords to obtain the music section spectrum The data is combined with the obtained music spectrum data to play music (the pickup should have the function of playing music according to the spectrum data mentioned in this paragraph, for example, the MCU of the pickup or the music chip connected to the MCU has the function of playing music according to the spectrum data mentioned in this paragraph) The above mainly refers to the whole-step pronunciation mode during decoding, such as the step-by-step pronunciation mode, each step of decoding obtains the part of the music spectrum data (also the part of the decoding result) and can be used by the MCU according to the music spectrum data. part to pronounce or the MCU sends part of the music spectrum data to the music chip connected to the MCU to pronounce. ('These passages' mentioned in this paragraph refer to the 'some passages' mentioned in the second line of this paragraph)

下面是对乐谱拼合方式的描述(从下一段起至本说明书第99页止):The following is a description of how the score is put together (from the next paragraph to page 99 of this manual):

乐段拼合信息是指一种信息,一般可用二进制值表示,其表示的是如何用音素来拼合乐段发音的信息;乐段拼合信息的值由编码形成,并可被用于解码来拼合乐段发音;乐段拼合信息的值的格式,可以是本说明书反复提到的连续二进制格式或其它格式。本申请文件的说明书和权利要求书中各处提及的‘拼合信息的值’和‘乐段拼合信息的值’和‘乐段的拼合信息的值’和‘表示乐段发音的拼合信息的值’和‘表示如何用音素来拼合乐段发音的拼合信息的值’是相同说法。The segment combination information refers to a kind of information, which can generally be represented by a binary value, which indicates how to use phonemes to combine the sound of the segment; the value of the segment combination information is formed by encoding, and can be used for decoding to combine music The pronunciation of the segment; the format of the value of the combination information of the segment can be the continuous binary format mentioned repeatedly in this manual or other formats. The "value of the combination information" and "the value of the combination information of the music section" and "the value of the combination information of the music section" and "the value of the combination information indicating the pronunciation of the music section" mentioned in the specification and claims of this application document The value of' is the same as the value of the combination information indicating how to use phonemes to combine the pronunciation of the passage'.

每次是用编码规则对一个乐段的乐谱进行编码生成一个乐段拼合信息的值的,也即每次是用编码规则对一个乐段进行编码生成一个乐段拼合信息的值的;以下乐谱拼合的技术阐述,每次是以对一个乐段拼合信息的值进行解码为例的。Each time a music score of a music section is coded with encoding rules to generate a value of music piece joining information, that is, each time a music piece is encoded with coding rules to generate a value of music piece joining information; the following score The technical description of stitching is an example of decoding the value of stitching information of a music segment each time.

是按照编码规则来对一个乐段的乐谱进行编码生成一个乐段拼合信息的值的,是按解码规则来对乐段拼合信息的值进行解码以播乐段发音的。The music score of a music section is encoded according to the coding rules to generate the value of the music piece combination information, and the value of the music piece combination information is decoded according to the decoding rules to broadcast the pronunciation of the music piece.

本申请文件所说的一个表示如何用音素来拼合乐段发音的拼合信息的值,实际是指一个表示如何用多个音素来拼合乐段发音的拼合信息的值,这多个音素中的任一个音素都可以从这拼合信息的值中直接得到(比如可从这拼合信息的值中直接取出其音素编号),且这多个音素也就是用于拼合乐段发音的音素。The value of the combined information that represents how to use phonemes to combine the pronunciation of the passage mentioned in this application document actually refers to a value that indicates how to use multiple phonemes to combine the combined information of the pronunciation of the passage. Any of the multiple phonemes A phoneme can be directly obtained from the value of the combined information (for example, its phoneme number can be directly obtained from the value of the combined information), and these multiple phonemes are the phonemes used for the pronunciation of the combined segment.

音素是信息的集合。用于拼合的音素里必须包含一个或多个音的信息,音素的类型一般可以是以下几种(用于拼合的音素还可以是其它类型的,当然这些其它类型的音素里也要包含一个或多个音的信息):A phoneme is a collection of information. The phonemes used for merging must contain the information of one or more sounds, and the types of phonemes can generally be the following types (the phonemes used for merging can also be of other types, of course, these other types of phonemes should also contain one or more phonemes) information for multiple tones):

1型:一个音素里包括一个音的音长值不包括这个音的音高值也不包含别的音的信息,或者这音素里包括这个音的音长值和这个音的音高值但不包含别的音的信息,总之,这音素里包含一个音的信息且不包含别的音的信息,Type 1: A phoneme includes the length value of a sound but does not include the pitch value of this sound and does not contain information about other sounds, or the phoneme includes the length value of this sound and the pitch value of this sound but not Contains information of other sounds, in short, this phoneme contains information of one sound and does not contain information of other sounds,

2型:一个音素里包括多个音(这多个音是同时发音的)各自的音长值,还包括这多个音各音互相之间的音程关系数据,但不包含别的音的信息,或者这音素里包括这多个音各自的音长值,还包括这多个音各自的音高值,但不包含别的音的信息,总之,这音素里包含同时发音的多个音的信息且不包含别的音的信息,Type 2: A phoneme includes the respective sound length values of multiple sounds (these multiple sounds are pronounced at the same time), and also includes the interval relationship data between the multiple sounds, but does not contain information about other sounds , or this phoneme includes the respective pitch values of the multiple tones and the respective pitch values of the multiple tones, but does not contain information about other tones. information and does not contain information about other sounds,

3型:一个音素里包括多个音(这多个音中有不同时发音的音)各自的发音时刻值,这多个音各自的音长值,这多个音各音互相之间的音程关系数据,但不包含别的音的信息,或者这音素里包括这多个音各自的发音时刻值,这多个音各自的音长值,这多个音各自的音高值,但不包含别的音的信息;总之,这音素里包含多个音的信息且不包含别的音的信息,且这多个音中有不同时发音的音,Type 3: A phoneme includes multiple sounds (there are sounds that are pronounced at different times in the multiple sounds), the respective pronunciation time values of the multiple sounds, the respective sound length values of the multiple sounds, and the intervals between the multiple sounds Relational data, but does not contain information about other sounds, or this phoneme includes the pronunciation time values of the multiple sounds, the sound length values of the multiple sounds, and the pitch values of the multiple sounds, but does not contain Information about other sounds; in short, this phoneme contains information about multiple sounds and does not contain information about other sounds, and there are sounds that are not pronounced at the same time in these multiple sounds,

以上几段里‘别的音的信息’的说法中‘别的’是修饰‘音’的;除非提及时有特别指明有他意,本说明书以下各处提及的音素默认是指以上这3种形式的音素。不过以上3种音素中任一种,其包含的信息形式可有一些等同替换形式,比如3型音素,一个音素里包括多个音各自的发音时刻值,这多个音各自的音长值,这多个音各自的音高值,那其等同替换形式可以是:一个音素里包括多个音各自的发音时刻值,这多个音各自的音长值,这多个音各音互相之间的音程关系数据,这多个音中任一音在五线谱上的上下位置值2;除非提及时有特别指明有他意,本说明书各处提及的音素默认不涉及这些等同替换形式。In the phrase "information about other sounds" in the above paragraphs, "other" is used to modify "sound"; unless otherwise specified, the phonemes mentioned in the following parts of this manual refer to the above three by default. form phonemes. However, in any of the above three phonemes, the information form contained in it may have some equivalent replacement forms, such as type 3 phonemes, where a phoneme includes the respective pronunciation time values of multiple sounds, the respective sound length values of these multiple sounds, The respective pitch values of these multiple tones, then its equivalent replacement form can be: a phoneme includes the respective pronunciation time values of multiple tones, the respective sound length values of these multiple tones, and the mutual relationship between each of these multiple tones The interval relationship data of any one of the multiple tones on the stave has a value of 2; unless otherwise specified when mentioned, the phonemes mentioned in this manual do not involve these equivalent replacement forms by default.

如把图16的音组合定义成音素,这音素中的两个音是在不同时刻发音的。If the combination of sounds in Fig. 16 is defined as a phoneme, two sounds in this phoneme are pronounced at different times.

一般可由音素里的信息看出这个音素里包含究竟多少个音的信息(比如音素里只包含两个音的音长值,那一般就可按约定推出这音素里只包含2个音的信息),另也可采用等同形式,在音素里放一些BIT来表示这个音素里包含究竟多少个音的信息,比如用二进制的3表示音素里包含3个音的信息;如提及时无特别指明,本说明书各处提及的任一音素,默认是以在这个音素里不用放BIT来表示这个音素里包含究竟多少个音的信息而是可由这个音素里的信息看出这个音素里包含究竟多少个音的信息为例的。Generally, information about how many tones are contained in the phoneme can be seen from the information in the phoneme (for example, if the phoneme only contains the length value of two tones, then the information of only two tones in the phoneme can generally be deduced according to the agreement) , and the equivalent form can also be used, and some BITs are placed in the phoneme to indicate how many tones are contained in the phoneme. For example, binary 3 is used to indicate that the phoneme contains 3 tones; For any phoneme mentioned in the manual, the default is not to put BIT in this phoneme to indicate how many tones are contained in this phoneme, but how many tones are contained in this phoneme can be seen from the information in this phoneme information as an example.

音素里包含数个音的信息,音素里可以不包括这数个音中任一音的音高值,也可包括这数个音中每个音的音高值或部分音各自的音高值,本说明书以下编码规则举例和解码规则举例里涉及定义的音素里是不包括任一音的音高值的,后面编码举例和解码举例涉及的可从拼合信息的值里直接得到的音素也是这样的(可从拼合信息的值里直接得到的音素里是不包括任一音的音高值的)。The phoneme contains the information of several tones, and the phoneme may not include the pitch value of any one of the several tones, or may include the pitch value of each of the several tones or the respective pitch values of some of the tones , the phonemes defined in the following encoding rule examples and decoding rule examples in this manual do not include the pitch value of any tone, and the same is true for the phonemes that can be directly obtained from the value of the combined information involved in the following encoding examples and decoding examples. (the phonemes that can be directly obtained from the value of the combined information do not include the pitch value of any tone).

音素里包含的信息情况可以是多种多样的,音素的形式也可以是多种多样的;音素里可以只包含数个音的信息(除此之外音素里无其它附加信息),以上1型,2型,3型音素都可以是这种情况;音素里也可以既包含数个音的信息也包含其它附加信息(其它附加信息比如是节奏信息,调式,谱号信息,此音素的音色信息及音强信息,拼合涉及的操作信息,等中的一种或多种),以上1型,2型,3型音素都可以是这种情况;本说明书后面编码规则举例和解码规则举例和编码举例和解码举例里涉及的音素,默认都是以音素里只包含数个音的信息而无其它附加信息为例的且涉及的音素都是以音素就是音组合为例的。The information contained in the phoneme can be various, and the form of the phoneme can also be various; the phoneme can only contain the information of several sounds (other than that, there is no other additional information in the phoneme), the above type 1 , Type 2, and Type 3 phonemes can be the case; the phoneme can also contain both the information of several sounds and other additional information (other additional information such as rhythm information, mode, clef information, timbre information of this phoneme and sound intensity information, operation information involved in stitching together, etc., one or more), the above type 1, type 2, and type 3 phonemes can be this case; the following coding rules and decoding rules are examples and coding The phonemes involved in the example and the decoding example are all based on the example that the phoneme only contains information of several tones without other additional information, and the phonemes involved are all based on the example that the phoneme is a combination of sounds.

用音素来拼合乐段发音时,可以是全部用音素,也可以是用到多个音素和一些其它附加信息(其它附加信息比如是被编码乐段的节奏,调式,谱号信息,某音素的音色信息,音强信息,拼合涉及的操作信息,等),如提及时无特别指明有他意,本说明书以下编码举例和解码举例里各处提及的用音素来拼合乐段发音,默认是用到多个音素和一些其它附加信息(一些其它附加信息比如是被编码乐段的节奏信息)。When phonemes are used to combine the pronunciation of a section, it can be all phonemes, or multiple phonemes and some other additional information (other additional information such as the rhythm of the coded section, mode, clef information, the number of a certain phoneme) Timbre information, sound intensity information, operation information involved in stitching, etc.), if there is no special meaning when it is mentioned, the following encoding examples and decoding examples in this manual mention the use of phonemes to stitch the pronunciation of segments, and the default is to use To a plurality of phonemes and some other additional information (some other additional information such as the rhythm information of the encoded music section).

本说明书各处所说的把一含一个音的音组合定义为一个音素(也可说成这个音素就是这一个音,或可说成这个音素就是这一个音组合),实际是指这个音素包含这个音的信息,具体的说,这个音素里只包含这个音的音长值(音素里无其它信息),或者是这个音素里只包含这个音的音长值和音高值(音素里无其它信息);本说明书各处所说的把一含多个音的音组合定义为一个音素(也可说成这个音素就是这一个音组合),实际是指这个音素只包含这音组合里的多个音(这音组合里只包含这多个音)各自的发音时刻值以及这多个音各自的音长值以及这多个音各音互相之间的音程关系数据(音素里无其它信息),或者是这个音素只包含这音组合里的多个音各自的发音时刻值以及这多个音各自的音长值以及这多个音各自的音高值(音素里无其它信息)。The definition of a sound combination containing one sound as a phoneme (also can be said that this phoneme is this sound, or it can be said that this phoneme is this sound combination) said everywhere in this manual actually means that this phoneme contains this Tone information, specifically, this phoneme only contains the sound length value of this sound (there is no other information in the phoneme), or this phoneme only contains the sound length value and pitch value of this sound (there is no other information in the phoneme) ; This instruction manual is defined as a phoneme (it can also be said that this phoneme is exactly this tone combination) that a sound combination containing a plurality of sounds is said everywhere in this manual, actually means that this phoneme only includes a plurality of sounds in this tone combination ( This tone combination only includes the respective pronunciation time values of these multiple tones) and the respective sound length values of these multiple tones and the interval relationship data between each of these multiple tones (there is no other information in the phoneme), or This phoneme only includes the pronunciation time values of the multiple sounds in the sound combination and the respective sound length values of the multiple sounds and the respective pitch values of the multiple sounds (there is no other information in the phoneme).

本申请文件各处提到的音组合,其包含的各音是有确定的实际的音高值的,但如把音组合定义成音素,那音素里可能有包含这音组合里全部或部分音各自的音高值,也可能没有包含这音组合里任一音的音高值。一个音组合里可以是包含有一个或多个音。The sound combination mentioned in this application document has a definite actual pitch value for each sound contained in it, but if the sound combination is defined as a phoneme, the phoneme may contain all or part of the sound combination. The respective pitch values may not include the pitch value of any tone in this tone combination. A sound combination can contain one or more sounds.

在编码规则和解码规则里一般要对用于拼合的音素赋予音素编号,在用编码规则来编码生成拼合信息的值时,一般是用音素编号来表示音素的,一般是用唯一的音素编号来表示一个音素的。甚至在编码规则里可对同一音素赋予不同的音素编号,在编码生成拼合信息的值时,根据不同应用需要或不同应用场合,可用不同的音素编号来表示同一音素,除非有特别说明,本说明书各处是以编码规则里对各音素都是赋予唯一的音素编号为例的(当然不同音素的音素编号不同),同样,在解码规则里可对同一音素赋予不同的音素编号,在对拼合信息的值解码时,在不同应用需要时或不同应用场合,可能在拼合信息的值中遇到用不同的音素编号来表示同一音素的情况,可相应的处理解码,除非有特别说明,本说明书各处是以解码规则里对各音素都是赋予唯一的音素编号为例的(当然不同音素的音素编号不同)。In the encoding rules and decoding rules, the phonemes used for splicing are generally given phoneme numbers. When encoding rules are used to encode the value of the splicing information, the phoneme numbers are generally used to represent the phonemes, and generally unique phoneme numbers are used to represent represents a phoneme. Even in the encoding rules, different phoneme numbers can be assigned to the same phoneme. When encoding the value of the combined information, according to different application needs or different application occasions, different phoneme numbers can be used to represent the same phoneme. Unless otherwise specified, this manual Everywhere is an example of assigning unique phoneme numbers to each phoneme in the encoding rules (of course, different phoneme numbers are different). Similarly, different phoneme numbers can be assigned to the same phoneme in the decoding rules. When decoding the value of , when required by different applications or in different application occasions, it may be encountered that different phoneme numbers are used to represent the same phoneme in the value of the combined information, and the decoding can be handled accordingly. Unless otherwise specified, each part of this manual The place is an example where a unique phoneme number is assigned to each phoneme in the decoding rule (of course, different phonemes have different phoneme numbers).

从本说明书第20页以‘乐段拼合信息是指一种信息,’开头的那段到上一段所提到的音素,是泛指编码规则里定义的音素,解码规则里定义的音素,从拼合信息的值里可直接得到的音素,放在拼合信息的值里相当于是音素的那些信息集合。The phonemes mentioned in the paragraph starting with "Music segment information refers to a kind of information" on page 20 of this manual to the previous paragraph generally refer to the phonemes defined in the encoding rules and the phonemes defined in the decoding rules, from The phonemes that can be directly obtained in the value of the combined information are equivalent to those information sets of the phonemes in the value of the combined information.

一个音素里的发音时刻值,音长值一般都是用拍来计量,并且如一个音素里既有发音时刻值也有音长值,且计量发音时刻值和计量音长值都是用拍来计量,那计量发音时刻值的拍的标准与计量音长值的拍的标准一般相同,一个类音素里的发音时刻值,音长值一般都是用拍来计量,并且如一个类音素里既有发音时刻值也有音长值,且计量发音时刻值和计量音长值都是用拍来计量,那计量发音时刻值的拍的标准与计量音长值的拍的标准一般相同。(拍的标准是指把几分音符当成一拍,比如把4分音符当成一拍)。The pronunciation time value and sound length value in a phoneme are generally measured in beats, and for example, a phoneme has both pronunciation time value and sound length value, and both the pronunciation time value and the sound length value are measured in beats , the beat standard for measuring the pronunciation time value is generally the same as the beat standard for measuring the sound length value. The pronunciation time value and the sound length value in a phoneme are generally measured by beats, and if a phoneme has both The pronunciation time value also has a sound length value, and both the measurement pronunciation time value and the measurement sound length value are measured by beats, and the beat standard for measuring the pronunciation time value is generally the same as the beat standard for measuring the sound length value. (The standard of beat refers to taking a fraction of a note as a beat, such as taking a quarter note as a beat).

但在不同地方,拍的标准可能不同,并且一个标准的拍可转换成另一个标准的拍,一般可把拍的定义分成这几类:抽象拍,实际拍,中间拍,抽象拍是指在编码规则和解码规则里定义的音素里以及拼合信息的值里相当于定义的音素里,涉及定义的拍,实际拍是指被编码乐段实际用的拍(一般被编码乐段只用一个实际拍的标准),中间拍是在解码过程中作为过渡的拍,一般可把抽象拍的标准转成中间拍的标准,作些处理,再把中间拍的标准转成实际拍的标准。But in different places, the shooting standards may be different, and one standard shooting can be converted into another standard shooting. Generally, the definition of shooting can be divided into these categories: abstract shooting, actual shooting, intermediate shooting, abstract shooting refers to The phonemes defined in the encoding rules and decoding rules and the value of the combined information are equivalent to the defined phonemes, involving the defined beats, and the actual beat refers to the actual beat used by the encoded segment (generally, only one actual beat is used for the encoded segment) The standard of shooting), the intermediate shooting is used as a transitional shooting in the decoding process. Generally, the standard of the abstract shooting can be converted into the standard of the intermediate shooting, and after some processing, the standard of the intermediate shooting can be converted into the standard of the actual shooting.

解码时,可能涉及到拍的标准的转换,而如音素里的抽象拍的标准与乐段的实际拍的标准都相同(比如都把4分音符当成一拍),那一般不需转换,而如音素里的抽象拍的标准与乐段的实际拍的标准不相同(比如一个把4分音符当成一拍,一个把2分音符当成一拍),那一般需把抽象拍的标准转换成实际拍的标准(也可考虑把乐段的实际拍的标准转成抽象拍的标准,比如拼合乐段发音的各音素都用这个抽象拍的标准,那可把此乐段拼合信息的值里比如编码头里放的节奏信息里的实际拍的标准改了,比如要把原来按实际拍的标准计量1分钟多少拍改成现在按抽象拍的标准计量一分钟多少拍),并且从一个拼合信息的值里直接得到的不同音素里的抽象拍的标准也可能不同,一般都需将这些抽象拍的标准分别转换成实际拍的标准,必要时还可能把抽象拍的标准转成中间拍的标准,作些处理,再把中间拍的标准转成实际拍的标准,涉及拍的标准的转换的情况还可能有其它情况。When decoding, it may involve the conversion of the beat standard, and if the standard of the abstract beat in the phoneme is the same as the actual beat standard of the music section (for example, the quarter note is regarded as a beat), then generally there is no need to convert, but If the standard of the abstract beat in the phoneme is different from the standard of the actual beat of the passage (for example, one regards a 4th note as a beat, and the other regards a 2nd note as a beat), then it is generally necessary to convert the standard of the abstract beat into an actual beat. The standard of beat (you can also consider converting the actual beat standard of the passage into the standard of abstract beat. For example, each phoneme that is pronounced in the merged passage uses this abstract beat standard, then you can put the value of the merged information of this passage into the value such as The actual shooting standard in the rhythm information in the encoding head has been changed, for example, the original measurement of the number of shots per minute according to the standard of actual shooting should be changed to the current measurement of the number of shots per minute according to the standard of abstract shooting), and from a combined information The standard of the abstract beats in different phonemes directly obtained from the value of the value may also be different. Generally, it is necessary to convert these abstract beat standards into actual beat standards, and if necessary, convert the abstract beat standards into intermediate beat standards. , do some processing, and then convert the intermediate shooting standard into the actual shooting standard. There may be other situations involving the conversion of the shooting standard.

不过在解码过程中涉及到不同标准的拍时,也可不作拍的标准的转换,只要解码时能识别及处理即可。如一个未定位或定位的音素里涉及的拍的标准被转成别的拍的标准,那此未定位或定位的音素里的发音时刻值(如有发音时刻值的话)和音长值(如有音长值的话)显然一般要跟着转换,如一个类音素里涉及的拍的标准被转成别的拍的标准,那这个类音素里的发音时刻值(如有发音时刻值的话)和音长值(如有音长值的话)显然一般要跟着转换。涉及转换的情况还可能有其它情况。However, when different standards of beats are involved in the decoding process, the standard conversion of the beats may not be performed, as long as they can be identified and processed during decoding. If the beat standard involved in an unlocated or positioned phoneme is converted to another beat standard, then the pronunciation time value (if there is a pronunciation time value) and the sound length value (if any) in the unlocated or positioned phoneme If the sound length value) is obviously generally followed by conversion, such as the beat standard involved in a phoneme is converted to another beat standard, then the pronunciation time value (if there is a pronunciation time value) and the sound length value in this phoneme (If there is a sound length value), it is obvious that it generally needs to be converted. There may be other cases involving transitions.

本说明书将解码过程中不用转换拍的标准,而且解码过程中涉及的抽象拍与实际拍都是相同的这样的情况称为转换情况1(本说明书后面解码举例1和解码举例2都是这样,被编码乐段所用的实际拍是以4分音符为一拍,所用的音素里的抽象拍也是以4分音符为一拍),本说明书将解码过程中不用转换拍的标准,而且解码过程中涉及的抽象拍与实际拍有不相同的这样的情况称为转换情况4。In this manual, there is no need to convert the beat standard in the decoding process, and the abstract beat and the actual beat involved in the decoding process are the same as the conversion case 1 (this is the case for decoding example 1 and decoding example 2 later in this manual, The actual beat used by the coded passage is a quarter note, and the abstract beat in the phonemes used is also a quarter note), this manual will not use the conversion standard during the decoding process, and Cases involving abstract beats that differ from real beats are referred to as transition cases 4 .

本说明书各处说到的音长值,一般可用拍来计量,比如一个音的音长可以是1拍,2分之1拍,4分之3拍,8分之1拍等。本说明书后面编码举例,解码举例里涉及的音长值,是默认用拍来计量的,比如一个音的音长可以是1拍,2拍,8分之1拍等。The sound length values mentioned in this manual can generally be measured in beats. For example, the sound length of a sound can be 1 beat, 1/2 beat, 3/4 beat, 1/8 beat, etc. The coding examples at the end of this manual, and the sound length values involved in the decoding examples are measured in beats by default. For example, the sound length of a sound can be 1 beat, 2 beats, 1/8th of a beat, etc.

在编码规则和解码规则里定义的音素以及在拼合信息的值里相当于是音素的那些信息集合里涉及的音长,一般是按约定定义的拍来计量,比如在编码规则里有定义一个拍的标准(比如把4分音符定义成一拍),此编码规则里定义的各音素里的音长值都用这个拍(也即抽象拍)来计量,与此编码规则专门对应的解码规则里定义的拍的标准与编码规则里定义的拍的标准相同,此解码规则里定义的各音素里的音长值都用这个拍(也即抽象拍)来计量,按此解码规则来对用此编码规则编码生成的拼合信息的值进行解码时,再根据被编码乐段实际用的实际拍的定义(指把几分音符当成一拍),可把拼合信息的值里的音素里的按抽象拍来计量的音长值以及解码得到的类音素里的按抽象拍来计量的音长值(类音素一般是对音素进行处理得到的,所以音素里的按抽象拍来计量的音长值可能会被照搬放到类音素里),换算成按实际拍来计量的音长值,比如抽象拍是把4分音符当成一拍,实际拍是把2分音符当成一拍,如按抽象拍来计量的音长值是1(1拍),那按实际拍来计量这个音长值就是0.5(0.5拍),当然实际中怎么来定义使用音长值及怎么转换音长值可有很多形式,可根据需要选择用什么形式。本说明书将本段所说情况称为转换情况2。The phonemes defined in the coding rules and decoding rules and the sound lengths involved in the information sets that are equivalent to phonemes in the value of the combined information are generally measured by the beat defined by agreement, for example, there is a beat defined in the coding rules standard (such as defining a quarter note as a beat), the sound length value of each phoneme defined in this encoding rule is measured by this beat (that is, an abstract beat), and defined in the decoding rule specifically corresponding to this encoding rule The beat standard is the same as the beat standard defined in the encoding rule. The sound length value in each phoneme defined in the decoding rule is measured by this beat (that is, the abstract beat), and the encoding rule is used according to this decoding rule When the value of the mosaic information generated by encoding is decoded, according to the definition of the actual beat actually used by the coded section (referring to treat a fraction of a note as a beat), the phoneme in the value of the mosaic information can be represented as an abstract beat. The measured sound length value and the sound length value measured by the abstract beat in the decoded phoneme (the phoneme is generally obtained by processing the phoneme, so the sound length value measured by the abstract beat in the phoneme may be Copy it into the quasi-phoneme), and convert it into the sound length value measured by the actual beat. For example, the abstract beat is to regard the 4th note as a beat, and the actual beat is to regard the 2nd note as a beat, such as the abstract beat. The sound length value is 1 (1 beat), so the sound length value measured according to the actual beat is 0.5 (0.5 beats). Of course, there are many forms of how to define and use the sound length value and how to convert the sound length value in practice. Need to choose what form to use. In this specification, the situation described in this paragraph is referred to as conversion situation 2.

另也可在编码规则里定义多个不同的拍的标准,比如有些定义的音素里的音长值按某一个拍的标准(比如把4分音符当成一拍),另一些定义的音素里的音长值按另一个拍的标准(比如把8分音符当成一拍),以上两句提到的拍即为抽象拍,与此编码规则专门对应解码规则里也这么定义拍,按此解码规则来对用此编码规则编码生成的拼合信息的值进行解码时,再根据被编码乐段实际用的实际拍的定义(指把几分音符当成一拍),可把拼合信息的值里的音素里的按抽象拍来计量的音长值以及解码得到的类音素里的按抽象拍来计量的音长值(类音素一般是对音素进行处理得到的,所以音素里的按抽象拍来计量的音长值可能会被照搬放到类音素里),换算成按实际拍来计量的音长值,对于不同的音素换算的情况可能不同(因为不同音素里的音长值用的拍的标准可能不同),对于不同的类音素换算的情况可能不同(因为不同音素里的音长值用的拍的标准可能不同而可能造成不同类音素里的音长值用的拍的标准也不同),比如抽象拍是把4分音符当成一拍,实际拍是把2分音符当成一拍,如按抽象拍来计量的音长值是1(1拍),那按实际拍来计量这个音长值就是0.5(0.5拍),当然实际上怎么来定义使用音长值及怎么转换音长值可有很多形式,可根据需要选择用什么形式。本说明书将本段所说情况称为转换情况3。In addition, multiple different beat standards can be defined in the coding rules. For example, the sound length value in some defined phonemes is based on a certain beat standard (for example, a quarter note is regarded as a beat), and in other defined phonemes The sound length value is based on the standard of another beat (for example, the 8th note is regarded as a beat). The beat mentioned in the above two sentences is an abstract beat. This encoding rule is specifically corresponding to the definition of the beat in the decoding rule. According to this decoding rule When decoding the value of the mosaic information generated by encoding with this encoding rule, and then according to the definition of the actual beat actually used by the coded section (referring to treat a fraction of a note as a beat), the phoneme in the value of the mosaic information can be The sound length value measured by the abstract beat in the phoneme and the sound length value measured by the abstract beat in the decoded phoneme (the phoneme is generally obtained by processing the phoneme, so the sound length value in the phoneme is measured by the abstract beat The sound length value may be copied into the class phoneme), converted into the sound length value measured by the actual beat, and the conversion situation may be different for different phonemes (because the beat standard used for the sound length value in different phonemes may be different), for different types of phonemes, the conversion situation may be different (because the beat standards used by the sound length values in different phonemes may be different, which may cause different beat standards for the sound length values in different phonemes), for example The abstract beat is to regard the 4th note as a beat, and the actual beat is to regard the 2nd note as a beat. If the sound length value measured according to the abstract beat is 1 (1 beat), then the sound length value measured according to the actual beat is 0.5 (0.5 beats), of course, there are many forms of how to define and use the sound length value and how to convert the sound length value, and you can choose which form to use according to your needs. In this specification, the situation described in this paragraph is referred to as conversion situation 3.

除以上转换情况1,转换情况2,转换情况3,转换情况4外,还可能有别的转换情况。In addition to the above conversion situation 1, conversion situation 2, conversion situation 3, and conversion situation 4, there may be other conversion situations.

关于音素里多个音各自的发音时刻值,一般是按约定来定义发音时刻值,比如按抽象拍来计量发音时刻值(而1抽象拍可以是4分音符,2分音符,8分音符等),音素里第1时刻(即最先)发音的音(也可能是静音),其发音时刻值为0,然后比如定义如后面一个音发的音在第1时刻后面2拍(抽象拍),那后面这个音的发音时刻值为2,并且可以编码规则和解码规则涉及定义的每个音素里的音都按这样来确定其发音时刻值。在解码过程中如未定位或定位的音素里涉及的拍的标准被转换了,那此未定位或定位的音素里的各发音时刻值也可能要相应转换(比如拍的标准从抽象拍的4分音符为一拍转变成2分音符为一拍,那原来为1的发音时刻值就要转变成为0.5),象前面述的转换情况3,转换情况4,或其它转换情况都是这样。如一个音素里包含多个音的信息,这多个音中某两个音各自的发音时刻值不同,那这两个音可被称为在不同时间点激发的发音。Regarding the respective pronunciation time values of multiple sounds in a phoneme, the pronunciation time values are generally defined according to the agreement, such as measuring the pronunciation time value by an abstract beat (and an abstract beat can be a 4th note, a 2nd note, an 8th note, etc. ), the sound (maybe silent) pronounced at the first moment (that is, the first) in the phoneme, its pronunciation time value is 0, and then, for example, the sound that is defined as the next sound is 2 beats behind the first moment (abstract beat) , then the pronunciation moment value of the latter sound is 2, and the encoding rule and the decoding rule relate to the sound in each phoneme defined in this way to determine its pronunciation moment value. In the decoding process, if the standard of the beat involved in the unlocated or positioned phoneme is converted, the value of each pronunciation moment in the unlocated or positioned phoneme may also be converted accordingly (for example, the standard of the beat is changed from the 4 of the abstract beat. It is a beat that one beat of fen note is changed into 2 fen note, and then the pronunciation moment value that is 1 will change into 0.5) originally, like the above-mentioned conversion situation 3, conversion situation 4, or other conversion situations are all like this. If a phoneme contains the information of multiple tones, and the pronunciation time values of two tones among the multiple tones are different, then these two tones can be called pronunciations stimulated at different time points.

音素里的定义的音的发音时刻值(此音素里含多个音的信息),与解码时涉及的乐段发音时刻值(比如后面解码举例里解码得到的音的乐段发音时刻值)不是同一回事,音素里的定义的音的发音时刻值一般只用拍(抽象拍)来计量,而且一般是以音素里最先发音的音的时刻作为时刻起点,比如最先发音的音的发音时刻值可定为0,然后以后的音每延后1拍(这里延后是从最先发音的音的发音时刻延后),此音的发音时刻值就加1,每延后1/2拍(这里延后是从最先发音的音的发音时刻延后),此音的发音时刻值就加0.5,而解码时涉及的乐段发音时刻值,可针对拍(一般是实际拍)来计量或是针对实际时间来计量,而且一般是以乐段最先发音的音的时刻作为时刻起点,并需计算各发音时刻到时刻起点之间的拍数或时间,象前面述的转换情况1,转换情况2,转换情况3,转换情况4,或其它转换情况都是这样,但是解码时要得到乐段里的音的乐段发音时刻值,一般是要用到音素里的定义的音的发音时刻值等来得到。对于解码得到的任一个类音素(或被定位的音素),如其涉及的乐段发音时刻值不是按实际拍而是按某中间拍来计量,那发音时一般也要转成按实际拍计量,相应的这个类音素(或被定位的音素)里涉及的乐段发音时刻值要跟着转换,相应的这个类音素(或被定位的音素)里的音长值等一般也要跟着转换。The pronunciation time value of the defined sound in the phoneme (this phoneme contains information about multiple tones) is not the same as the pronunciation time value of the segment involved in decoding (such as the pronunciation time value of the segment obtained by decoding in the following decoding example) The same thing, the pronunciation time value of the defined sound in the phoneme is generally only measured by beat (abstract beat), and generally the time starting point is the first pronounced sound in the phoneme, such as the pronunciation of the first pronounced sound The time value can be set to 0, and then every subsequent sound is delayed by 1 beat (the delay here is delayed from the pronunciation time of the first pronounced sound), the pronunciation time value of this sound is just increased by 1, and every delay is 1/2 beat (delay here is delayed from the pronunciation time of the first sound), the pronunciation time value of this sound is just added by 0.5, and the music section pronunciation time value involved in decoding can be adjusted for the beat (generally the actual beat) Measurement or measurement for the actual time, and generally take the moment of the first sound of the section as the starting point of the time, and it is necessary to calculate the number of beats or time between each pronunciation time and the starting point of the time, as in the aforementioned conversion situation 1 , conversion situation 2, conversion situation 3, conversion situation 4, or other conversion situations are all like this, but when decoding, to get the pronunciation time value of the music section of the tone in the section, generally the defined sound in the phoneme is used The pronunciation time value is obtained by waiting. For any type of phoneme (or positioned phoneme) obtained by decoding, if the pronunciation time value of the music segment involved is not measured by the actual beat but by a certain intermediate beat, then the pronunciation is generally converted to the actual beat. Correspondingly, the pronunciation time value of the passage involved in this type of phoneme (or positioned phoneme) will be converted accordingly, and correspondingly the sound length value in this type of phoneme (or positioned phoneme) will generally also be converted accordingly.

而解码得到的类音素里的音的发音时刻值,一般只用拍(拍的标准可为各种标准,比如是抽象拍,中间拍,实际拍等)来计量,其可以是根据解码时的需要来约定其形式,比如类音素里的音的发音时刻值,可以以类音素里最先发音的音的时刻作为时刻起点(这种情况类音素里含多个音的信息),比如最先发音的音的发音时刻值可定为0,然后比如以后的音每延后1拍(这里延后是从最先发音的音的发音时刻延后),此音的发音时刻值就加1,每延后1/2拍(这里延后是从最先发音的音的发音时刻延后),此音的发音时刻值就加0.5,如类音素里涉及的拍的标准被转换了,那仍是用这样的方式来表示类音素里的发音时刻值,只是各发音时刻值的数值一般要跟着转换(比如拍的标准从4分音符为一拍转变成2分音符为一拍,那原来为1的发音时刻值就要转变成为0.5),这种情况解码得到的这个类音素一般还要用于后面步骤的拼合;也可以以乐段最先发音的音的时刻作为时刻起点,这种情况类音素里的各音的发音时刻值(也即乐段发音时刻值),就是类音素里的各音发音的时刻与乐段最先发音的音的时刻之间的持续时间的值(这种情况类音素里只含一个音的信息或含多个音的信息),一般用实际拍来计量,这种情况解码时一般要综合前面解码得到的信息来确定从这个类音素里的各音发音的时刻到乐段最先发音的音的时刻之间的拍数,这种情况适合前面述的转换情况1,转换情况2,转换情况3,转换情况4,或其它转换情况,这种情况一般是直接把解码得到的这个类音素拿去发音;也可以以乐段中任一音发音的时刻作为时刻起点,类音素里的各音的发音时刻值,就是类音素里的各音发音的时刻与这个时刻起点之间的持续时间的值(这种情况类音素里只含一个音的信息或含多个音的信息),一般用拍(拍的标准可为各种标准,比如是抽象拍,中间拍,实际拍等)来计量,这种情况解码时一般要综合前面解码得到的信息来确定从这个类音素里的各音发音的时刻到这个时刻起点之间的拍数,这种情况适合前面述的转换情况1,转换情况2,转换情况3,转换情况4,或其它转换情况,这种情况解码得到的这个类音素里的各音的发音时刻值还可根据情况转成乐段发音时刻值。And the pronunciation moment value of the sound in the class phoneme that decoding obtains, generally only use beat (the standard that beats can be various standards, such as abstract beat, middle beat, actual beat etc.) to measure, and it can be according to when decoding It needs to agree on its form, such as the pronunciation time value of the sound in the phoneme, which can be the time starting point of the first sound pronounced in the phoneme (in this case, the phoneme contains information of multiple sounds), such as the first The pronunciation time value of the sound that is pronounced can be set as 0, and then for example, the following sound is delayed by 1 beat (the delay here is delayed from the pronunciation time of the first pronounced sound), and the pronunciation time value of this sound is just increased by 1. Every delay of 1/2 beat (the delay here is to delay from the pronunciation time of the first pronounced sound), the pronunciation time value of this sound is just added by 0.5, as the standard of the beat involved in the class phoneme is converted, then still This method is used to represent the pronunciation time value in the class phoneme, but the value of each pronunciation time value generally needs to be converted accordingly (for example, the standard of beat is changed from a 4th note to a 2nd note, which is originally The pronunciation time value of 1 will be changed to 0.5), in this case, the phoneme obtained by decoding is generally used for the combination of the following steps; it can also be the time starting point of the sound that is first pronounced in the music section, in this case The pronunciation time value of each sound in the class phoneme (that is, the music section pronunciation time value), is exactly the value of the duration between the time of each sound pronunciation in the class phoneme and the first sound of the music section (this The phoneme of the situation class contains information of only one sound or information of multiple sounds), which is usually measured by actual beats. In this case, the information obtained from the previous decoding is generally combined to determine the pronunciation of each sound in this class of phonemes. The number of beats between the moment of the music segment and the moment of the sound that is first pronounced in the section, this situation is suitable for the above-mentioned conversion situation 1, conversion situation 2, conversion situation 3, conversion situation 4, or other conversion situations, this situation is generally It is to directly pronounce the phoneme obtained by decoding; it can also use the moment when any sound in the paragraph is pronounced as the starting point of the time, and the pronunciation time value of each sound in the phoneme is the time when each sound in the phoneme is pronounced The value of the duration between the starting point of this moment (in this case, the information of only one tone or the information of multiple tones in the phoneme), usually in beat (the standard of beat can be various standards, such as abstract beat , middle beat, actual beat, etc.) to measure, in this case, it is generally necessary to synthesize the information obtained from the previous decoding to determine the number of beats from the moment when each sound in this type of phoneme is pronounced to the starting point of this moment, in this case It is suitable for the above-mentioned conversion situation 1, conversion situation 2, conversion situation 3, conversion situation 4, or other conversion situations. In this case, the pronunciation time value of each sound in this type of phoneme obtained by decoding can also be converted into a paragraph according to the situation. Pronunciation moment value.

而对于一个被定位的音素,也可处理成其含有的音的发音时刻值也可类似于以上所述类音素的情况那样,可以以被定位的音素里最先发音的音的时刻作为时刻起点,也可以以乐段最先发音的音的时刻作为时刻起点,也可以以乐段中任一音发音的时刻作为时刻起点,来计量被定位的音素里的音的发音时刻值。And for a positioned phoneme, it can also be processed into the pronunciation time value of the sound it contains and also can be similar to the situation of the above-mentioned class of phonemes, and the moment of the sound that is first pronounced in the positioned phoneme can be used as the time starting point , it is also possible to use the moment of the sound that is first pronounced in the music section as the starting point of the time, or the time when any sound is pronounced in the music section to be used as the starting point of the time to measure the pronunciation time value of the sound in the positioned phoneme.

以上这些段有关拍及音长值及发音时刻值的叙述中涉及的实际拍,都是以被编码乐段中只使用一个拍的标准为例的(被编码乐段中不改变拍的标准),本说明书后面述的编码规则举例和解码规则举例和编码举例和解码举例里也是这样都是以被编码乐段中只使用一个拍的标准为例的。如被编码乐段中有改变拍的标准的情况,仍可象以上这些段所述(包括可为了处理方便来进行各个拍的标准的转换),只是在涉及拍的标准的转换时,可能会出现有些拍的标准要转成某一实际拍的标准,另有些拍的标准要转成另一实际拍的标准这样的事情,但各个转换时涉及的音长值的改变,发音时刻值的改变的情况都各自如以上所述乐段中只有一个拍的标准时涉及的拍的标准转换时涉及的音长值的改变,发音时刻值的改变的情况,计算从作为时刻起点的乐段最先发音的音的时刻(或从作为时刻起点的乐段中任一音发音的时刻)到类音素(或未定位的音素或定位的音素)里的音的发音时刻值之间的拍数的值,也可参照以上所述,只是要知道被编码乐段中不同地方有使用不同的拍的标准。The actual beats involved in the narrations of beats, sound length values and pronunciation time values in the above paragraphs are all based on the standard that only one beat is used in the coded paragraph (the standard of the beat is not changed in the coded paragraph) This is also the case in the encoding rule examples and decoding rule examples and encoding examples and decoding examples described later in this manual, all of which use the standard of using only one beat in the encoded music section as an example. If there is a situation of changing the standard of the beat in the coded musical section, it can still be described in the above paragraphs (including the standard conversion of each beat can be carried out for the convenience of processing), but when the standard conversion of the beat is involved, it may be possible Some beat standards need to be converted into a certain actual beat standard, and some other beat standards need to be converted into another actual beat standard, but the change of the sound length value and the pronunciation time value involved in each conversion The situation is all respectively as above-mentioned paragraph when there is only the change of the sound length value involved in the standard conversion of the beat when there is only one beat standard, and the situation of the change of the pronunciation time value, the calculation starts from the beginning of the music paragraph as the time point The value of the number of beats between the moment of the sound (or from the moment when any sound is pronounced in the paragraph as the starting point of the moment) to the pronunciation moment value of the sound in the class phoneme (or unlocated phoneme or positioned phoneme), You can also refer to the above, just know that different places in the coded section have different beat standards.

至于前面说到的解码时一般要综合前面解码得到的信息来确定从这个类音素里的各音发音的时刻到乐段最先发音的音的时刻之间的拍数(本段以下称这个类音素为当前类音素),其处理方式一般是:对一个声道,看从乐段开始,到当前类音素为止,中间有哪些音素和类音素发音(本句和以上3句所述适合于解码用整步发音方式和分步发音方式,分步发音方式时要看从乐段开始到这里处理当前类音素这一步之间各步此声道有些什么音素及类音素发音),这些音素和类音素也就是此声道当前类音素前面发音的所有音素和类音素,按发音顺序看这些音素和类音素(不包括当前类音素),对其中任一音素(定位或未定位的),看其最后发音的音(如最后同时有多个音发音,那取这多个音中音长最长的音),得此音的音长值,将此音的发音时刻值作为一结果(本句括号外所说是在此音的发音时刻值是以此音素的第1个发音时刻为时刻起点的前提下说的;如果此音的发音时刻值不是以此音素的第1个发音时刻为时刻起点的,那要用此音的发音时刻值减去此音素的第1个发音时刻的发音时刻值来作为这里说的结果),将此结果和此音的音长值加起来就是这个音素中的所有音发音持续的时间(一般按拍来计量),以上几句是按音素里含有多个音的信息来说的(比如前面说的2型,3型音素,这里括号内说的音素是指编解码规则定义的音素),对于音素里只含有1个音的信息的情况(比如前面说的1型音素,这里括号内说的音素是指编解码规则定义的音素),音素里的音长值就是音素的所有音发音持续的时间(一般按拍来计量),对于其中定位或未定位的类音素也可象以上这样计算其含有的所有音发音持续的时间(一般按拍来计量),这样就把此声道当前类音素前面发音的所有音素和类音素各自的发音持续时间加起来,就是此声道当前类音素的前面发音占有的时间(一般按拍来计量),从而也得到当前类音素最先发音的乐段发音时刻值,并可根据当前类音素里各音之间的发音时刻值的差值进而得到当前类音素里的各音发音的时刻到乐段最先发音的音的时刻之间的时间(一般用拍来计量;这里当前类音素是含多个音的信息的),也得到当前类音素里各音的乐段发音时刻值。以上各计算时一般还要注意各数据涉及的计量单位(比如拍的标准)是否一致,如不一致那一般需换算成一致。本段以上计算类音素里的各音的乐段发音时刻值的原理,也适合于计算被定位的音素里的各音的乐段发音时刻值,即解码时综合前面解码得到的信息来确定从当前音素里的各音发音的时刻到乐段最先发音的音的时刻之间的拍数,也可按本段以上所述处理,而且也是适合于解码用整步发音方式和分步发音方式的。本说明书后面各解码举例(解码举例1和解码举例2)里计算音的乐段发音时刻值都可按本段所述这样计算。(本段各处所说的拍的标准可为各种标准,比如是抽象拍,中间拍,实际拍等;除非提及是有特别注明,本段各处提及的音素默认是未定位或定位的音素)As for the above-mentioned decoding, it is generally necessary to synthesize the information obtained from the previous decoding to determine the number of beats from the moment when each sound in this class of phonemes is pronounced to the moment when the first sound of the music section is pronounced (this paragraph is hereinafter referred to as this class) The phoneme is the current class phoneme), and its processing method is generally: for a sound track, see from the beginning of the musical section to the current class phoneme, which phonemes and phoneme-like pronunciations are in the middle (this sentence and the above 3 sentences are suitable for decoding Use the whole-step pronunciation method and the step-by-step pronunciation method. During the step-by-step pronunciation method, it depends on what phonemes and phoneme-like pronunciations there are in this channel in each step from the beginning of the paragraph to the step of processing the current class of phonemes here), these phonemes and class Phonemes are all the phonemes and phonemes pronounced in front of the current phoneme in this channel. Look at these phonemes and phonemes (excluding the current phoneme) in the order of pronunciation. For any phoneme (positioned or unlocated), look at its The sound of last pronunciation (as finally having multiple sound pronunciations at the same time, then get the longest sound in these multiple sounds), get the sound length value of this sound, and use the pronunciation moment value of this sound as a result (this sentence What is said outside the brackets is the premise that the pronunciation time value of this sound is the first pronunciation time of this phoneme as the starting point of the time; if the pronunciation time value of this sound is not the first pronunciation time of this phoneme as the time starting point, then use the pronunciation time value of this sound minus the pronunciation time value of the first pronunciation time of this phoneme as the result mentioned here), add this result and the sound length value of this sound to get the phoneme The duration of the pronunciation of all the sounds (usually measured by beat), the above sentences are based on the information that contains multiple sounds in the phoneme (such as the type 2 and type 3 phonemes mentioned above, the phonemes in brackets here are Refers to the phoneme defined by the codec rule), for the case where the phoneme contains only one sound information (such as the type 1 phoneme mentioned above, the phoneme mentioned in the brackets here refers to the phoneme defined by the codec rule), the phoneme in the phoneme The long value is the duration of all sounds of the phoneme (generally measured by beats), and the duration of all sounds contained in it can also be calculated as above for the positioned or unlocated phonemes (generally measured by beats) , in this way, add up all the phonemes pronounced in front of the current phoneme in this channel and the respective pronunciation durations of the phoneme, which is the time occupied by the pronunciation in front of the current phoneme in this channel (usually measured by beat), thus also get The pronunciation time value of the first utterance of the current class phoneme, and the difference between the pronunciation time values between the sounds in the current class phoneme can be obtained to the first pronunciation time of each sound in the current class phoneme The time between the moments of the sound (generally measured by beat; the current class phoneme contains information of multiple sounds), also obtains the pronunciation moment value of each tone in the current class phoneme. In the above calculations, it is generally necessary to pay attention to whether the measurement units involved in each data (such as the shooting standard) are consistent. If they are not consistent, they generally need to be converted into consistency. The above principle of calculating the pronunciation moment value of each tone in the class phoneme is also suitable for calculating the pronunciation moment value of each tone in the positioned phoneme, that is, when decoding, the information obtained by comprehensively decoding in front is determined from The number of beats between the moment when each sound in the current phoneme is pronounced and the moment when the sound is first pronounced in the music section can also be processed as described above in this paragraph, and it is also suitable for decoding with a full-step pronunciation method and a step-by-step pronunciation method of. In each decoding example (decoding example 1 and decoding example 2) in the following description of this manual, the musical section pronunciation time value of the calculated sound can be calculated as described in this paragraph. (The standard of the beats mentioned in this paragraph can be various standards, such as abstract beats, intermediate beats, actual beats, etc.; unless otherwise specified, the default phonemes mentioned in this paragraph are unlocated or positioned phonemes)

对于前面说到的解码时一般要综合前面解码得到的信息来确定从这个类音素里的各音发音的时刻到乐段任一音发音的时刻之间的拍数(本段以下称这个类音素为当前类音素),也是与上段所述类似,一般是找到任一音所处的音素或类音素里最后时刻发音的所有音中音长最长的音,将此音的发音时刻值减去任一音的发音时刻值,再加上最后这个音的音长值得一结果(以上几句是按此音不是其所处的音素或类音素里最后发音来说的,如果此音是最后发音,那就是把此音所处的音素或类音素里最后时刻发音的所有音中音长最长的音的音长值作为这个结果),把这个结果再加上此声道这个音素或类音素与当前类音素之间的各音素和类音素各自的发音持续时间(本句所述适合于解码用整步发音方式和分步发音方式,分步发音方式时要看从前述乐段任一音发音的时刻到这里处理当前类音素这一步之间各步此声道有些什么音素及类音素发音),就是当前类音素最先发音的发音时刻值,并可根据当前类音素里各音之间的发音时刻值的差值进而得到当前类音素里的各音发音的时刻到前述任一音发音的时刻之间的时间(一般用拍来计量;这里当前类音素是含多个音的信息的),也得到当前类音素里各音的发音时刻值;以上各计算时一般还要注意各数据涉及的计量单位(比如拍的标准)是否一致,如不一致那一般需换算成一致。本段以上计算类音素里的各音的发音时刻值的原理,也适合于计算被定位的音素里的各音的发音时刻值(以乐段中某一音发音的时刻为时刻起点),即解码时综合前面解码得到的信息来确定从当前音素里的各音发音的时刻到乐段任一音发音的时刻之间的拍数,也可按本段以上所述处理,而且也是适合于解码用整步发音方式和分步发音方式的。(本段各处所说的拍的标准可为各种标准,比如是抽象拍,中间拍,实际拍等;除非提及是有特别注明,本段各处提及的音素默认是未定位或定位的音素)For the above-mentioned decoding, it is generally necessary to synthesize the information obtained from the previous decoding to determine the number of beats from the moment of pronunciation of each sound in this class of phonemes to the moment of pronunciation of any tone in the music section (this paragraph is hereinafter referred to as this class of phonemes) is the current class phoneme), which is also similar to that described in the previous paragraph. Generally, the sound with the longest sound length among all the sounds pronounced at the last moment in the phoneme or phoneme where any sound is located is subtracted from the pronunciation time value of this sound The pronunciation time value of any sound, plus the sound length of the last sound is worth a result (the above sentences are based on the fact that this sound is not the last pronunciation in the phoneme or phoneme it is in, if this sound is the last pronunciation , that is, the sound length value of the sound with the longest sound length among all the sounds pronounced at the last moment in the phoneme or phoneme where this sound is located is taken as the result), and this result is added to the phoneme or phoneme of this vocal tract Each phoneme and class phoneme between the current class phoneme and the respective pronunciation duration of the class phoneme (this sentence is suitable for decoding with a whole-step pronunciation method and a step-by-step pronunciation method, and the step-by-step pronunciation method depends on any tone from the preceding paragraph Pronunciation moment to here to deal with the current class phoneme this step (what phonemes and phoneme pronunciations are there in this channel) is the pronunciation time value of the first pronunciation of the current class phoneme, and can be based on the current class phoneme between each sound The time difference between the pronunciation time value of each sound in the current class phoneme and the time between the time of the pronunciation of any one of the above-mentioned sounds (generally measured by beat; here the current class phoneme is the information containing multiple sounds ), also obtain the pronunciation time value of each sound in the current class phoneme; generally also pay attention to whether the unit of measurement (such as the standard of clapping) involved in each data is consistent during the above calculations, and generally need to be converted into consistency as inconsistent. The principle of calculating the pronunciation time value of each sound in the class phoneme above this paragraph is also suitable for calculating the pronunciation time value of each sound in the positioned phoneme (taking the time when a certain sound is pronounced in the music section as the time starting point), that is When decoding, the information obtained by the previous decoding is integrated to determine the number of beats from the moment when each sound in the current phoneme is pronounced to the moment when any sound is pronounced in the music section. It can also be processed as described above in this paragraph, and it is also suitable for decoding Use the whole-step pronunciation method and the step-by-step pronunciation method. (The standard of the beats mentioned in this paragraph can be various standards, such as abstract beats, intermediate beats, actual beats, etc.; unless otherwise specified, the default phonemes mentioned in this paragraph are unlocated or positioned phonemes)

以上两段所述都是每次针对一个声道来说的。上两段提到的‘这些音素和类音素’,可能只包括数个音素,或只包括数个类音素,甚至既不包括音素也不包括类音素(累加的发音持续的时间为0,比如当前类音素就是乐段最先发音的),累加发音持续的时间时没有音素就不计算音素的发音持续的时间,没有类音素就不计算类音素的发音持续的时间;同样,上一段提到的‘各音素和类音素’,可能只包括数个音素,或只包括数个类音素,甚至既不包括音素也不包括类音素(累加的发音持续的时间为0,比如当前类音素就是乐段最先发音的),累加发音持续的时间时没有音素就不计算音素的发音持续的时间,没有类音素就不计算类音素的发音持续的时间。(除非提及时有特别注明,本段各处提及的音素默认是未定位或定位的音素)The above two paragraphs are for one channel at a time. The 'these phonemes and phonemes' mentioned in the last two paragraphs may only include a few phonemes, or only a few phonemes, or even neither phonemes nor phonemes (the cumulative pronunciation duration is 0, such as The current class phoneme is exactly the first pronunciation of the music section), when adding up the duration of the pronunciation, the duration of the pronunciation of the phoneme will not be calculated if there is no phoneme, and the duration of the pronunciation of the phoneme will not be calculated if there is no class phoneme; similarly, the previous paragraph mentioned 'Each phoneme and phoneme' may only include several phonemes, or only several phonemes, or even neither phoneme nor phoneme (the cumulative pronunciation duration is 0, such as the current phoneme is music segment first pronounced), when accumulating the duration of the pronunciation, the duration of the pronunciation of the phoneme is not calculated without the phoneme, and the duration of the pronunciation of the phoneme is not calculated without the phoneme. (Unless otherwise specified at the time of mention, phonemes mentioned throughout this paragraph default to unlocated or located phonemes)

在拼合信息的值里也可专门放些BIT以在解码时对从拼合信息的值里可直接得到的音素或解码过程中得到的类音素,作修改拍的标准,确定拍的标准,转换拍的标准等操作,这些操作可看成是拼合相关处理形式。In the value of the combined information, some BITs can also be specially placed to modify the standard of the beat, determine the standard of the beat, and convert the beat to the phonemes that can be directly obtained from the value of the combined information or the phonemes that can be obtained during the decoding process during decoding. These operations can be regarded as a form of flattening related processing.

在乐段中可能存在同一时刻有发多个音,且这多个音的音长值不全相同,其处理其实也没什么特别,如音组合里有这样的情况,那把这音组合定义成音素时,仍可记录音组合里各音的发音时刻值和各音的音长值即可(不管同时发音的不同音的音长值有没有相同关系),对未定位或定位的音素作修改时,也可增加删除这种在某一发音时刻与其它此发音时刻发的音的音长不同的音,解码发音时,按音素或修改结果等里面的各音的发音时刻值及各音的音长值等作相应发音操作即可;本说明书后面述的编码规则举例,解码规则举例都可这样处理以上所述情况。There may be multiple sounds at the same time in the passage, and the length values of these multiple sounds are not all the same. The processing is actually nothing special. If there is such a situation in the sound combination, then define the sound combination as a phoneme When it is time, it is still possible to record the pronunciation time value of each sound in the sound combination and the sound length value of each sound (regardless of whether the sound length values of different sounds pronounced at the same time have the same relationship), when modifying unlocated or positioned phonemes , it is also possible to add and delete the sound whose sound length is different from that of other sounds at a certain pronunciation moment. The long value and the like can be used for corresponding pronunciation operations; the examples of encoding rules and decoding rules described later in this specification can handle the above-mentioned situations in this way.

如提及时没指明有他意,也不能从提及处的上下文推出有他意(比如说到‘对拍的标准作转换’就是有他意,即这里涉及的拍并不是专指抽象拍),本说明书以下各处提及的‘拍’默认是指音素定义涉及的抽象拍。If there is no other intention when mentioning it, and it cannot be deduced from the context of the mention (for example, there is another intention when "converting the standard of the beat" is used, that is, the beat mentioned here does not refer specifically to abstract beats), this manual The 'beat' mentioned in the following places refers to the abstract beat involved in the phoneme definition by default.

本申请文件所说的其它附加信息,是指比如音的音色信息,音强信息,节奏信息(节奏信息包含每实际拍是多长时间等信息),调式信息,谱号信息,拼合涉及的操作信息,等,这些附加信息中的任一附加信息可能放在音素里,也可能处于音素外且放在拼合信息的值里在拼合时要使用到,而如对多个未定位或定位的音素(音素里有其它附加信息)进行后面述的顺序拼合,交织拼合,复制处理等得到类音素,那这多个未定位或定位的音素里原来那些其它附加信息也可继续存在于得到的类音素里比如用其表示类音素里哪些音是什么音色等;拼合时也可对未定位或定位的音素进行处理以对放在未定位或定位的音素里的附加信息进行处理,比如对未定位或定位的音素里的音色信息进行修改,拼合时也可对未定位或定位的类音素进行处理以对放在未定位或定位的类音素里的附加信息进行处理,比如对未定位或定位的类音素里的音色信息进行修改,拼合时可根据未定位或定位的音素里的操作信息(属于其它附加信息)来在拼合时作相应操作(比如控制拼合时每一拼合步骤作什么,或改变调式,节奏,或按操作信息来完成用组拼形式,拼合相关处理形式等作的处理),拼合时可根据未定位或定位的类音素里的操作信息(属于其它附加信息)来在拼合时作相应操作(比如控制拼合时每一拼合步骤作什么,或改变调式,节奏,或按操作信息来完成用组拼形式,拼合相关处理形式等作的处理);拼合时一般要把处于音素外(比如拼合信息的值的编码头里)的被编码乐段的所有声道各自的节奏信息,调式,谱号信息(节奏信息,调式信息,谱号信息是其它附加信息)与用音素进行拼合得到的结果一起组成解码结果,以供播乐段发音。The other additional information mentioned in this application document refers to the timbre information of the sound, the sound intensity information, the rhythm information (the rhythm information includes information such as how long each actual beat is), the mode information, the clef information, and the operations involved in stitching Information, etc., any of these additional information may be placed in the phoneme, or it may be outside the phoneme and placed in the value of the combined information to be used when combining, and for multiple unlocated or positioned phonemes (there are other additional information in the phoneme) to carry out the following sequential splicing, interweaving and splicing, copy processing etc. to obtain the quasi-phoneme, then those other additional information in the multiple unlocated or positioned phonemes can continue to exist in the obtained quasi-phoneme For example, it is used to indicate which sounds in the class phoneme are which sounds, etc.; when assembling, the unlocated or positioned phoneme can also be processed to process the additional information placed in the unlocated or positioned phoneme, such as the unlocated or positioned phoneme. The timbre information in the positioned phoneme can be modified, and the unlocated or positioned phoneme can also be processed to process the additional information placed in the unlocated or positioned phoneme, such as the unlocated or positioned phoneme. The timbre information in the phoneme is modified, and the operation information (belonging to other additional information) in the unlocated or located phoneme can be used to perform corresponding operations during the combination (such as controlling what to do in each combination step during the combination, or changing the mode) , rhythm, or according to the operation information to complete the processing with the form of combination, combination of related processing forms, etc.), the combination can be done according to the operation information (belonging to other additional information) in the unlocated or positioned phoneme. Corresponding operations (such as controlling what to do in each merging step during merging, or changing the mode, rhythm, or completing the processing with group merging forms, merging related processing forms, etc. according to the operation information); For example, the rhythm information, mode, and clef information (rhythm information, mode information, and clef information are other additional information) of all channels of the coded music section in the encoding header of the value of the combined information are combined with phonemes to obtain The results together form the decoding result for the playback of the music segment.

如一音素里含有其它附加信息,那可能是含有比如音的音色信息,音强信息,节奏信息,调式信息,谱号信息,拼合涉及的操作信息,等这些信息中的一种或多种,且如是含有多种的话,那可把各种附加信息分别用于拼合涉及的处理;如一拼合信息的值里放有其它附加信息(不是指音素编号对应的音素里放的),那可能是放有比如音的音色信息,音强信息,节奏信息,调式信息,谱号信息,拼合涉及的操作信息,等这些信息中的一种或多种,且如是放有多种的话,那可把各种附加信息分别用于拼合涉及的处理。If a phoneme contains other additional information, it may contain one or more of such information as timbre information, pitch information, rhythm information, mode information, clef information, operation information involved in stitching, etc., and If it contains multiple words, then various additional information can be used for the processing involved in the combination; if there are other additional information in the value of the combination information (not referring to the phoneme corresponding to the phoneme number), it may be that there is For example, one or more of the timbre information, sound intensity information, rhythm information, mode information, clef information, operation information involved in combination, etc., and if there are multiple kinds of information, then various The additional information is used separately for stitching-involved processing.

音素里的音色信息,一般就是指本音素里的音的音色信息。The timbre information in a phoneme generally refers to the timbre information of the tone in the phoneme.

音素里含有的其它附加信息,一般是用于拼合时本音素所处声道的拼合处理的(甚至就是作用于本音素的),比如音素里的音色信息,一般就是指本音素里的音的音色信息,音素里含有的其它附加信息,也可以是用于拼合时包括本音素所处声道以外的声道在内的一个或多个声道的拼合处理的,比如音素里的操作信息,其既可以只用于拼合时本音素所处声道的拼合处理,也可以还用于拼合时本音素所处声道以外的声道的拼合处理。Other additional information contained in the phoneme is generally used for the splicing process of the sound channel where the phoneme is located (even for the phoneme). For example, the timbre information in the phoneme generally refers to the sound in the phoneme. Timbre information, other additional information contained in the phoneme, can also be used for the stitching process of one or more channels including the channel other than the channel where the phoneme is located, such as the operation information in the phoneme, It can be used only for the combination processing of the channel where the phoneme is located during the combination, or it can also be used for the combination processing of the channels other than the channel where the phoneme is located during the combination.

对于表示如何用多个音素来拼合乐段发音的拼合信息的值,那可以是这多个音素中的每个音素里都无其它附加信息,也可以是这多个音素中的每个音素里都有其它附加信息,也可以是这多个音素中的部分音素中每个音素里都无其它附加信息而这多个音素中的另外部分音素中每个音素里都有其它附加信息,而且对于以上3种情况中的任一情况,都可以是在拼合信息的值里(不是音素里)无其它附加信息或有其它附加信息。For the value of the combination information indicating how to use multiple phonemes to combine the pronunciation of the passage, it can be that there is no other additional information in each of the multiple phonemes, or it can be in each of the multiple phonemes. all have other additional information, or there may be no other additional information in each phoneme in some of the multiple phonemes, but there is other additional information in each phoneme in the other part of the phonemes, and for In any of the above three cases, there may be no other additional information or other additional information in the value of the combined information (not in the phoneme).

如提及时无特别指明有他意,本申请文件各处提及的其前面没‘其它’来修饰的‘附加信息’,默认是指‘其它附加信息’;如提及时无特别指明有他意,本申请文件各处提及的其前面没‘涉及拼合’来修饰的‘操作信息’,默认是指‘涉及拼合的操作信息’。If there is no special indication of other meanings when mentioned, the "additional information" mentioned in this application document without the modification of "other" in front of it refers to "other additional information" by default; if there is no special indication of other meanings when mentioned, this The 'operational information' mentioned in various parts of the application documents, which is not modified by 'involving splicing', refers to 'operational information involving splicing' by default.

一个音素里的操作信息,可以是只涉及本音素的操作(比如是用拼合相关处理形式等作的处理),也可涉及本音素和其它未定位或定位的音素(或其它未定位或定位的类音素)的操作(比如是用组拼形式,或拼合相关处理形式等作的处理),也可以只涉及其它未定位或定位的音素(或其它未定位或定位的类音素)的操作(比如是用组拼形式,或拼合相关处理形式等作的处理)。一个音素里的操作信息,甚至可以是包括对整个拼合信息的值(这个音素是放在这个拼合信息的值里的)进行解码涉及的所有步骤里每个步骤要做什么这样的信息。The operation information in a phoneme can be an operation involving only this phoneme (for example, processing by combining related processing forms), or it can involve this phoneme and other unlocated or positioned phonemes (or other unlocated or positioned phonemes) Phoneme) operations (such as the processing of combining forms, or combining related processing forms, etc.), can also only involve other unlocated or positioned phonemes (or other unlocated or positioned phonemes) operations (such as It is processed in the form of grouping or combining related processing forms). The operation information in a phoneme may even include information about what to do in each step in all steps involved in decoding the value of the entire combined information (this phoneme is placed in the value of the combined information).

比如对于一个音组合,可对应定义两个音素,一个音素(此音素是自定位的)里只含这个音组合里的那些音的信息(比如各音的音长值,各音的音高值等),另一个音素(此音素是自定位的)里既含这个音组合里的那些音的信息,也含一个操作信息(复制操作),在编码时,如乐段里只出现一次这音组合,那就在拼合信息的值里放前面那个音素的音素编号,如乐段里有连续重复几次出现这个音组合,那就可在拼合信息的值里放后面这个音素的音素编号和复制的次数,这样解码时从拼合信息的值里取得后面这个音素的音素编号后,按音素编号到音素数据库里找到此音素的记录项,从记录项里取出复制的操作信息,再按拼合信息的值里这个音素编号后面的复制的次数,来作复制操作,(这里说的音素数据库的定义描述在后面)。For example, for a sound combination, two phonemes can be defined correspondingly, and a phoneme (this phoneme is self-positioning) only contains the information of those sounds in this sound combination (such as the sound length value of each sound, the pitch value of each sound etc.), another phoneme (this phoneme is self-positioning) contains not only the information of those sounds in this sound combination, but also an operation information (copy operation). combination, then put the phoneme number of the previous phoneme in the value of the merged information, if there are several consecutive repetitions of this tone combination in the passage, then put the phoneme number and copy of the latter phoneme in the value of the merged information In this way, after decoding the phoneme number of the following phoneme from the value of the combined information, find the record item of this phoneme in the phoneme database according to the phoneme number, take out the copied operation information from the record item, and then press the button of the combined information The number of times of copying behind the phoneme number in the value is used to perform the copying operation (the definition of the phoneme database mentioned here is described later).

(将本段内以下所提及的未定位或定位的音素都改成未定位或定位的类音素,那本段内以下内容仍成立),在解码时,对于任一含其它附加信息的未定位或定位的音素,都可以有本段内以下所述情况:对这未定位或定位的音素作处理(或对这未定位或定位的音素与其它音素或类音素作处理等)得到类音素,那这未定位或定位的音素里的其它附加信息可能要被照搬放到得到的类音素里,比如这未定位或定位的音素里的音色,节奏等信息一般要被照搬放到得到的类音素里,另外这未定位或定位的音素里的有些其它附加信息也可能要被修改后再放到得到的类音素里,比如对音色,节奏等信息进行修改后再放到得到的类音素里,这些修改后放在得到的类音素里的其它附加信息可用于此动作(此动作指得到这个类音素的动作)后面的解码处理(比如后面的发音或后面的拼合处理)。(The unlocated or positioned phonemes mentioned below in this paragraph are all changed into unlocated or positioned phonemes, then the following content in this paragraph is still established), when decoding, for any unlocated or positioned phonemes that contain other additional information The phoneme of positioning or positioning can have the following situation in this paragraph: the phoneme that is not positioned or positioned is processed (or the phoneme that is not positioned or positioned is processed with other phonemes or phonemes, etc.) to obtain the phoneme , then other additional information in the unlocated or located phoneme may be copied into the obtained class phoneme, such as the timbre, rhythm and other information in the unlocated or located phoneme. In addition, some other additional information in the unlocated or located phoneme may also be modified and then put into the obtained phoneme, such as modifying the timbre, rhythm and other information before putting it into the obtained phoneme , other additional information placed in the obtained phoneme after these modifications can be used for decoding processing (such as subsequent pronunciation or subsequent splicing processing) after this action (this action refers to the action of obtaining this phoneme).

在发音时,如要被用来发音的定位的音素或定位的类音素里有音色信息,那一般要按这音色信息来发相应音色的音。When pronouncing, if there is timbre information in the locating phoneme or locating phoneme to be used for pronunciation, then generally the sound of the corresponding timbre should be issued according to the timbre information.

本申请各处提及的‘用音素来拼合乐段发音’的含义,是默认由拾音器来执行,即由拾音器用多个音素来作处理(作的处理包括:把几个音素拿来作组合,对音素先作修改再拿去与别的音素作组合,对音素进行定位,对音素作复制操作等这些处理中的一种或多种)而得到乐段的发音。The meaning of "using the phonemes to combine the pronunciation of the passage" mentioned in this application is to be executed by the pickup by default, that is, the pickup uses a plurality of phonemes for processing (the processing includes: combining several phonemes , the phoneme is first modified and then combined with other phonemes, the phoneme is positioned, the phoneme is copied, etc. one or more of these processes) to obtain the pronunciation of the passage.

如提及时无特别指明有他意,本说明书各处在描述用音素来拼合的技术时提及的用音素来拼合乐段发音(或类似提法比如说用音素来拼合一乐段发音,用多个音素来拼合乐段发音,用多个音素来拼合一乐段发音等),默认是用多个音素来拼合一乐段的全部发音,且这多个音素也就是用于拼合乐段发音的音素。If there is no special indication when it is mentioned, there is no other meaning in this manual, when describing the technology of combining phonemes, it is mentioned that using phonemes to combine the pronunciation of a period (or similar formulations such as using phonemes to combine the pronunciation of a period, using multiple use multiple phonemes to combine the pronunciation of a segment, and use multiple phonemes to combine the pronunciation of a segment, etc.), the default is to use multiple phonemes to combine all the pronunciations of a segment, and these multiple phonemes are also used to combine the pronunciation of a segment. phoneme.

本申请文件所说的表示如何用音素来拼合乐段发音的拼合信息的值,实际是指表示如何用音素来拼合乐段的全部发音的拼合信息的值,即编码生成拼合信息的值时,是预定让解码规则按此拼合信息的值解码以播放被编码乐段的全部发音的,但实际按解码规则解码时,可能是根据情况解码播被编码乐段的全部发音或部分发音。The value of the combination information that indicates how to use phonemes to combine the pronunciation of a section mentioned in this application document actually refers to the value of the combination information that indicates how to use phonemes to combine all the pronunciations of a section, that is, when the value of the combination information is generated by encoding, It is intended that the decoding rule be decoded according to the value of the combined information to play all the pronunciations of the coded section, but when actually decoding according to the decoding rule, it may decode and play all or part of the pronunciation of the coded section according to the situation.

用音素来拼合乐段发音,其中一般要涉及到组合操作,常规的组拼形式就是顺序拼合,即比如把两个物件顺序排列放在一起(这里说的两个物件中任一物件可以是音素,或是类音素等,比如这里说的两个物件都是音素,或一个是复制前面的音素而得到的类音素且另一个是音素),并可将此顺序拼合的结果用于发音,比如把结果中的这两个物件顺序转换成发音播出来(这里这两个物件的发音是处于先后顺序的比如这两个物件的发音是属于同一声道里且是按先后顺序发音),或是把结果中的这两个物件同时转换成发音播出来(这里这两个物件的发音是属于不同声道且是同时发音的),也可以把此顺序拼合的结果(即顺序排列放在一起的这两个物件)用于此顺序拼合处理后面的拼合步骤的拼合,顺序拼合也可以是把多个物件顺序排列放在一起,以上描述顺序拼合时涉及的物件里是以没有其它附加信息为例的,如物件里还有其它附加信息(比如音色,节奏信息,操作信息),那仍可象以上所述这样作顺序拼合,拼合得到的结果里按约定一般也相应含有这些其它附加信息(比如一个音素里含有某几个音的信息并含有其它附加信息比如这某几个音的音色信息,另一个音素里含有另几个音的信息并含有其它附加信息比如这另几个音的音色信息,那把这两个音素顺序拼合起来得到的结果里既有这某几个音的信息也有这某几个音的音色信息也有这另几个音的信息也有这另几个音的音色信息),拼合得到的结果仍可用于发音或用于其它拼合,以上顺序拼合时用到的音素,类音素,可以是定位的,也可以未定位的,拼合得到的结果可以是自定位的,也可以没自定位的;涉及的组拼形式也可以是交织拼合,即比如把两个物件顺序排列放在一起(这里说的两个物件中任一物件可以是音素,或是类音素等,比如这里说的两个物件都是音素,或一个是复制前面的音素而得到的类音素且另一个是音素),而且前面物件中至少有一个音的发音是处于后面物件中某一音的发音的后面(即前面物件中至少有一个音插入到后面物件中),具体情况可能是前面物件中的音插入到后面物件里了且后面物件中的音插入到前面物件里了,也可能是只有前面物件中的音插入到后面物件里了,也可能是只有后面物件中的音插入到前面物件里了,等,解码时把前后两个物件按这样交织拼合的方式顺序排列放在一起组成一个新的类音素(这个新的类音素的任意部分也可被看成是类音素),这个新的类音素(或其任意部分,其任意部分也可看成是类音素)可用于发音(比如分步发音方式时的一步只发这个新的类音素的音,或比如这个新的类音素就是解码用整步发音方式时解码得到的解码结果的全部并可按这个新的类音素播乐段的全部或部分发音,以上两句说到的情况中的新的类音素是以是自定位的为例的),也可用于此交织拼合的后面的拼合步骤的拼合,交织拼合也可以是把多个物件顺序排列放在一起即拼合起来,且拼合起来后原来这多个物件中的每个物件中都有音插入到这多个物件中其它物件里;以上交织拼合时用到的音素,类音素,一般是都定位的,但也可以是未定位的,拼合得到的结果可以是自定位的,也可以没自定位的;比如图15所示为例子,由这两个被定位的音素交织拼合得到8个音的组合,就如是图14所示声道2的音素56(3)后面的8个音的组合,交织拼合时图15中的第2个被定位的音素的第1音插入到第1个被定位的音素里去了,这里作交织拼合前是有确定用于交织拼合的两个被定位的音素在五线谱上所处的上下位置的即所处上下位置是象图15那样;如用于交织拼合的物件都是音素,那交织拼合时一般是先确定用于交织拼合的各音素在五线谱上的上下位置,前后顺序后,再按约定的法则进行交织拼合;本说明书后面编码举例2和解码举例2里涉及的交织拼合,是先把用于交织拼合的两个音素在五线谱上的上下位置定好(即先作定位),再进行交织拼合。以上描述交织拼合时涉及的物件里是以没有其它附加信息为例的,如物件里还有其它附加信息(比如音色,节奏信息,操作信息),那仍可象以上所述这样作交织拼合,拼合得到的结果里按约定一般也相应含有这些其它附加信息,拼合得到的结果仍可用于发音或用于其它拼合;用于交织拼合的物件以及交织拼合得到的结果一般都属于同一个声道,本段以上所述用于交织拼合的物件以及交织拼合得到的结果是以都属于同一个声道为例的;显然在编码用的编码数据和解码用的解码数据里一般要对定义的用于交织拼合的音素进行注明这些音素是用于交织拼合的,在编码用这些音素时及解码用这些音素时再采取相应处理;甚至可以是把对音素(可以是本来是用于交织拼合的音素,也可以是本来不是用于交织拼合的音素)进行修改等处理后得到的类音素来进行交织拼合。用音素来拼合乐段发音中涉及到的组拼形式,除了以上述的顺序拼合,交织拼合外,还可能用别的组拼形式。本段各处提到的‘物件’,可以是未定位或定位的音素,或是类音素等。Use phonemes to assemble the pronunciation of a passage, which generally involves combination operations. The conventional combination method is sequential combination, that is, for example, to arrange two objects together in sequence (any of the two objects mentioned here can be a phoneme , or phoneme, etc., for example, the two objects mentioned here are phonemes, or one is a phoneme obtained by copying the previous phoneme and the other is a phoneme), and the result of this order can be used for pronunciation, such as The sequence of these two objects in the result is converted into pronunciation and played (here the pronunciation of these two objects is in sequence, for example, the pronunciation of these two objects belongs to the same channel and is pronounced in sequence), or It is to convert the two objects in the result into pronunciations and play them out (the pronunciations of these two objects belong to different channels and are pronounced at the same time), or you can combine the results of this order (that is, the order is arranged in the These two objects together) are used for the merging of the merging step after this sequential merging process. Sequential merging can also be to arrange multiple objects together in sequence. There is no other additional information in the objects involved in the sequential merging described above For example, if there are other additional information (such as timbre, rhythm information, and operation information) in the object, it can still be assembled sequentially as described above. The result of the assembly generally also contains these other additional information as agreed (For example, one phoneme contains the information of certain tones and other additional information such as the timbre information of these certain tones, and another phoneme contains the information of other several tones and contains other additional information such as the timbre information of these other tones Timbre information, the result obtained by combining the two phonemes in sequence contains information about certain tones, timbre information about certain tones, and information about other tones and timbres of these other tones. Information), the results obtained by the combination can still be used for pronunciation or for other combinations. The phonemes and phonemes used in the above sequence combination can be positioned or not positioned. The result obtained by the combination can be self-positioned. It can also be without self-positioning; the assembling form involved can also be interweaving and merging, that is, for example, to arrange two objects together in sequence (any of the two objects mentioned here can be a phoneme, or a phoneme-like, etc., For example, the two objects mentioned here are both phonemes, or one is a phoneme obtained by copying the previous phoneme and the other is a phoneme), and the pronunciation of at least one sound in the front object is the pronunciation of a certain sound in the back object (that is, at least one sound in the front object is inserted into the back object), the specific situation may be that the sound in the front object is inserted into the back object and the sound in the back object is inserted into the front object, or it may be only The sound in the front object is inserted into the back object, or it may be that only the sound in the back object is inserted into the front object, etc. When decoding, arrange the front and back two objects together in this way to form a A new phone-like element (any part of this new phone-like element can also be regarded as a phone-like element), and this new phone-like element (or any part thereof, and any part of which can also be regarded as a phone-like element) can be used for pronunciation (such as step-by-step pronunciation Step only sends out the sound of this new class phoneme, or such as this new class phoneme is exactly the whole decoding result that decodes and obtains when decoding with the full-step pronunciation mode and can press all or part of the pronunciation of this new class phoneme playing music section, The new phoneme in the situation mentioned in the above two sentences is self-positioning as an example), it can also be used for the stitching of the stitching steps behind this interweaving stitching, and the interweaving stitching can also be to arrange multiple objects in order Together, they are put together, and after being put together, each of the multiple objects has sounds inserted into other objects in the multiple objects; the phonemes and phonemes used in the above interweaving and stitching are generally all Positioned, but it can also be unlocated, and the result obtained by stitching can be self-positioned or not; for example, Figure 15 shows an example, and 8 phonemes are obtained by interleaving and stitching these two positioned phonemes The combination is just like the combination of the 8 sounds behind the phoneme 56 (3) of channel 2 shown in Figure 14, the first sound of the second positioned phoneme in Figure 15 is inserted into the first one when interweaving and stitching together Go in the phoneme of positioning, before doing interweaving and stitching here, there are two positioned phonemes for interweaving and stitching to determine the upper and lower positions on the staff, that is, the upper and lower positions are as shown in Figure 15; if used for interweaving The objects to be combined are all phonemes. When interweaving and combining, the upper and lower positions of each phoneme used for interweaving and combining are generally determined first, and then the order is followed, and then the interweaving and combining is carried out according to the agreed rules; the coding example 2 and decoding at the end of this manual For the interweaving and stitching involved in Example 2, the upper and lower positions of the two phonemes used for the interweaving and stitching on the stave are determined (that is, the positioning is performed first), and then the interweaving and stitching is carried out. The above description of the objects involved in the interweaving and stitching is an example where there is no other additional information. If there are other additional information (such as timbre, rhythm information, and operation information) in the object, then the interweaving and stitching can still be performed as described above. The results obtained by stitching generally also contain these other additional information as agreed, and the results obtained by stitching can still be used for pronunciation or other stitching; the objects used for interweaving and stitching and the results of interweaving and stitching generally belong to the same channel. The objects used for interleaving and stitching mentioned above in this paragraph and the results obtained by interleaving and stitching are all taken as examples of the same channel; obviously, in the encoded data for encoding and the decoded data for decoding, it is generally necessary to define the used Interweave and combine phonemes to indicate that these phonemes are used for interweaving and combining, and then take corresponding processing when encoding and decoding these phonemes; , can also be the phonemes that are not originally used for interweaving and mosaic) after modification and other processing to perform interweaving and mosaic. Use phonemes to put together the grouping forms involved in the pronunciation of the passages. In addition to the above-mentioned order and interweaving, other grouping forms may also be used. The 'object' mentioned throughout this paragraph may be an unlocated or located phoneme, or a phoneme-like, etc.

本说明书所说的定位,是指五线谱上的上下位置定位。The positioning mentioned in this manual refers to the positioning of the upper and lower positions on the staff.

在顺序拼合,交织拼合等组拼形式里涉及用到的音素(这里说到的音素不局限于编码规则和解码规则里定义的音素),可以是未定位的,即没确定其含有的任一音在五线谱上的垂直方向哪个位置,也可以是定位的,即有确定其含有的每个音在五线谱上的垂直方向哪个位置,定位的情况可能是音素里本来就有其含有的每个音的音高值(即音素本身就是定位的,本申请文件称之为自定位),也可能是象后面编码举例1的拼合信息的值里每个音素的音素编号值前有此音素的上下位置值1,由这音素的上下位置值1来确定此音素的上下位置也进而可结合此音素里的音的构成情况来确定(这里作确定时还要根据被编码乐段的这个音素所处声道的调式及谱号)此音素里含有的每个音的音高值(本申请文件称此为被定位),再可用此被定位的音素来作与拼合有关的处理;在顺序拼合,交织拼合等组拼形式里涉及用到的类音素,可以是未定位的,即没确定其含有的任一音在五线谱的垂直方向哪个位置,也可以是定位的,即有确定其含有的每个音在五线谱的垂直方向哪个位置,且定位可以是自定位或被定位(这种情况类音素的自定位和被定位与本段以上述的音素的自定位,被定位情况类似);在各拼合相关处理形式里涉及用到的音素(这里说到的音素不局限于编码规则和解码规则里定义的音素),可以是未定位的,即没确定其含有的任一音在五线谱的垂直方向哪个位置,也可以是定位的,即有确定其含有的每个音在五线谱的垂直方向哪个位置,且定位可以是自定位或被定位(这种情况音素的自定位和被定位与本段以上述的组拼形式里涉及到的音素的自定位,被定位情况类似);在各拼合相关处理形式里涉及用到的类音素,可以是未定位的,即没确定其含有的任一音在五线谱的垂直方向哪个位置,也可以是定位的,即有确定其含有的每个音在五线谱的垂直方向哪个位置,且定位可以是自定位或被定位(这种情况类音素的自定位和被定位与本段以上述的组拼形式里涉及到的音素的自定位,被定位情况类似)。(本段所说的类音素是含有音的信息的)The phonemes involved in the combination forms such as sequence stitching and interweaving stitching (the phonemes mentioned here are not limited to the phonemes defined in the coding rules and decoding rules), can be unlocated, that is, no one is determined to be contained in it. The vertical position of the sound on the staff can also be positioned, that is, the vertical position of each sound contained in it can be determined. The situation of positioning may be that each sound contained in the phoneme itself The pitch value (that is, the phoneme itself is positioned, this application document is called self-positioning), it may also be the upper and lower positions of the phoneme before the phoneme number value of each phoneme in the value of the combined information in the following coding example 1 The value is 1, the up and down position of this phoneme is determined by the value 1 of the upper and lower position of this phoneme, and the up and down position of this phoneme can also be determined in conjunction with the composition of the sound in this phoneme (here, it is also determined according to the sound of the phoneme in the coded section). mode and clef of the road) the pitch value of each sound contained in this phoneme (this application document is referred to as being positioned), and then this positioned phoneme can be used for processing related to splicing; The phonemes used in combination forms such as splicing can be unlocated, that is, the position of any sound contained in it in the vertical direction of the staff has not been determined, or it can be positioned, that is, each sound contained in it has been determined. Where is the sound in the vertical direction of the staff, and the positioning can be self-positioning or being positioned (in this case, the self-positioning and being positioned of the class phoneme are similar to the self-positioning of the above-mentioned phoneme in this section, and the situation of being positioned is similar); The phonemes involved in the relevant processing forms (the phonemes mentioned here are not limited to the phonemes defined in the encoding rules and decoding rules) can be unlocated, that is, it is not determined which of any sound it contains in the vertical direction of the staff The position can also be positioned, that is, it is determined which position of each sound it contains in the vertical direction of the staff, and the positioning can be self-positioned or positioned (in this case, the self-positioning and being positioned of the phoneme are the same as the above-mentioned in this paragraph. The self-positioning of the phonemes involved in the group spelling form is similar to the situation of being positioned); the class phonemes used in the relevant processing forms of each spelling can be unlocated, that is, it is not determined that any sound contained in it is in the stave Which position in the vertical direction can also be positioned, that is, there is a determination of which position in the vertical direction of each sound it contains, and the positioning can be self-positioned or positioned (in this case, the self-positioned and positioned of the class phoneme Similar to the self-positioning of the phonemes involved in the above-mentioned combination form in this paragraph, the situation of being positioned is similar). (The phonemes mentioned in this paragraph contain sound information)

本说明书有本段约定,如音素里含有多个音的信息,那被定位后的音素里必须是有对这多个音各自都有作定音高值;如类音素里含有多个音的信息,那被定位后的类音素里必须是有对这多个音各自都有作定音高值。This manual has the agreement in this paragraph, if the phoneme contains information of multiple tones, then the positioned phoneme must have pitch-setting values for each of the multiple tones; if the phoneme contains multiple tones information, the positioned phoneme must have a fixed pitch value for each of these multiple tones.

如提及时无特别指明有他意,本说明书各处提及的定位的音素,默认可以是自定位的音素或被定位的音素;如提及时无特别指明有他意,本说明书各处提及的定位的类音素,默认可以是自定位的类音素或被定位的类音素;各组拼形式得到的结果里,如含有数个音的信息,如果这数个音都各自有确定的音高值,那这结果也算是自定位的,如果这数个音中至少有一音没有确定的音高值,那这结果不算是自定位的而是未定位的。If no other meaning is specified when mentioned, the positioned phoneme mentioned in this manual can be a self-located phoneme or a positioned phoneme by default; The default phoneme can be a self-positioned phoneme or a localized phoneme; in the result of each combination, if it contains information about several tones, if these tones have their own definite pitch values, Then this result is also regarded as self-localization, if at least one of these several tones has no definite pitch value, then this result is not self-localization but unlocalization.

根据具体情况,不论用哪种组拼形式,其操作涉及的数据形式可以是多种多样的,比如顺序拼合时,用两个音素来拼合(这两个音素都是未定位的),用于拼合的是这两个音素各自的音素编号,那可以是把这两个音素的音素编号顺序排列起来而成拼合结果(这里拼合结果是没自定位的),或是把这两个音素都转换成它们各自的谱格式数据(先对音素作定位再转成谱格式数据),把这两个音素的谱格式数据顺序排列起来而成拼合结果(拼合结果是自定位的),再比如顺序拼合时,用一个音素和一个类音素来拼合,用于拼合的可以是这个音素的音素编号和这个类音素的代号(比如这个类音素是复制另一音素,那可用这另一音素的音素编号来作为这个类音素的代号),那可以是把这个音素的音素编号和这个类音素的代号顺序排列起来而成拼合结果,或是把这个音素和这个类音素都转换成它们各自的谱格式数据(先对它们各自进行定位再各自转成谱格式数据),把他们各自的谱格式数据顺序排列起来而成拼合结果(拼合结果是自定位的),再比如顺序拼合时,用一个未定位的音素和一个定位的类音素来拼合,用于拼合的可以是这个音素的音素编号和这个定位的类音素的谱格式数据,那可以是把这个音素的音素编号和这个定位的类音素的谱格式数据顺序排列起来而成拼合结果(拼合结果不是自定位的),或是把这个音素转换成它的谱格式数据(先对音素进行定位再转成谱格式数据),把这个音素的谱格式数据和这个类音素的谱格式数据顺序排列起来而成拼合结果(拼合结果是自定位的),再比如交织拼合的结果(结果即类音素),其可以是谱格式数据的形式,另也可以在交织拼合的最后把得到的谱格式数据转成一个或多个音素编号加上这些音素的上下位置值3来当成交织拼合的结果。According to the specific situation, no matter which combination form is used, the data forms involved in its operation can be various. For example, when combining sequentially, two phonemes are used to combine (these two phonemes are not located), and the What is merged is the respective phoneme numbers of the two phonemes, which can be obtained by arranging the phoneme numbers of the two phonemes in sequence (here, the merged result is not self-positioned), or the two phonemes are converted into their respective spectral format data (the phonemes are first positioned and then converted into spectral format data), and the spectral format data of these two phonemes are arranged in order to form a combined result (the combined result is self-positioning), and another example is sequential combination When combining a phoneme and a phoneme-like element, the combination can be the phoneme number of the phoneme and the code name of the phoneme-like element (for example, if the phoneme-like element is copied from another phoneme, then the phoneme number of the other phoneme can be used to As the code name of this phoneme), that can be the result of arranging the phoneme number of this phoneme and the code name of this phoneme in sequence, or converting this phoneme and this phoneme into their respective spectral format data ( Position them first and then convert them into spectral format data), arrange their respective spectral format data sequentially to form a combined result (the combined result is self-positioning), and for example, when combining sequentially, use an unlocated phoneme Combined with a positioned phoneme, the combination can be the phoneme number of the phoneme and the spectral format data of the positioned phoneme, that can be the phoneme number of the phoneme and the spectral format data of the positioned phoneme Arranged in order to form a stitching result (the stitching result is not self-positioning), or convert this phoneme into its spectral format data (first locate the phoneme and then convert it into spectral format data), combine the spectral format data of this phoneme and The spectral format data of this phoneme is arranged in order to form a stitching result (the stitching result is self-positioning), and another example is the result of interleaving stitching (the result is a phoneme), which can be in the form of spectral format data, and can also be interleaved. At the end of the stitching, convert the obtained spectrum format data into one or more phoneme numbers plus the upper and lower position values of these phonemes 3 as the result of interweaving and stitching.

上段说的谱格式数据,就是对数个音的组合(数个音的组合也是音组合),用记谱的方式来记录这数个音的组合中各音的音高,音长,发音时刻等情况,而成的数据,并且还可把定位的音素里的音色等信息也编入谱格式数据里;将定位的音素或定位的类音素转成谱格式数据时一般要使用拼合信息的值里的被编码乐段的此定位的音素或定位的类音素所处声道的此定位的音素或定位的类音素所处位置(一般是所处五线谱相应声道的那小节里)的节奏,调式,谱号等信息(节奏,调式,谱号等信息甚至可以是处在定位的音素或类音素里)。本段所说的‘数个音的组合’中,‘数个’是修饰‘音’的,且本段说的数个音的组合可以就是一个音。The spectrum format data mentioned in the previous paragraph is the combination of several tones (the combination of several tones is also a combination of tones), and the pitch, length and pronunciation time of each sound in the combination of several tones are recorded by notation And so on, the data formed, and information such as timbre in the positioned phoneme can also be compiled into the spectral format data; when converting the positioned phoneme or positioned phoneme-like into spectral format data, the value of the combined information is generally used The rhythm of the location of the positioned phoneme or the location of the location of the location of the location of the phoneme or the location of the location of the location of the location of the coded section of the phoneme or location of the location of the location of the phoneme or the location of the location of the location of the phoneme (usually in the section of the corresponding channel of the stave), Mode, clef and other information (rhythm, mode, clef and other information can even be in the positioned phoneme or phoneme). In the "combination of several tones" mentioned in this paragraph, "several" modifies the "sound", and the combination of several tones mentioned in this paragraph can be one sound.

对于本说明书所说的按编码规则编码生成表示如何用音素来拼合乐段发音的拼合信息的值,其中说的‘如何用音素来拼合乐段发音’是指按解码规则解码时如何用音素来拼合乐段发音,且解码是按前面两句说的拼合信息的值来解码的,前面3句里说的解码规则是与前面3句里说的编码规则专门对应的,此解码规则如实际不存在(指没写在纸上等)那也是理论上存在的。对于本说明书的类似本段第1句的说法,也按本段这样理解。For the value of the combination information that represents how to use phonemes to combine the pronunciation of the passage by encoding according to the coding rules mentioned in this manual, the "how to use the phonemes to combine the pronunciation of the passage" refers to how to use the phonemes when decoding according to the decoding rules. The pronunciation of the segment is combined, and the decoding is based on the value of the combined information mentioned in the previous two sentences. The decoding rules mentioned in the previous three sentences are specifically corresponding to the encoding rules mentioned in the previous three sentences. If the decoding rules are different in practice Existence (refers to not written on paper, etc.) that also exists theoretically. As for the statement similar to the first sentence of this paragraph in this manual, it should also be interpreted according to this paragraph.

解码过程中使用的音素,是在解码时从拼合信息的值中可直接得到(而不需作修改,复制操作,不用组拼形式拼合等处理来得到)的(比如后面编码举例1和解码举例1里涉及对图13的乐段的拼合信息的值的编解码,图13中声道2中第1个音素56是从拼合信息的值中可直接得到的即可从拼合信息的值中直接得到图13中声道2中第1个音素56的音素编号,声道2后面的那些音是复制前面那些音而得,解码时从拼合信息的值中不能直接得到声道2中后面的那些音),而如是把一些数据作像加密这样的变换后将变换结果放在拼合信息的值里(此变换结果是此拼合信息的值的全部或部分),解码时从这拼合信息的值得到变换结果并对此变换结果作逆变换,从逆变换的结果里可得到一个或多个音素(比如从逆变换的结果里可取出一个或多个音素编号),那这也算是从这拼合信息的值里可直接得到这一个或多个音素;本段以上说到的解码过程中使用的音素,也是解码时所使用的解码规则里定义的音素;关于类音素的定义,类音素是信息的集合,解码时对从拼合信息的值中直接得到(直接得到是指不需作修改,复制操作,不用组拼形式拼合等处理来得到,本段以下提到的直接得到的意思与此同)的一个音素进行处理(比如用拼合相关处理形式,改音色等处理,这音素本身可以是自定位的也可以不是自定位的,且处理前有对此音素作定位或没作定位)后得到的结果的全部是一个类音素,结果的任意部分也是一个类音素,解码时对从拼合信息的值中直接得到的多个音素进行处理(比如用组拼形式拼合等处理,这些音素各自本身可以是自定位的也可以不是自定位的,且处理前有对这些音素各自作定位或没作定位)后得到的结果的全部是一个类音素,结果的任意部分也是一个类音素,解码时对数个类音素与从拼合信息的值中直接得到的数个音素进行处理(比如用组拼形式拼合,或先对某音素作修改再用组拼形式拼合等处理,这些音素,类音素,各自本身可以是自定位的也可以不是自定位的,且处理前有对这些类音素,音素各自作定位或没作定位)后得到的结果的全部是一个类音素,结果的任意部分也是一个类音素,解码时对一个类音素进行处理(比如用拼合相关处理形式,改音色等处理,这个类音素本身可以是自定位的也可以不是自定位的,且处理前有对这个类音素作定位或没作定位)后得到的结果的全部是一个类音素,结果的任意部分也是一个类音素,解码时对多个类音素进行处理(比如用组拼形式拼合等处理,这些类音素各自本身可以是自定位的也可以不是自定位的,且处理前有对这些类音素各自作定位或没作定位)后得到的结果的全部是一个类音素,结果的任意部分也是一个类音素。(比如后面编码举例1和解码举例1里涉及对图13的乐段的拼合信息的值的编解码,图13中声道2中第1个音素56是从拼合信息的值中可直接得到的即可从拼合信息的值中直接得到图13中声道2中第1个音素56的音素编号,且解码时对图13中声道2中的第1个音素56先作定位再用复制操作得到后面3个自定位的类音素,且这每个自定位的类音素中的音的构成情况与图13中声道2中第1个音素56中的音的构成情况相同)。类音素可以是自定位的,也可没自定位的;类音素一般是只包含数个音的信息(比如一个自定位的类音素里音的构成情况与一个定位的音素里的音的构成情况相同),类音素还可以是其它形式比如既包含数个音的信息还包含一些其它附加信息比如音色信息,操作信息。本段以上各处提到的‘结果的任意部分’,可以是结果的连续的部分(比如结果是简谱‘135’,此结果的连续的部分可以是简谱‘35’),也可以是结果的不连续的部分(比如结果是简谱‘135’,此结果的不连续的部分可以是此结果的第1个音‘1’加第3个音‘5’,本括号内的引号里的数字都是指简谱)。The phonemes used in the decoding process can be directly obtained from the value of the combined information during decoding (without modification, copy operation, and without combination of combined form and other processing) (such as the following encoding example 1 and decoding example 1 relates to the encoding and decoding of the value of the combined information of the music section in Figure 13, the first phoneme 56 in the channel 2 in Figure 13 can be directly obtained from the value of the combined information, that is, directly from the value of the combined information Obtain the phoneme number of the first phoneme 56 in the sound channel 2 in Fig. 13, those sounds behind the sound channel 2 are obtained by duplicating the previous sounds, and those behind the sound channel 2 cannot be directly obtained from the value of the combined information during decoding sound), and if some data is transformed like encryption, the transformation result is placed in the value of the combined information (the result of this transformation is all or part of the value of the combined information), and it is obtained from the value of the combined information when decoding. Transform the result and perform inverse transformation on the transformation result, one or more phonemes can be obtained from the result of the inverse transformation (for example, one or more phoneme numbers can be taken out from the result of the inversion transformation), then this can also be regarded as the combination information The one or more phonemes can be directly obtained from the value of this paragraph; the phonemes used in the decoding process mentioned above in this paragraph are also the phonemes defined in the decoding rules used in decoding; about the definition of phonemes, phonemes are information Set, when decoding, it is directly obtained from the value of the combined information (directly obtained means that it can be obtained without modification, copy operation, or combination of combination forms, etc. The meaning of direct acquisition mentioned below in this paragraph is the same) It is obtained after processing a phoneme (such as combining related processing forms, changing timbre, etc., the phoneme itself may or may not be self-positioning, and the phoneme has been positioned or not positioned before processing) All of the result is a phoneme, and any part of the result is also a phoneme. When decoding, a plurality of phonemes directly obtained from the value of the combined information are processed (such as processing such as combining with a combination form, and these phonemes can be each themselves. Self-positioning can also not be self-positioning, and these phonemes are respectively positioned or not positioned before processing), all of the results obtained after the result are a class of phonemes, and any part of the result is also a class of phonemes. When decoding, logarithms The class phoneme is processed with several phonemes obtained directly from the value of the combined information (for example, combined in a combined form, or first modifying a certain phoneme and then combined in a combined form). It may or may not be self-positioning, and before processing, these phonemes and phonemes have been positioned or not positioned respectively), and the result obtained after all is a phoneme, and any part of the result is also a phoneme, decoding When processing a phoneme (for example, by combining related processing forms, changing timbre, etc., this phoneme itself can be self-positioned or not, and this phoneme has been positioned or not positioned before processing. ) is all a phoneme, and any part of the result is also a phoneme. When decoding, a plurality of phonemes are processed (for example, combined with a combination of forms, etc., each of these phonemes can be Self-positioning may not be self-positioning, and before processing, these phonemes are positioned separately or not positioned), all of the results obtained after are a phoneme, and any part of the result is also a phoneme. (For example, in the following encoding example 1 and decoding example 1, it involves the encoding and decoding of the value of the mosaic information of the music section in Figure 13, the first phoneme 56 in the channel 2 in Figure 13 can be directly obtained from the value of the mosaic information The phoneme number of the first phoneme 56 in channel 2 in Figure 13 can be directly obtained from the value of the combined information, and the first phoneme 56 in channel 2 in Figure 13 is first positioned and then copied during decoding The following three self-localized phonemes are obtained, and the composition of the sound in each self-localized phoneme is the same as the composition of the sound in the first phoneme 56 in the channel 2 in Fig. 13). A phoneme can be self-localized or not self-localized; a phoneme generally only contains the information of several sounds (such as the composition of a self-positioned phoneme and the composition of a localized phoneme). The same), the phoneme can also be in other forms such as information containing several tones and some other additional information such as timbre information and operation information. The "any part of the result" mentioned above in this paragraph can be a continuous part of the result (for example, if the result is numbered musical notation '135', the continuous part of this result can be numbered musical notation '35'), or the result The discontinuous part (for example, the result is numbered notation '135', the discontinuous part of this result can be the first sound '1' plus the third sound '5' of this result, the numbers in the quotation marks in the brackets are all refers to the simplified notation).

本说明书所说的类音素,可能含有数个音的信息,也可能不含任一音的信息(比如此结果只含整个被编码乐段的节奏信息)且此类音素仍可能被用于后面的拼合;本说明书各处说到用类音素来发音,或述及到类音素里的音时,或述及到类音素的上下位置时,是默认类音素里是含有数个音的信息的。本说明书各处述及组拼形式,拼合相关处理形式的处理时,提到的‘结果’,都可被看成是类音素。The phonemes mentioned in this specification may contain information of several tones, or may not contain information of any tones (for example, the result only contains the rhythm information of the entire coded section), and such phonemes may still be used later When it is mentioned in this manual that the phoneme is used to pronounce, or when it refers to the sound in the phoneme, or when it refers to the upper and lower positions of the phoneme, it is defaulted that the phoneme contains information of several tones . This instruction manual mentions the combination form everywhere, and when combining the processing of related processing forms, the 'result' mentioned can be regarded as phoneme-like.

一般可由类音素里的信息看出这个类音素里包含究竟多少个音的信息(比如类音素里只包含两个音的音长值,那一般就可按约定推出这个类音素里只包含2个音的信息),另也可采用等同形式,在类音素里有放一些BIT来表示这个类音素里包含究竟多少个音的信息,比如用二进制的3表示类音素里包含3个音的信息;本段以上所说的类音素是含有音的信息的类音素;如提及时无特别指明,本说明书各处提及的含有音的信息的任一个类音素,默认是以在这个类音素里没有放BIT来表示这个类音素里包含究竟多少个音的信息而是可由这个类音素里的信息看出这个类音素里包含究竟多少个音的信息为例的。Generally, information about how many sounds are included in the phoneme can be seen from the information in the phoneme (for example, the phoneme only contains the sound length value of two tones, then generally it can be deduced that only two phonemes are included in the phoneme according to the agreement. Tone information), the equivalent form can also be adopted, put some BITs in the phoneme to represent the information of how many tones are included in the phoneme, for example, use binary 3 to indicate the information of 3 tones in the phoneme; The phoneme mentioned above in this paragraph is the phoneme that contains the information of the sound; if it is not specified when it is mentioned, any phoneme that contains the information of the sound mentioned in this manual is defaulted to be that there is no phoneme in this phoneme. Put BIT to represent the information of exactly how many tones are included in this class of phonemes, but it can be seen from the information in this class of phonemes that how many tones are included in this class of phonemes as an example.

对一被定位的音素做复制操作,复制得到的那些音放在此被定位的音素的后面(复制时复制出来的音的音高值与原来被定位的音素里的对应音的音高值相同),复制得到的那些音的信息就是复制得到的类音素,这个类音素里的每个音都有确定的音高值,这个类音素是自定位的。Do a copy operation on a positioned phoneme, and the copied sounds are placed behind the positioned phoneme (the pitch value of the copied sound is the same as the pitch value of the corresponding sound in the original positioned phoneme) ), the information of those sounds that are copied is the copied phoneme, each sound in this phoneme has a certain pitch value, and this phoneme is self-positioning.

对一未定位的音素做复制操作,一般就是把这个音素的音素编号当成复制结果,复制得到的音素编号就是复制得到的类音素,这个类音素里的每个音都没有确定的音高值,这个类音素不是自定位的;然后可对这个类音素作定位处理(以下几句述的是对一个类音素作定位处理的举例),比如在拼合信息的值里放有一个上下位置值1以对解码过程中得到的这个类音素进行定位(比如对这个类音素进行定位是使这个类音素里的最先发音的所有音中的最低音的上下位置值为本句括号外说的上下位置值1),然后可根据这个上下位置值1和这个类音素里含有的各音的互相之间的音程关系数据(也就是前面说到的那个音素里含有的各音的互相之间的音程关系数据)来确定这个被定位的类音素里各音的音高值,这就得到一个被定位的类音素。上几句有说到可根据这个上下位置值1和这个类音素里含有的各音的互相之间的音程关系数据来确定这个被定位的类音素里各音的音高值,这里作确定时一般还要根据被编码乐段的这个被定位的类音素所处声道的调式及谱号,这样得到的音的音高值可以就是音的音调值,另在作确定时也可不根据被编码乐段的这个被定位的类音素所处声道的调式及谱号,这样得到的音高值可在后面随时被转成被编码乐段的实际的音高值,除非可从述及处推出有他意,本说明书其它地方述及的根据定位的类音素的上下位置值和这个类音素里含有的各音的互相之间的音程关系数据来确定这个定位的类音素里各音的音高值这类说法,也都默认可按本段第9行‘上几句’到本段第13行‘实际的音高值’所述那样理解。To copy an unlocated phoneme, generally the phoneme number of this phoneme is regarded as the copy result, and the copied phoneme number is the copied phoneme, and each sound in this phoneme has no definite pitch value. This phoneme is not self-positioning; then this phoneme can be positioned (the following sentences describe an example of positioning a phoneme), for example, an upper and lower position value above 1 is placed in the value of the combined information The phoneme obtained in the decoding process is positioned (for example, the phoneme is positioned so that the upper and lower position values of the lowest tone in all sounds pronounced first in this phoneme are the upper and lower position values mentioned outside the brackets in this sentence 1), then according to the mutual interval relationship data of each tone contained in this phoneme and this upper and lower position value (that is, the mutual interval relationship data of each tone contained in that phoneme mentioned above ) to determine the pitch value of each sound in this positioned phoneme, and this just obtains a positioned phoneme. The last few sentences have said that the pitch value of each tone in this positioned phoneme can be determined according to the upper and lower position value 1 and the interval relationship data between the tones contained in this phoneme. When making a determination here Generally also according to the mode and the clef of the channel where this positioned phoneme of the coded section is located, the pitch value of the sound obtained in this way can be the pitch value of the sound, and it can not be determined according to the coded value in addition. The key and clef of the channel in which this localized phoneme-like element of the passage is located, the pitch value obtained in this way can be converted to the actual pitch value of the encoded passage at any time later, unless it can be deduced from the reference For other purposes, as mentioned elsewhere in this manual, the pitch value of each tone in the positioned phoneme is determined according to the upper and lower position values of the positioned phoneme and the interval relationship data between the sounds contained in the phoneme. Such statements can also be understood by default as described in line 9 of this paragraph "a few sentences" to line 13 of this paragraph "actual pitch value".

拼合时对一个未定位的音素作定位处理,是指使这未定位的音素里的各音具有音高值,而且如这未定位的音素里含多个音的信息时可以是在拼合时给这未定位的音素里某一音设定音高值,并结合这未定位的音素里的各音互相之间的音程关系来得到这未定位的音素里的各音的音高值;拼合时对一个定位(自定位或被定位)的音素作定位处理,是指使这定位的音素里的各音具有新的音高值,而且如这定位的音素里含多个音的信息时可以是在拼合时给这定位的音素里某一音设定新的音高值,并结合这定位的音素里的各音互相之间的音程关系来得到这定位的音素里的各音的新的音高值;拼合时对一个未定位的类音素作定位处理,是指使这未定位的类音素里的各音具有音高值,而且如这未定位的类音素里含多个音的信息时可以是在拼合时给这未定位的类音素里某一音设定音高值,并结合这未定位的类音素里的各音互相之间的音程关系来得到这未定位的类音素里的各音的音高值;拼合时对一个定位(自定位或被定位)的类音素作定位处理,是指使这定位的类音素里的各音具有新的音高值,而且如这定位的类音素里含多个音的信息时可以是在拼合时给这定位的类音素里某一音设定新的音高值,并结合这定位的类音素里的各音互相之间的音程关系来得到这定位的类音素里的各音的新的音高值。Carrying out location processing to an unlocated phoneme during merging refers to making each sound in the unlocated phoneme have a pitch value, and as this unlocated phoneme contains multiple sound information, it can be given to the phoneme when merging Set the pitch value of a certain tone in the unlocated phoneme, and combine the interval relationship between each tone in the unlocated phoneme to obtain the pitch value of each tone in the unlocated phoneme; A positioning (self-positioning or being positioned) phoneme is done positioning processing, refers to making each sound in the phoneme of this positioning have a new pitch value, and when the phoneme of this positioning contains the information of multiple sounds, it can be combined Set a new pitch value for a certain tone in this positioned phoneme, and combine the interval relationship between each tone in this positioned phoneme to get the new pitch value of each tone in this positioned phoneme Carrying out location processing to an unlocated class phoneme during splicing refers to making each sound in this unlocated class phoneme have a pitch value, and as this unlocated class phoneme contains information of a plurality of sounds, it can be in When merging, set the pitch value for a certain sound in the unlocated phoneme, and combine the interval relationship between each sound in the unlocated phoneme to obtain the pitch value of each sound in the unlocated phoneme. Pitch value; during splicing, a positioning (self-positioning or being positioned) class phoneme is carried out to locate, and refers to making each sound in the class phoneme of this location have a new pitch value, and as contained in the class phoneme of this location The information of multiple tones can be obtained by setting a new pitch value for a certain sound in the positioned phoneme when merging, and combining the interval relationship between each sound in the positioned phoneme to obtain the positioning The new pitch value of each note in the class phoneme.

可对一个未定位的音素作定位处理(以下几句述的是对一个未定位的音素作定位处理的举例),比如在拼合信息的值里放有一个上下位置值1以对这个未定位的音素进行定位(比如对这个未定位的音素进行定位是使这个未定位的音素里的最先发音的所有音中的最低音的上下位置值为本句括号外说的上下位置值1),然后可根据这个上下位置值1和这个未定位的音素里含有的各音的互相之间的音程关系数据(这个未定位的音素里含有的各音的互相之间的音程关系数据也是这个被定位的音素里含有的各音的互相之间的音程关系数据)来确定这个被定位的音素里各音的音高值,这就得到一个被定位的音素。上几句有说到可根据这个上下位置值1和这个未定位的音素里含有的各音的互相之间的音程关系数据来确定这个被定位的音素里各音的音高值,这里作确定时一般还要根据被编码乐段的这个被定位的音素所处声道的调式及谱号,这样得到的音的音高值可以就是音的音调值,另在作确定时也可不根据被编码乐段的这个被定位的类音素所处声道的调式及谱号,这样得到的音高值可在后面随时被转成被编码乐段的实际的音高值,除非可从述及处推出有他意,本说明书其它地方述及的根据被定位的音素的上下位置值和这个音素里含有的各音的互相之间的音程关系数据来确定这个被定位的音素里各音的音高值这类说法,也都默认可按本段第5行‘上几句’到本段第10行‘实际的音高值’所述那样理解。An unlocated phoneme can be positioned (the following sentences describe an example of positioning an unlocated phoneme). Phoneme is positioned (for example, positioning this unlocated phoneme is to make the upper and lower position values of the lowest tone in all sounds that are first pronounced in this unlocated phoneme be the upper and lower position value 1 outside the parentheses of this sentence), and then According to this upper and lower position value 1 and the mutual interval relationship data of each sound contained in this unlocated phoneme (the mutual interval relationship data of each sound contained in this unlocated phoneme is also this positioned The mutual interval relationship data of each sound contained in the phoneme) to determine the pitch value of each sound in this located phoneme, and this just obtains a located phoneme. The last few sentences mentioned that the pitch value of each tone in this located phoneme can be determined according to the upper and lower position value 1 and the interval relationship data between each tone contained in this unlocated phoneme, and the determination is made here Generally also according to the mode and clef of the located phoneme of the coded section, the pitch value of the sound obtained in this way can be the pitch value of the sound, and it can also be determined not according to the coded phoneme. The mode and clef of the channel in which this localized phoneme-like element of the passage is located, the pitch value obtained in this way can be converted to the actual pitch value of the encoded passage at any time later, unless it can be deduced from the reference For other purposes, the pitch value of each tone in this positioned phoneme is determined according to the upper and lower position values of the positioned phoneme and the interval relationship data between the sounds contained in the phoneme as mentioned elsewhere in this manual. Such sayings can also be understood as described in the 5th line of this paragraph "a few sentences" to the 10th line of this paragraph "actual pitch value" by default.

本说明书述及音素,类音素,各种处理(比如顺序拼合,交织拼合,复制处理)得到的结果等时,涉及的定位处理,可以是多次进行的,比如对一未定位的音素,先作定位处理得到被定位的音素,接着又可再对被定位的音素作一次新的定位处理,比如是把这被定位的音素里的各音的音高再提高2个音步,得到的还是一个被定位的音素;再比如对一个自定位的音素,再进行新的定位,得到一个被定位的音素;再比如可对一个类音素作一次定位操作,再作一次定位操作,得到的还是一个被定位的类音素。When this manual refers to phonemes, phonemes, and various processing (such as sequence stitching, interweaving stitching, and copy processing), the positioning processing involved can be performed multiple times, such as for an unlocated phoneme. Do positioning processing to obtain the positioned phoneme, and then perform a new positioning process on the positioned phoneme again, for example, the pitch of each sound in the positioned phoneme is increased by 2 steps again, and the result is still A localized phoneme; another example is to perform a new localization on a self-localized phoneme to obtain a localized phoneme; another example is to perform one localization operation on a class of phonemes and another localization operation to obtain a The localized phoneme.

本说明书前面述及的自定位的音素,对未定位或定位的音素作定位这些内容时涉及的这些未定位或定位的音素里可以没有其它附加信息也可以有其它附加信息;如未定位或定位的音素里含有其它附加信息,那对这未定位或定位的音素进行处理(比如对其本身进行处理或与其它音素或类音素一起处理;处理形式可以是组拼形式或拼合相关处理形式等)而得到的类音素里,按约定也可含有这些其它附加信息的全部或部分;本说明书前面述及的自定位的类音素,对类音素作定位这些内容时涉及的这些类音素里可以没有其它附加信息也可以有其它附加信息;如未定位或定位的类音素里含有其它附加信息,那对这未定位或定位的类音素进行处理(比如对其本身进行处理或与其它音素或类音素一起处理;处理形式可以是组拼形式或拼合相关处理形式等)而得到的类音素里,按约定也可含有这些其它附加信息的全部或部分;本说明书前面述及的各种处理(比如顺序拼合,交织拼合,复制处理)得到的结果是自定位的内容时涉及的这结果里可以没有其它附加信息也可以有其它附加信息。The self-positioned phonemes mentioned earlier in this manual, when positioning the unlocated or positioned phonemes, there may be no other additional information or other additional information in these unlocated or positioned phonemes; such as unlocated or positioned If there are other additional information in the phoneme, then process the unlocated or located phoneme (such as processing it by itself or processing it together with other phonemes or phonemes; the processing form can be a combination form or a combination of related processing forms, etc.) And in the obtained phonemes, all or part of these other additional information may also be contained by agreement; the self-positioned phonemes mentioned above in this manual may not have other phonemes involved in these contents when positioning the phonemes. The additional information can also have other additional information; if the unlocated or positioned phoneme contains other additional information, then the unlocated or positioned phoneme is processed (for example, it is processed by itself or together with other phoneme or phoneme) processing; the processing form can be a grouping form or combining related processing forms, etc.), and can also contain all or part of these other additional information as agreed; the various processing mentioned above in this manual (such as sequential combination , interweaving and stitching, and copy processing) to obtain the result that is involved in the content of self-positioning. There may be no other additional information or other additional information in the result.

如一未定位或定位的音素里含有其它附加信息,那对此未定位或定位的音素进行定位后,一般不会改变其中的其它附加信息;如一未定位或定位的类音素里含有其它附加信息,那对此未定位或定位的类音素进行定位后,一般不会改变其中的其它附加信息。If an unlocated or positioned phoneme contains other additional information, the other additional information therein will generally not be changed after the unlocated or positioned phoneme is positioned; if an unlocated or positioned phoneme contains other additional information, After the unlocalized or localized phoneme is localized, generally other additional information therein will not be changed.

在编码规则里和解码规则里一般都有对音素的明确定义,另一方面,可从拼合信息的值里直接得到的用于拼合的那些有关数个音的信息,都应各自被看成是音素(当然这里说的要被看成音素的有关数个音的信息,应具有本说明书前面定义的音素的特点),且这些有关数个音的信息在拼合信息的值里一般是由不同编号来分别表示它们(这些编号也可被看成是音素编号,这些编号与编解码规则里本来定义用作音素编号的那些编号的编号方式一般不同)。本段提到的‘那些有关数个音的信息’中‘那些’是修饰‘有关数个音的信息’的,本段提到的‘这些有关数个音的信息’中‘这些’是修饰‘有关数个音的信息’的。There are generally clear definitions of phonemes in the encoding rules and decoding rules. On the other hand, the information about several sounds that can be directly obtained from the value of the combination information for combination should be regarded as each Phonemes (of course, the information about several tones to be regarded as phonemes here should have the characteristics of the phonemes defined earlier in this manual), and these information about several tones are generally represented by different numbers in the value of the combined information to represent them respectively (these numbers can also be regarded as phoneme numbers, and the numbering methods of these numbers are generally different from those originally defined as phoneme numbers in the codec rules). "Those" in the "information about several tones" mentioned in this paragraph are modified "information about several tones", and "these" in "the information about several tones" mentioned in this paragraph are modifiers 'Information about several tones'.

本申请文件各处提及的‘数个音的组合’,其中的‘数个’是修饰‘音’的,且数个可以是一个即‘数个音的组合’可以是指一个音。The 'combination of several tones' mentioned throughout this application document, 'several' wherein 'several' is modified 'sound', and several can be one, that is, 'combination of several tones' can refer to one sound.

本申请文件各处提及的‘类音素’,‘音素’是不同概念。The 'phoneme' and 'phoneme' mentioned in this application document are different concepts.

解码用音素拼合乐段发音是分一个拼合步骤或多个拼合步骤来完成拼合的,不论是一个拼合步骤或多个拼合步骤,至少有一个拼合步骤是要使用到音素的,使用音素的方式可有多种,比如直接把音素用于拼合,还可以是先把某音素做处理(比如修改处理)然后再将处理后得到的类音素来用于拼合,还可以是对某音素先作定位再做复制操作然后再将复制操作得到的类音素来用于拼合,还可以是把自定位的音素直接拿去发音(比如分步发音方式时),还可以是把未定位的音素进行定位然后把得到的被定位的音素拿去发音(比如分步发音方式时),等;在用音素拼合乐段发音的各拼合步骤中,除了使用音素外,还可能使用到一些别的信息(别的信息比如是有关怎样对某定位的音素中某音的音的音高进行升高或降低处理的信息),还可能使用到一些其它附加信息(音色,被编码乐段的节奏,调式,谱号信息,拼合涉及的操作信息等信息)。(本段各处提到的‘音素’是指解码时从拼合信息的值直接得到的音素;本段各处提到的用音素拼合乐段发音,是指用音素拼合乐段的全部发音)The pronunciation of the phoneme-joined segment for decoding is divided into one joint step or multiple joint steps to complete the joint. No matter it is one joint step or multiple joint steps, at least one of the joint steps needs to use phonemes. The way to use phonemes can be There are many kinds, such as directly using phonemes for merging, or processing a certain phoneme first (such as modification processing) and then using the phonemes obtained after processing for merging, or positioning a certain phoneme first and then Do the copy operation and then use the phonemes obtained by the copy operation for splicing. You can also take the self-positioned phonemes directly for pronunciation (such as in the step-by-step pronunciation method), or you can locate the unlocated phonemes and then put them together. The obtained positioned phoneme is used for pronunciation (such as in the step-by-step pronunciation mode), etc.; in each step of combining the pronunciation of the segment with the phoneme, in addition to using the phoneme, some other information (other information) may also be used For example, information about how to raise or lower the pitch of a sound in a localized phoneme), and may also use some other additional information (timbre, rhythm, mode, clef information of the coded section , information such as operation information involved in merging). (The 'phoneme' mentioned in this paragraph refers to the phoneme obtained directly from the value of the combined information during decoding; the pronunciation of the combined segment with the phoneme mentioned in this segment refers to the entire pronunciation of the combined segment with the phoneme)

在用某编码规则编码生成表示如何用音素来拼合乐段发音的拼合信息的值时,是暗含解码时按此拼合信息的值来用音素来拼合乐段发音要用一些拼合步骤(甚至暗含允许解码时可使用几套拼合步骤方案中任一套拼合步骤方案里的那些拼合步骤,这里不同拼合步骤方案里的拼合步骤不同)的意思的,在用与此编码规则专门对应的解码规则来对此拼合信息的值解码时,一般是按照前面述的暗含的那一些拼合步骤来解码的(本说明书后面的解码规则和编码规则就是这样的),或是按照前面述的几套拼合步骤方案中的某一套方案里的那些拼合步骤来解码的,但在用与此编码规则专门对应的解码规则或一般对应的解码规则来对此拼合信息的值解码时,也可能是按照另外一些拼合步骤来解码的(前面述的暗含的那一些拼合步骤不是这另外一些拼合步骤,前面述的几套拼合步骤方案中的任一套拼合步骤方案里的那些拼合步骤都不是这另外一些拼合步骤);并且如解码时用分步发音方式发音,那一般每次(即每步)是经过一个或多个拼合步骤后得到解码结果的一部分并将其拿去发音。本段以上所述是针对解码时播乐段的全部发音的,如解码时是播乐段的部分发音,那可以是把以上所述播乐段的全部发音用的那些拼合步骤直接或作修改后当成播乐段的部分发音用的那些拼合步骤,当然各拼合步骤里的处理一般也有改变;与解码时用分步发音方式来播乐段的全部发音相比,解码时用分步发音方式来播乐段的部分发音时,可能是少些拼合步骤并且有些步发音少用一些解码数据来发音,也可能拼合步骤不减少但有些步发音少用一些解码数据来发音,或是其它情况。When coding with a certain encoding rule to generate the value of the combination information representing how to use the phoneme to combine the pronunciation of the music section, it is implied that when decoding, the value of the combination information is used to combine the pronunciation of the music section with the phoneme. Some combination steps (even implicitly allowing During decoding, those stitching steps in any set of stitching step schemes in several sets of stitching step schemes can be used, here the meanings of the stitching steps in different stitching step schemes are different), and the decoding rules specially corresponding to this coding rule are used to deal with When the value of this combined information is decoded, it is generally decoded according to the above-mentioned implied combination steps (the decoding rules and encoding rules later in this specification are like this), or according to the above-mentioned sets of combination steps. It is decoded by those stitching steps in a certain set of schemes, but when decoding the value of the stitched information with the decoding rule specially corresponding to this encoding rule or the decoding rule generally corresponding, it may also follow some other stitching steps to be decoded (the above-mentioned implied merging steps are not these other merging steps, and those merging steps in any set of merging step schemes in the aforementioned several sets of merging step schemes are not these other merging steps); And as pronunciation with step-by-step pronunciation mode during decoding, then generally each time (that is, every step) is to obtain a part of the decoding result after one or more stitching steps and take it for pronunciation. The above in this paragraph is aimed at the whole pronunciation of the playing music section during decoding. If it is part of the pronunciation of the playing music section during decoding, it can be directly or modified by those stitching steps used for the entire pronunciation of the playing music section mentioned above. Afterwards, those combining steps used for partial pronunciation of the playing music section, of course, the processing in each joining step is generally changed; compared with using the step-by-step pronunciation method to play the entire pronunciation of the music section when decoding, the step-by-step pronunciation method is used when decoding When playing part of the pronunciation of the music section, it may be that there are fewer stitching steps and some steps are pronounced with less decoding data, or the stitching steps are not reduced but some steps are pronounced with less decoding data, or other situations.

在解码时,可能每个拼合步骤就是一个解码步骤,或是一个解码步骤中含多个拼合步骤,或是一个拼合步骤分散在多个解码步骤里,或是其它情况。至于如何定义每个拼合步骤里做什么,每个解码步骤做什么,这是根据需要来自定义的。During decoding, each merging step may be a decoding step, or one decoding step contains multiple merging steps, or one merging step is dispersed in multiple decoding steps, or other situations. As for how to define what to do in each stitching step and what to do in each decoding step, this is defined according to needs.

本申请文件约定,解码用音素拼合乐段发音时,在用音素拼合乐段发音的所有拼合步骤中用到的所有音素中,是至少有一个音素,此音素中是包含有不同发音时刻值的(前面述的3型音素就是这样,本说明书第49页以‘除非提及时有特别指明有他意,本’开头的那段里举例提到的前面两种音素也是这样的);更进一步,本申请文件所说的乐谱拼合方式,对于一个表示如何用音素来拼合乐段发音的拼合信息的值(也即一个乐段拼合信息的值),在用任一编码规则来编码生成此拼合信息的值时涉及使用的音素(这里说的音素是指上一句‘如何用音素来拼合乐段发音’中的音素)中是至少有一个音素,此音素中是包含有不同发音时刻值的;按任一解码规则对任一乐段拼合信息的值进行解码时涉及用于拼合此乐段发音的音素中是至少有一个音素,此音素中是包含有不同发音时刻值的;音素中包含有不同发音时刻值,那这音素是含多个音的信息的(本句提及的‘多’与后面两句提及的‘多个发音时刻值’里的‘多’不一定是相同数值),前面两句说到的不同的发音时刻值是指多个发音时刻值互不同(因为至少要有两个发音时刻值来比较才能说它们不同),这多个发音时刻值中任一发音时刻值,定是某一个音的发音时刻值(还可能是其它音的发音时刻值,这里说的其它音也是这音素里的音),且这某一个音是这音素里的音(即这音素含多个音的信息,这某一个音就是这多个音中的一个音,本括号里提及的‘多’与本段内前面提及的‘多个发音时刻值’里的‘多’不一定是相同数值),如这多个发音时刻值中的一个发音时刻值是这音素里的一个音的发音时刻值,这多个发音时刻值中的另一个发音时刻值是这音素里的另一个音的发音时刻值,那这两个音定是这音素里的不同音;比如一个发音时刻值与另一个发音时刻值是相差16分之1拍,8分之1拍,8分之3拍,4分之1拍或2分之1拍等;而如是一个五线谱中带冠音的音符,那这个音符与它带的冠音是在同一时间点激发发音的(发音时刻相同),在五线谱中,同一音高同样音长的音,如果在五线谱上不同水平位置连续记录了两次,那这两次音应看为是不同时间点激发的两个音(比如图8所示就是3个不同时间点激发的发音,可把图8中的3个音定义成一个音素,这3个音的发音时刻值都被记录在此音素里,此音素中包含有不同发音时刻值),图13中上面3个音素中最右边的一个音素56,是包含有不同时间点激发的2个发音的(由两个音符分别表示的),而图13中上面左边两个音素56,都是只包含一个时间点激发的1个发音的;本段以上所说的音素,可以包含有多个音各自的发音时刻值,各自的音长值,各音互相之间的音程关系数据,也可只包含有多个音各自的发音时刻值及各自的音长值,还可能有其它情况。作本段的约定,是为了把本说明书述的乐谱拼合方式与平常见的记录乐段谱数据的方式区别开来,因为记录乐段谱数据的方式也可以被看成是由一些音素来拼合,只是对于所使用的任一音素,其中的音(一个或多个音)都是同一时间点激发的。本段所说的激发,就相当于弹钢琴时的对一个键的一次按键动作。This application document stipulates that when decoding the pronunciation of a phoneme-joined passage, among all the phonemes used in all the joining steps of using the phoneme-joined passage to pronounce, there must be at least one phoneme, and this phoneme contains different pronunciation time values (This is the case with the above-mentioned type 3 phonemes, and this is also the case with the first two phonemes mentioned in the paragraph at the beginning of "unless it is specifically mentioned when it is mentioned, this" on page 49 of this manual); furthermore, this The sheet music combination method mentioned in the application document refers to the value of the combination information (that is, the value of a combination information) indicating how to use phonemes to combine the pronunciation of a section, and any encoding rule is used to encode the combination information. There is at least one phoneme in the phoneme used when the value is involved (the phoneme mentioned here refers to the phoneme in the previous sentence "how to use phonemes to put together the pronunciation of a passage"), and this phoneme contains different pronunciation moment values; When a decoding rule decodes the value of any segment combination information, it involves at least one phoneme in the phonemes used to combine the pronunciation of this segment, and this phoneme contains different pronunciation time values; the phonemes contain different pronunciations Time value, then this phoneme contains information of multiple sounds (the "multiple" mentioned in this sentence and the "multiple" in the "multiple pronunciation time values" mentioned in the next two sentences are not necessarily the same value), the front The different pronunciation time values mentioned in the two sentences mean that a plurality of pronunciation time values are different from each other (because at least two pronunciation time values are to be compared to say that they are different), any pronunciation time value in these multiple pronunciation time values, It must be the pronunciation time value of a certain sound (it may also be the pronunciation time value of other sounds, and the other sounds mentioned here are also sounds in this phoneme), and this certain sound is a sound in this phoneme (that is, how many times this phoneme contains) The information of a sound, this certain sound is one of the multiple sounds, the "many" mentioned in this bracket is not necessarily the same as the "many" in the "multiple pronunciation time values" mentioned earlier in this paragraph is the same value), as one of the multiple pronunciation time values is the pronunciation time value of a sound in this phoneme, another pronunciation time value in these multiple pronunciation time values is another pronunciation time value in this phoneme The pronunciation time value of the sound, then these two sounds must be different sounds in this phoneme; for example, the difference between the pronunciation time value of one pronunciation time value and the other pronunciation time value is 1/16 beat, 1/8 beat, 3/8 beat , 1/4 beat or 1/2 beat, etc.; and if it is a note with a crown sound in a staff, then this note and its crown sound are stimulated at the same time point (same pronunciation time), in the staff Among them, if the sound with the same pitch and the same length is recorded twice consecutively at different horizontal positions on the stave, then these two sounds should be regarded as two sounds excited at different time points (for example, as shown in Figure 8, there are three Pronunciation stimulated at different time points, the three tones in Figure 8 can be defined as a phoneme, and the pronunciation time values of these three tones are all recorded in this phoneme, which contains different pronunciation time values), Figure 13 The rightmost phoneme 56 among the top three phonemes in the middle contains two pronunciations stimulated by different time points (represented by two notes respectively), while the top two phonemes 56 on the left in Fig. 13 6. They all contain only one pronunciation stimulated by one time point; the phoneme mentioned above in this paragraph can include the pronunciation time value of multiple sounds, their respective sound length values, and the interval relationship between each sound The data may also only include the respective pronunciation time values and respective sound length values of multiple sounds, and there may be other situations. The purpose of this paragraph is to distinguish the combination of music scores described in this manual from the usual way of recording music score data, because the way of recording music score data can also be regarded as combining some phonemes , but for any phoneme used, the tones (one or more tones) therein are all excited at the same time point. The excitation mentioned in this paragraph is equivalent to a key action on a key when playing the piano.

所以与平常见的记录乐段谱数据的方式相比,按本说明书述的乐谱拼合方式,按编码规则编码生成乐段拼合信息的值,一般是有压缩编码作用的(比如在所用的音素中,有音素是包含有在不同时刻发音的多个音的音长,音高,发音时刻值等信息,用一个音素编号表示此音素,与乐段谱数据里用一个编号只能表示一个音相比,显然一般更节省BIT),相应的,用解码规则按此乐段拼合信息的值解码时,一般就是有压缩解码作用的(因为编码时有压缩编码作用,所以解码时可看成有压缩解码作用)。Therefore, compared with the usual way of recording music score data, according to the music score combination method described in this manual, the value of the music piece combination information generated by coding according to the coding rules generally has a compression coding effect (for example, in the used phoneme , a phoneme contains information such as the length, pitch, and pronunciation time value of multiple sounds pronounced at different times. A phoneme number is used to represent this phoneme, which is similar to that in the music spectrum data. than, obviously generally saving BIT), correspondingly, when using the decoding rules to decode the value of the combined information according to this segment, it generally has the function of compression decoding (because there is a compression coding function during encoding, it can be regarded as compression during decoding) decoding function).

本申请文件约定,解码用音素拼合乐段发音时,是在用音素拼合乐段发音的所有拼合步骤中使用到多个音素的(比如:解码时,在拼合步骤1从拼合信息的值直接得到一个音素比如从拼合信息的值直接取出这个音素的音素编号,在拼合步骤2从拼合信息的值直接得到两个音素比如从拼合信息的值直接取出这两个音素各自的音素编号,这3个音素都拿来用于乐段拼合),也即是用多个音素拼合乐段发音,所以本申请文件所说的拼合信息的值就是表示如何用多个音素拼合乐段发音的拼合信息的值。本段以上各处提到的‘用音素拼合乐段发音’,是用音素拼合乐段的全部发音,如解码时只需播乐段的部分发音,那在解码的所有拼合步骤中可能是只使用到一个音素。(本段各处提到的‘音素’是指解码时从拼合信息的值直接得到的音素)。This application document stipulates that when decoding the pronunciation of a phoneme-joined section, multiple phonemes are used in all the steps of combining the pronunciation of the phoneme-joined section (for example: when decoding, it is directly obtained from the value of the combination information in the combination step 1. For example, the phoneme number of a phoneme is directly obtained from the value of the combination information, and two phonemes are directly obtained from the value of the combination information in the combination step 2. For example, the phoneme numbers of the two phonemes are directly obtained from the value of the combination information. These three The phonemes are all used for segment combination), that is, to combine the pronunciation of the segment with multiple phonemes, so the value of the combined information mentioned in this application document is exactly the value of the combined information indicating how to combine the pronunciation of the segment with multiple phonemes . The "synthesizing the pronunciation of the music section with phonemes" mentioned above in this paragraph is to use the phonemes to combine all the pronunciations of the music section. If only part of the pronunciation of the music section is played during decoding, then it may only be used in all the joining steps of decoding. used to a phoneme. (The 'phoneme' mentioned throughout this paragraph refers to the phoneme obtained directly from the value of the stitched information when decoding).

在编解码中,涉及的组拼形式可以是顺序拼合,交织拼合等。In encoding and decoding, the mosaic forms involved may be sequential mosaic, interleaved mosaic and so on.

在实际编解码中,还可采用一些拼合相关处理形式,比如:1,在编码规则用的编码数据里定义的音素中有一些音素(这一些音素可称为基本音素),对任一基本音素,都有一些它的变化形式,在解码规则用的解码数据里定义的音素中也有按这样定义的这一些基本音素,在编码生成拼合信息的值时,如遇到被编码乐段中有基本音素的变化形式,就在拼合信息的值里既记录基本音素的信息(比如基本音素的音素编号),也记录有关选择它的哪个变化形式来拼合的信息,解码时可以先根据拼合信息的值得到基本音素的信息,然后再根据拼合信息的值里的有关选择它的哪个变化形式来拼合的信息并结合解码数据(此解码数据就是指前面说到的解码数据)来得到使用的基本音素的实际变化形式(本句说的解码动作与上句说的解码动作可以处于同一个拼合步骤里或分处于不同的拼合步骤里或是其它情况),此得到的此基本音素的实际变化形式(类音素)可拿来用于发音(比如分步发音方式时的一步只发此基本音素的实际变化形式的音,或比如得到的此基本音素的实际变化形式就是解码用整步发音方式时解码得到的解码结果的全部并可按此基本音素的实际变化形式播乐段的全部或部分发音,以上两句提到的基本音素比如本身是自定位的),也可拿来用于解码时此处理后面的拼合,以上说的基本音素也是用于拼合乐段发音的音素,In actual encoding and decoding, some related processing forms of stitching can also be used, such as: 1, there are some phonemes in the phonemes defined in the coding data used by the coding rules (these phonemes can be called basic phonemes), and any basic phoneme , there are some variations. The phonemes defined in the decoding data used in the decoding rules also have these basic phonemes defined in this way. When encoding and generating the value of the combined information, if there is a basic The change form of the phoneme is to record the information of the basic phoneme (such as the phoneme number of the basic phoneme) in the value of the combination information, and also record the information about which change form to choose to combine it. When decoding, you can first base on the value of the combination information to the information of the basic phoneme, and then according to the information about which variation form to choose to combine in the value of the combined information and combined with the decoded data (this decoded data refers to the decoded data mentioned above) to obtain the basic phoneme used Actual change form (the decoding action that this sentence says and the decoding action that last sentence says can be in the same stitching step or be in different stitching steps or other situations), the actual change form (like Phoneme) can be used for pronunciation (for example, one step in the step-by-step pronunciation method only sends out the sound of the actual variation form of the basic phoneme, or such as the actual variation form of the basic phoneme that is obtained is decoded and obtained when the whole-step pronunciation method is used for decoding All of the decoding results of the basic phoneme can be played according to the actual variation of the basic phoneme or part of the pronunciation of the music section. The basic phoneme mentioned in the above two sentences is self-positioning, for example), and can also be used for this processing during decoding. In the following combination, the basic phonemes mentioned above are also the phonemes used to combine the pronunciation of the passage.

2:在编码生成拼合信息的值时,如遇到被编码乐段中有某个乐段成分(指数个音的组合)中的音的构成情况与对某音素(此某音素是在编码数据里定义好的)进行定位后得到的被定位的音素里的音的构成情况相似但有差别,那编码时可记录此某音素的信息(比如此某音素的音素编号)和此音素的上下位置值1,也记录此某个乐段成分中的音的构成情况与此被定位的某音素中的音的构成情况的差别信息,解码时可以先根据拼合信息的值直接得到此某音素的信息和此音素的上下位置值1而得到此被定位的音素(本句说的解码动作与下句说的解码动作可以处于同一个拼合步骤里或分处于不同的拼合步骤里或是其它情况),并结合从解码数据(解码数据里也有定义此某音素)得到的此某音素中的音的构成情况以及被编码乐段的此被定位的音素所处声道的调式及谱号等信息而得到此被定位的音素里的各音的音高值,然后再根据拼合信息的值里的此某个乐段成分中的音的构成情况与此被定位的某音素中的音的构成情况的差别信息来得到此某个乐段成分(这里得到的此某个乐段成分,也被称为类音素)中的音的构成情况(比如是在此被定位的某音素中的所有音的基础上进行增删一些音或是修改一些音的音高来得到此某个乐段成分中的音的构成情况,这里增删一些音可以是增删一些冠音或非冠音,增的音也可以是在新的时间点发的音,这里修改一些音的音高可以是修改一些冠音或非冠音的音高),得到的此某个乐段成分可拿来用于发音(比如分步发音方式时的一步只发此某个乐段成分的音,或比如得到的此某个乐段成分就是解码用整步发音方式时解码得到的解码结果的全部并可按此某个乐段成分播乐段的全部或部分发音),也可拿来用于解码时此处理后面的拼合,这里如果在差别信息中可直接得到音素的音素编号(比如某单音被定义成音素,差别信息中可得到某处增的一个此单音的音素编号),那这音素也算是可从拼合信息的值中直接得到的音素,这音素属于用于拼合乐段发音的音素,另外也可把拼合过程中得到的类音素来进行修改(比如在某位置删一个音,某位置加一个音),得到的也是类音素,得到的这个类音素可拿来用于发音,也可拿来用于解码时此处理后面的拼合,2: When encoding generates the value of the combined information, if there is a composition of the tone in a certain section component (combination of exponents of tones) in the encoded section, it is different from that of a certain phoneme (this certain phoneme is in the coded data) Defined in ) after positioning, the sound composition of the located phoneme obtained after positioning is similar but different, so the information of this certain phoneme (such as the phoneme number of this certain phoneme) and the upper and lower positions of this phoneme can be recorded during encoding The value is 1, and also records the difference information between the composition of the sound in this certain section component and the composition of the sound in the positioned phoneme. When decoding, you can directly get the information of this certain phoneme according to the value of the combined information. Get this positioned phoneme with the upper and lower position values of this phoneme (the decoding action of this sentence and the decoding action of the next sentence can be in the same stitching step or in different stitching steps or other situations), It is obtained by combining the composition of the sound in the certain phoneme obtained from the decoded data (the certain phoneme is also defined in the decoded data) and the mode and clef of the channel where the positioned phoneme of the coded segment is located. The pitch value of each sound in the positioned phoneme, and then according to the difference between the composition of the sound in this certain segment component in the value of the splicing information and the composition of the sound in a certain phoneme that is positioned information to obtain the composition of the sound in this certain piece of music (the certain piece of music obtained here is also called a phoneme) (for example, on the basis of all the sounds in a certain phoneme that is positioned here Add or delete some tones or modify the pitch of some tones to obtain the composition of the tones in this certain section composition. Here, adding and deleting some tones can be adding or deleting some crown tones or non-coronal tones, and the added tones can also be in the new The sound made at the time point, modifying the pitch of some sounds here can be modifying the pitch of some crown sounds or non-crown sounds), and the obtained part of this certain section can be used for pronunciation (such as when the pronunciation is in steps The first step only emits the sound of this certain music section component, or for example, the obtained certain music section component is the whole decoding result obtained by decoding when the whole-step pronunciation method is used, and the music section can be played according to this certain music section component All or part of the pronunciation), it can also be used for the combination of this processing after decoding, where if the phoneme number of the phoneme can be directly obtained in the difference information (for example, if a single sound is defined as a phoneme, a certain phoneme can be obtained in the difference information A phoneme number of this single tone added at the place), then this phoneme can also be regarded as a phoneme that can be directly obtained from the value of the combined information. Phoneme-like to modify (such as deleting a sound at a certain position, adding a sound at a certain position), and the resulting phoneme is also a similar phoneme, which can be used for pronunciation, and can also be used for decoding after this processing the combination of

3:对音素的复制,即被编码乐段中出现重复的音组合(此音组合有对应被定义成一个音素)时,在编码时,在拼合信息的值里,只记录一次这个音素的信息(比如此音素的音素编号),并记录这个音素的上下位置值1和一些复制信息,解码时可以先根据拼合信息的值直接得到记录的这个音素的信息和这个音素的上下位置值1而得到被定位的此音素,然后再根据拼合信息的值里的复制信息来复制此被定位的此音素而得到类音素(这句说的类音素就是本句说的被定位的此音素的复制品,本句说的解码动作与上句说的解码动作可以处于同一个拼合步骤里或分处于不同的拼合步骤里或是其它情况),得到的类音素是自定位的,得到的类音素可拿来用于发音(比如分步发音方式时的一步只发得到的类音素的音),也可拿来用于解码时此重复处理后面的拼合,且这种方式还可改进一下,即既记录复制的信息,也记录差别信息,以处理被编码乐段中相似的音组合,另外也可对拼合过程中得到的类音素进行复制,复制得到的也是类音素,复制得到的这个类音素可拿来用于发音,也可拿来用于解码时此处理后面的拼合;3: Copying the phoneme, that is, when there is a repeated sound combination in the coded segment (this sound combination is defined as a phoneme), the information of this phoneme is only recorded once in the value of the combined information during encoding (such as the phoneme number of this phoneme), and record the upper and lower position value 1 of this phoneme and some copy information. When decoding, you can directly obtain the recorded information of this phoneme and the upper and lower position value 1 of this phoneme according to the value of the combined information. The phoneme that is located, and then copy the phoneme that is located according to the copy information in the value of the combined information to obtain the phoneme (the phoneme mentioned in this sentence is the copy of the phoneme that is located in this sentence, The decoding action mentioned in this sentence and the decoding action mentioned in the previous sentence can be in the same stitching step or in different stitching steps or other situations), the obtained phoneme is self-positioning, and the obtained phoneme can be used It is used for pronunciation (for example, one step in the step-by-step pronunciation method only sends out the phoneme-like sound obtained), and it can also be used for the stitching after the repeated processing during decoding, and this method can also be improved, that is, both record and copy information, also record the difference information, in order to deal with the similar sound combination in the coded section, also can copy the phoneme obtained in the combination process, the copied phoneme is also the phoneme, the copied phoneme can be used It is used for pronunciation, and can also be used for the stitching after this processing during decoding;

还可能有除以上这3种拼合相关处理形式外的其它拼合相关处理形式(以上3种情况涉及音素时主要是以音素只包含数个音的信息而不包含其它附加信息为例的,如音素里除包含数个音的信息外还有比如音色等其它附加信息,那也可对定位或未定位的此音素进行修改,或复制等处理且这里说的修改甚至可以是修改定位或未定位的音素里的音色等信息,且这些处理得到的类音素里仍有这些比如音色等其它附加信息,处理得到的类音素可拿去发音或用于后面的拼合,这里说的处理形式就属于本句括号外说的其它拼合相关处理形式),本说明书前面述的在解码拼合时对音素或类音素进行定位,也可被看成是一种拼合相关处理形式;以上这些拼合相关处理形式在拼合时有对参加拼合的定位或未定位的音素作修改,或是按参加拼合的音素来作定位并产生比如复制生成类音素,或是对音素或类音素进行定位,或是其它情况,拼合时有用以上这些拼合相关处理形式时,仍是编码生成表示如何用音素来拼合乐段发音的拼合信息的值,解码时是按表示如何用音素来拼合乐段发音的拼合信息的值来解码发音。There may also be other combination-related processing forms other than the above three combination-related processing forms (when the above three cases involve phonemes, it is mainly based on the fact that the phoneme only contains the information of several sounds and does not contain other additional information, such as phoneme In addition to the information containing several sounds, there are other additional information such as timbre, which can also modify or copy the positioned or unlocated phoneme, and the modification mentioned here can even modify the positioned or unlocated phoneme. Information such as timbre in the phoneme, and these processed phonemes still have other additional information such as timbre, etc. The processed phoneme can be used for pronunciation or used for subsequent splicing. The processing form mentioned here belongs to this sentence Other collage-related processing forms outside the parentheses), the positioning of phonemes or phoneme-like elements when decoding and collating mentioned above in this manual can also be regarded as a collage-related processing form; the above-mentioned collage-related processing forms are It is useful to modify the phonemes that participate in the combination of positioning or non-location, or to locate and generate phonemes that participate in the combination, such as copying and generating phonemes, or to locate phonemes or phonemes, or other situations. During the above processing forms related to merging, encoding still generates the value of the merging information representing how to use phonemes to merge the pronunciation of the segment, and decoding is to decode the pronunciation according to the value of the merging information indicating how to use phonemes to merging the pronunciation of the segment.

除以上3种拼合相关处理形式外,还可能有一些别的拼合相关处理形式,用音素拼合一乐段发音时可使用0种或数种组拼形式(组拼形式的种类包括本说明书前面述的顺序拼合,交织拼合,以及本说明书没述及的其它组拼形式等,其它组拼形式可以涉及或没涉及对其它附加信息进行的处理,其它组拼形式甚至可以是只对其它附加信息进行处理,任一组拼形式可以是或不是按未定位或定位的音素里或解码时得到的类音素里的操作信息来作相应处理的)且在拼合此一乐段发音时任一组拼形式可被多次使用,且拼合此一乐段发音时可使用0种或数种拼合相关处理形式(拼合相关处理形式的种类包括本说明书前面述的对音素进行修改,对定位的音素的复制处理,以及本说明书没述及的别的拼合相关处理形式等,别的拼合相关处理形式可以涉及或没涉及对其它附加信息进行的处理,别的拼合相关处理形式甚至可以是只对其它附加信息进行处理,任一拼合相关处理形式可以是或不是按未定位或定位的音素里或解码时得到的类音素里的操作信息来作相应处理的)且在拼合此一乐段发音时任一拼合相关处理形式可被多次使用,以上各处说的使用0种即不使用;比如把一些定位的音素全部顺序拼合起来,把这些定位的音素顺序转成发音即形成乐段发音,再比如可以把一些定位的音素交织拼合起来得到自定位的一个类音素,把另一定位的音素作修改来得到自定位的另一个类音素,再把上两句各自得到的自定位的类音素按其对应的发音的发音先后顺序拼合起来,并把顺序拼合的结果里这两个自定位的类音素顺序转成发音即形成乐段发音;甚至解码过程中可先按某组拼形式用几个类音素来拼合得到一个类音素,然后再把此得到的类音素拿去跟别的类音素按另外的组拼形式来拼合得到另外的类音素,这另外的类音素可被用于发音(比如这另外的类音素就是解码用整步发音方式时解码得到的解码结果的全部并可按这另外的类音素播乐段的全部或部分发音,上句说的情况中的类音素是以是自定位的为例的),也可继续被用于此拼合处理后面的拼合步骤的拼合;有关如何按0种或数种组拼形式和0种或数种拼合相关处理形式来拼合还可参看本说明书后面述的编码举例1和解码举例1,编码举例2和解码举例2。本段以上所述的用音素拼合一乐段发音是指用音素拼合一乐段的全部发音,本段以上所述的形成乐段发音是指形成乐段的全部发音,对于用音素拼合一乐段的部分发音(解码时播乐段的部分发音时)显然可按以上所述用音素拼合一乐段的全部发音的情况类推,对于形成乐段的部分发音(解码时播乐段的部分发音时)显然可按以上所述形成乐段的全部发音的情况类推。In addition to the above 3 kinds of combination-related processing forms, there may also be some other combination-related processing forms. Zero or several combination forms can be used when combining phonemes to pronounce a music section (the types of combination forms include the aforementioned Sequential splicing, interleaving splicing, and other splicing forms not mentioned in this manual, etc. Other splicing forms may or may not involve the processing of other additional information, and other splicing forms may even only process other additional information. processing, any grouping form may or may not be processed according to the operation information in the unlocated or positioned phoneme or the phoneme-like obtained during decoding) and any grouping form when combining the pronunciation of this music section It can be used multiple times, and 0 or several related processing forms can be used when assembling the pronunciation of this passage (the types of related processing forms include modifying the phonemes mentioned earlier in this specification, and copying the positioned phonemes. , and other combination-related processing forms not mentioned in this manual, etc. Other combination-related processing forms may or may not involve processing other additional information, and other combination-related processing forms may even only perform processing on other additional information processing, any combination-related processing form may or may not be processed according to the operation information in the unlocated or positioned phoneme or the phoneme-like obtained during decoding) and any combination-related The processing form can be used multiple times, the use of 0 mentioned above is not used; for example, put together all the positioned phonemes in sequence, and convert the sequence of these positioned phonemes into pronunciation to form the pronunciation of the segment, and another example can be put Some localized phonemes are interleaved and put together to obtain a self-localized phoneme, and another localized phoneme is modified to obtain another self-localized phoneme, and then the self-localized phoneme obtained in the previous two sentences is used according to its corresponding The pronunciation sequence of the pronunciation is put together, and the sequence of the two self-positioned phonemes in the result of the sequence is converted into pronunciation to form the pronunciation of the segment; even in the decoding process, several phonemes can be used in a certain group spelling form first. Combine to get a phoneme, and then take the obtained phoneme with other phonemes in another combination to get another phoneme, and this other phoneme can be used for pronunciation (such as this other The class phoneme is exactly the whole decoding result obtained by decoding when the whole-step pronunciation mode is used, and the whole or part of the music section can be played according to this other class phoneme. example), can also continue to be used for the stitching of the stitching steps behind this stitching process; how to stitch together by 0 or several combination forms and zero or several stitching related processing forms can also be referred to later in this specification Encoding Example 1 and Decoding Example 1, Encoding Example 2 and Decoding Example 2. Combining a music section pronunciation with phonemes described above in this paragraph refers to combining all pronunciations of a music section with phonemes, and forming a music section pronunciation described above in this paragraph refers to forming all pronunciations of a music section. The partial pronunciation of section (when decoding, when playing the partial pronunciation of music section) can obviously be combined with the situation of all the pronunciations of a music section with phonemes as described above by analogy, for forming the partial pronunciation of music section (when decoding, broadcast the partial pronunciation of music section When) obviously can be analogized by the above-mentioned situation of forming all the pronunciations of the music section.

上一段是针对解码时按整步发音方式来说的,对于分步发音方式也是与整步发音方式类似的,在解码用分步发音方式时,用音素拼合一乐段发音时可使用0种或数种组拼形式(组拼形式的种类包括本说明书前面述的顺序拼合,交织拼合,以及本说明书没述及的其它组拼形式等,其它组拼形式可以涉及或没涉及对其它附加信息进行的处理,其它组拼形式甚至可以是只对其它附加信息进行处理,任一组拼形式可以是或不是按未定位或定位的音素里或解码时得到的类音素里的操作信息来作相应处理的)且在拼合此一乐段发音时任一组拼形式可被多次使用,且拼合此一乐段发音时可使用0种或数种拼合相关处理形式(拼合相关处理形式的种类包括本说明书前面述的对音素进行修改,对定位的音素的复制处理,以及本说明书没述及的别的拼合相关处理形式等,别的拼合相关处理形式可以涉及或没涉及对其它附加信息进行的处理,别的拼合相关处理形式甚至可以是只对其它附加信息进行处理,任一拼合相关处理形式可以是或不是按未定位或定位的音素里或解码时得到的类音素里的操作信息来作相应处理的)且在拼合此一乐段发音时任一拼合相关处理形式可被多次使用,以上各处说的使用0种即不使用;比如用分步发音方式时,可以第1步把一些从拼合信息的值里直接得到的音素先各自作定位再交织拼合起来得一个自定位的类音素,并按这个自定位的类音素发出音来,第2步再把从拼合信息的值里直接得到的另一音素先作定位再作复制处理来得到自定位的另一个类音素,并把被定位的这另一音素和这自定位的另一个类音素顺序拼合起来,并按这顺序拼合的结果里被定位的这另一音素和这自定位的另一个类音素顺序发出音来,这样前面自定位的类音素的发音和后面被定位的这另一音素的发音和自定位的这另一个类音素的发音就形成乐段发音。本段以上所述的用音素拼合一乐段发音是指用音素拼合一乐段的全部发音,本段以上所述的形成乐段发音是指形成乐段的全部发音,对于用音素拼合一乐段的部分发音(解码时播乐段的部分发音时)显然可按以上所述用音素拼合一乐段的全部发音的情况类推,对于形成乐段的部分发音(解码时播乐段的部分发音时)显然可按以上所述形成乐段的全部发音的情况类推。The previous paragraph is for the whole-step pronunciation method when decoding, and the step-by-step pronunciation method is similar to the whole-step pronunciation method. When the step-by-step pronunciation method is used for decoding, 0 types can be used when combining phonemes to pronounce a paragraph. or several combination forms (the types of combination forms include the above-mentioned sequential combination in this specification, interweaving combination, and other combination forms not mentioned in this specification, and other combination forms may or may not involve other additional information. The processing carried out, other combination forms can even only process other additional information, and any combination form can be or not according to the operation information in the unlocated or located phoneme or the phoneme that is obtained during decoding. processing) and any combined form can be used multiple times when assembling the pronunciation of this section, and 0 or several related processing forms can be used when assembling the pronunciation of this section (the types of related processing forms include The modification of phonemes mentioned above in this manual, the copying of positioned phonemes, and other related processing forms of stitching that are not mentioned in this manual, etc., and other related processing forms of stitching may or may not involve the processing of other additional information processing, and other combination-related processing forms may even only process other additional information. Any combination-related processing form may or may not be performed according to the operation information in the unlocated or located phoneme or in the phoneme-like obtained during decoding. Correspondingly processed) and any combination related processing form can be used multiple times when assembling the pronunciation of this piece of music, and the use of 0 mentioned above is not used; for example, when using the step-by-step pronunciation method, you can put the Some phonemes directly obtained from the value of the mosaic information are positioned separately and then interleaved and stitched together to obtain a self-positioned phoneme, and the phoneme is pronounced according to this self-positioned phoneme, and the second step is to put the value from the mosaic information The directly obtained another phoneme is first positioned and then copied to obtain another self-positioned phoneme, and the other phoneme that is positioned and the self-located another phoneme are sequentially combined, and they are combined in this order In the result, this other phoneme and this self-localized another phoneme are pronounced sequentially, so that the pronunciation of the former self-localized phoneme and the pronunciation of this other phoneme and the self-localized other phoneme The pronunciation of a phoneme just forms the pronunciation of a phrase. Combining a music section pronunciation with phonemes described above in this paragraph refers to combining all pronunciations of a music section with phonemes, and forming a music section pronunciation described above in this paragraph refers to forming all pronunciations of a music section. The partial pronunciation of section (when decoding, when playing the partial pronunciation of music section) can obviously be combined with the situation of all the pronunciations of a music section with phonemes as described above by analogy, for forming the partial pronunciation of music section (when decoding, broadcast the partial pronunciation of music section When) obviously can be analogized by the above-mentioned situation of forming all the pronunciations of the music section.

常规情况,任一用于拼合的定位(自定位或被定位)的音素,此定位的音素内的各音是属于同一个声道的(即此音素是定位在此声道里的,可称这样的定位的音素是定位的单声道音素),常规情况,任一拼合时得到的定位的类音素,此定位的类音素内的各音是属于同一个声道的(即此类音素是定位在此声道里的,可称这样的定位的类音素是类定位的单声道类音素);顺序拼合,交织拼合,复制处理等都可以是用定位在同一个声道里的定位的单声道音素来处理(比如用于顺序拼合的两个定位的音素中,一个定位的音素是定位在声道1里的,另一个定位的音素也是定位在声道1里的),另也可以是用定位在不同声道里的定位的单声道音素(比如用于处理的一个定位的音素是定位在声道1里的,用于处理的另一个定位的音素是定位在声道2里的)来作顺序拼合,交织拼合等处理,另也可以是用定位在不同声道里的定位的单声道类音素(比如用于处理的一个定位的类音素是定位在声道1里的,用于处理的另一个定位的类音素是定位在声道2里的)来作顺序拼合,交织拼合等处理,另也可以是用定位在不同声道里的定位的单声道音素和定位的单声道类音素(比如用于处理的一个定位的音素是定位在声道1里的,用于处理的另一个定位的类音素是定位在声道2里的)来作顺序拼合,交织拼合等处理,这些处理的原理可参考本说明书前面述的都是用定位在同一个声道里的定位的单声道音素,定位的单声道类音素来作顺序拼合,交织拼合等处理的阐述(本说明书前面述的顺序拼合,交织拼合等处理中的任一处理里,涉及的定位的音素默认是以是定位的单声道音素为例的,涉及的定位的类音素默认是以是定位的单声道类音素为例的,且涉及的定位的音素和定位的类音素默认都是以是定位在同一个声道里为例的),本申请文件所说的用音素来拼合乐段发音,拼合时是可以有用定位在不同声道里的定位的单声道音素来作处理的,也可以有用定位在不同声道里的定位的单声道类音素来作处理的,也可以有用定位在不同声道里的定位的单声道音素和定位的单声道类音素来作处理的,比如是作诸如顺序拼合,交织拼合等这样的处理。本段所说的定位的音素,定位的类音素里可以有或没有其它附加信息,如有其它附加信息,那其它附加信息中的比如操作信息可以是用于一个声道或用于多个声道的拼合处理的。Under normal circumstances, for any phoneme that is positioned (self-located or positioned) for splicing, each sound in this positioned phoneme belongs to the same channel (that is, the phoneme is positioned in this channel, which can be called The phoneme of such location is the monophonic phoneme of location), conventional situation, the phoneme of location that obtains during any splicing, each sound in the phoneme of this location belongs to the same channel (that is, this type of phoneme is If it is located in this channel, it can be said that such a localized phoneme is a class-located monophonic phoneme); sequence stitching, interweaving stitching, copy processing, etc. can all be positioned in the same channel monophonic phonemes (for example, among the two positioned phonemes used for sequential splicing, one positioned phoneme is positioned in channel 1, and the other positioned phoneme is also positioned in channel 1), and the other is also Can be monophonic with localized phonemes positioned in different channels (e.g. one localized phoneme for processing is localized in channel 1, another localized phoneme for processing is localized in channel 2 in) to do sequential stitching, interleaved stitching, etc., and also can use monophonic phonemes positioned in different channels (for example, a positioned phoneme used for processing is positioned in channel 1 Another positioning phoneme used for processing is located in channel 2) to do sequential stitching, interleaved stitching and other processing, and it can also be a monophonic phoneme positioned in a different channel and Positioned monophonic phonemes (for example, one localized phoneme for processing is positioned in channel 1, and another positioned phoneme for processing is positioned in channel 2) for sequential splicing, Interweaving and splicing and other processing, the principles of these processing can refer to the above-mentioned in this manual, all of which are positioned in the same channel. Explanation (in any processing of the above-mentioned order stitching, interweaving stitching, etc. in this manual, the phonemes involved in the positioning default to the monophonic phonemes that are positioned as an example, and the phonemes involved in the positioning default to is the monophonic phoneme that is positioned as an example, and the phonemes that are involved and the phonemes that are positioned are positioned in the same channel as an example by default), the application document says that the phonemes are combined The pronunciation of the passage can be processed with monophonic phonemes positioned in different channels when assembling, or it can be processed with monophonic phonemes positioned in different channels. It can be processed with localized monophonic phonemes and localized monophonic phonemes located in different channels, for example, processing such as sequence stitching, interleaving stitching, and the like. The localized phoneme mentioned in this paragraph may or may not have other additional information in the localized phoneme. If there is other additional information, the other additional information such as operation information can be used for one channel or for multiple voices. Tao's flattening process.

特殊情况,如有用于拼合的定位(自定位或被定位)的音素,此定位的音素内的各音是属于不同声道的(即此音素是定位在多个声道里的,可称这样的定位的音素是定位的多声道音素),或如有拼合时得到的定位的类音素,此定位的类音素内的各音是属于不同声道的(即此类音素是定位在多个声道里的,可称这样的定位的类音素是定位的多声道类音素),那拼合时,可以使用数个定位的多声道音素来处理,也可以使用数个定位的多声道类音素来处理,也可以使用数个定位的多声道音素和数个定位的单声道音素来处理,也可以使用数个定位的多声道音素和数个定位的多声道类音素来处理,也可以使用数个定位的多声道类音素和数个定位的单声道类音素来处理,等,处理的形式仍可以是顺序拼合,交织拼合,复制处理等,其处理原理可参考本说明书前面述的都是用数个定位的单声道音素来作顺序拼合,交织拼合,复制处理等处理的阐述,本申请文件所说的用音素来拼合乐段发音,拼合时是可以有使用定位的多声道音素(还可能使用到别的类音素,音素,定位的音素等)来作处理的,也可以有使用定位的多声道类音素(还可能使用到别的类音素,音素,定位的音素等)来作处理的,比如是作诸如顺序拼合,交织拼合,复制处理等这样的处理。本段所说的定位的音素,定位的类音素里可以有或没有其它附加信息,如有其它附加信息,那其它附加信息中的比如操作信息可以是用于一个声道或用于多个声道的拼合处理的。In special cases, if there is a phoneme for positioning (self-positioning or being positioned) for splicing, each sound in the phoneme of this location belongs to different channels (that is, the phoneme is positioned in multiple channels, which can be called like this The localized phoneme is a localized multi-channel phoneme), or if there is a localized phoneme obtained during splicing, each sound in this localized phoneme belongs to a different channel (that is, this type of phoneme is localized in multiple In the channel, it can be said that such a localized phoneme is a localized multi-channel phoneme), and when assembling, it can be processed with several localized multi-channel phonemes, or several localized multi-channel phonemes can be used. It can also be processed by using several positioned multi-channel phonemes and several positioned monophonic phonemes, or it can be processed by using several positioned multi-channel phonemes and several positioned multi-channel phonemes Processing can also be processed by using several positioned multi-channel phonemes and several positioned monophonic phonemes, etc. The processing form can still be sequence stitching, interleaving stitching, copy processing, etc. The processing principle can refer to The foregoing descriptions of this specification all use several positioned monophonic phonemes to do sequence stitching, interweaving stitching, copy processing and other processing elaborations. The use of phonemes in this application document to stitch together the sound of a segment can be done when stitching together. Use localized multi-channel phonemes (may also use other class phonemes, phonemes, localized phonemes, etc.) for processing, or use localized multi-channel phonemes (may also use other class phonemes, Phonemes, positioned phonemes, etc.) are processed, such as processing such as sequence stitching, interleaving stitching, copy processing, etc. The localized phoneme mentioned in this paragraph may or may not have other additional information in the localized phoneme. If there is other additional information, the other additional information such as operation information can be used for one channel or for multiple voices. The combination of the Tao is processed.

本申请文件所说的定位的音素属于某声道,是指此定位的音素里的各音所处五线谱的位置都属于某声道,而不管此定位的音素里有无其它附加信息;本申请文件所说的定位的类音素属于某声道,是指此定位的类音素里的各音所处五线谱的位置都属于某声道,而不管此定位的类音素里有无其它附加信息。The positioned phoneme in this application document belongs to a certain channel, which means that the position of each sound in the positioned phoneme belongs to a certain channel, regardless of whether there is any other additional information in the positioned phoneme; this application The document said that the located phoneme belongs to a certain channel, which means that the positions of the staves of each sound in the located phoneme belong to a certain channel, regardless of whether there is any other additional information in the located phoneme.

本申请文件各处有说到音素里包含一个或多个音的信息,对于这一个或多个音中的任一音(如这一个或多个音就是一个音,那这任一音就是这一个音,如这一个或多个音是多个音,那这任一音就是这多个音中的任一音),不一定是有关它的所有信息都包含在这音素里。This application file has the information that says to comprise one or more sounds in the phoneme everywhere, for any sound in this one or more sounds (as this one or more sounds is exactly a sound, then this any sound is this A tone, such as the one or more tones are multiple tones, then any tone is any tone in the plurality of tones), not necessarily all information about it is included in this phoneme.

除非提及时有特别指明有他意,本说明书除本段和上一段外各处提及的音素,默认是指1型,2型,3型音素,实际中可能还用到一些别的形式的音素,别的形式的音素也含有数个音的信息,比如音素里包括多个音各自的发音时刻值,和这多个音各音互相之间的音程关系数据,不包括这多个音各自的音长值(这种音素中是包含有不同发音时刻值的),而在拼合信息的值里可放这个音素的音素编号,并把这音素里的多个音各自的音长值当成其它信息也放在此拼合信息的值里,解码时可把这音素编号和这些音的音长值合起来得到一个类音素(如这音素里有其它附加信息,那这音素里的所有其它附加信息也可照搬放到这个类音素里),以上几句说的情况可以看成是用音素拼合乐段发音时的一种拼合相关处理形式(本说明书前面也有提到拼合相关处理形式),得到的这个类音素可用于后面的拼合处理等,这个类音素可用于本说明书前面述的顺序拼合,交织拼合等组拼形式以及本说明书没提到的组拼形式,也可用于本说明书前面述的复制处理,定位等拼合相关处理形式以及本说明书没提到的拼合相关处理形式。以上说的这种情况还可有一种变形,即音素里包括多个音各自的发音时刻值,和这多个音各自的音长值,不包括这多个音各音互相之间的音程关系数据,而在拼合信息的值里可放这个音素的音素编号,并把这音素里的多个音各音互相之间的音程关系数据当成其它信息也放在此拼合信息的值里。再比如音素里包括多个音各自的发音时刻值,和这多个音各音互相之间的音程关系数据,和这多个音中部分音各自的音高值不包括另外部分音各自的音高值,和这多个音各自的音长值(这种音素中是包含有不同发音时刻值的),而在解码拼合时可对这音素进行定位得到一个被定位的音素(如这音素里有其它附加信息,那这音素里的所有其它附加信息一般可照搬放到这个被定位的音素里),得到的这个被定位的音素可用于后面的拼合处理等,这个被定位的音素可用于本说明书前面述的顺序拼合,交织拼合等组拼形式以及本说明书没提到的组拼形式,也可用于本说明书前面述的复制处理,定位等拼合相关处理形式以及本说明书没提到的拼合相关处理形式。再比如音素里包括多个音各音互相之间的音程关系数据,和这多个音各自的音长值,不包含这多个音中任一音的发音时刻值或只包含这多个音中部分音各自的发音时刻值,而在拼合信息的值里可放这个音素的音素编号,并把这音素里的多个音各自的发音时刻值当成其它信息也放在此拼合信息的值里,解码时可把这音素编号和这些发音时刻值合起来得到一个类音素(如这音素里有其它附加信息,那这音素里的所有其它附加信息也可照搬放到这个类音素里),以上几句说的情况可以看成是用音素拼合乐段发音时的一种拼合相关处理形式(本说明书前面也有提到拼合相关处理形式),得到的这个类音素可用于后面的拼合处理等,这个类音素可用于本说明书前面述的顺序拼合,交织拼合等组拼形式以及本说明书没提到的组拼形式,也可用于本说明书前面述的复制处理,定位等拼合相关处理形式以及本说明书没提到的拼合相关处理形式。本申请所说的用音素来拼合乐段发音时可以是包括本段以上所述或没述及的有关别的形式的音素的情况中的一种或多种情况的。本段各处说到的音素是指定义的音素(编码规则里定义的,或解码规则里定义的,或既在编码规则定义也在解码规则里定义的),或是放在拼合信息的值里的被看成是音素的信息集合。本段以上所说的各情况里的音素里可没有或有数个其它附加信息(任一其它附加信息比如是音色或节奏信息或调式信息或操作信息等),对于本段以上所说的各情况里的含其它附加信息的音素和含其它附加信息的类音素,那解码拼合时可根据这些其它附加信息来作相应处理,其处理方式与本说明书前面那些段述及的有关其它附加信息的处理方式类似。Unless otherwise specified when mentioned, the phonemes mentioned in this manual except this paragraph and the previous paragraph refer to type 1, type 2, and type 3 phonemes by default, and some other types of phonemes may be used in practice , other forms of phonemes also contain information about several tones. For example, phonemes include the pronunciation time values of each of the multiple tones, and the interval relationship data between each of the multiple tones, excluding the individual tones of the multiple tones Sound length value (this kind of phoneme contains different pronunciation time values), and the phoneme number of this phoneme can be placed in the value of the combined information, and the sound length values of multiple sounds in this phoneme can be regarded as other information It is also placed in the value of this combined information. When decoding, this phoneme number and the sound length value of these sounds can be combined to obtain a class phoneme (if there are other additional information in this phoneme, then all other additional information in this phoneme will also be included). can be copied into this class of phonemes), the situation mentioned in the above sentences can be regarded as a kind of related processing form when combining phonemes to combine the sound of a section (the related processing form of combining is also mentioned in the front of this manual), and the obtained The phoneme-like can be used for subsequent combination processing, etc. This phoneme can be used for the combination forms such as sequential combination and interweaving combination mentioned above in this specification, as well as the combination forms not mentioned in this specification, and can also be used for the copy processing mentioned above in this specification. , positioning and other related processing forms of combining and other related processing forms not mentioned in this manual. There is also a variant of the above-mentioned situation, that is, the phoneme includes the respective pronunciation time values of multiple tones, and the respective sound length values of these multiple tones, and does not include the interval relationship between each of these multiple tones. Data, and the phoneme number of this phoneme can be placed in the value of the combined information, and the interval relationship data between the multiple sounds in the phoneme is also placed in the value of the combined information as other information. Another example is that the phoneme includes the pronunciation time values of multiple tones, and the interval relationship data between each of the multiple tones, and the respective pitch values of some of the multiple tones do not include the respective pitches of other tones High value, and the respective sound length values of these multiple sounds (this phoneme contains different pronunciation time values), and when decoding and merging, this phoneme can be positioned to obtain a positioned phoneme (such as in this phoneme) If there are other additional information, then all other additional information in this phoneme can generally be copied into this positioned phoneme), and the obtained positioned phoneme can be used for subsequent stitching processing, etc., and this positioned phoneme can be used in this The combination forms such as sequence stitching, interweaving stitching mentioned above in the manual and the stitching forms not mentioned in this manual can also be used in the copy processing, positioning and other stitching related processing forms mentioned earlier in this manual and the stitching related forms not mentioned in this manual processing form. Another example is that the phoneme includes the interval relationship data between each of the multiple tones, and the respective length values of the multiple tones, and does not include the pronunciation time value of any of the multiple tones or only includes the multiple tones The pronunciation time values of the middle part sounds, and the phoneme number of this phoneme can be placed in the value of the combined information, and the pronunciation time values of the multiple sounds in the phoneme can also be placed in the value of the combined information as other information , when decoding, this phoneme number and these pronunciation time values can be combined to obtain a phoneme (if there is other additional information in this phoneme, then all other additional information in this phoneme can also be copied into this phoneme), the above The situation described in a few sentences can be regarded as a combination related processing form when the phoneme is used to combine the pronunciation of a section (the related processing form of combination is also mentioned earlier in this manual), and the obtained phonemes can be used for subsequent combination processing, etc., this The phonemes can be used in the combination forms such as sequential combination and interweaving combination mentioned above in this specification, and the combination forms not mentioned in this specification. The mentioned flattening related processing form. The use of phonemes to assemble the pronunciation of the passage mentioned in this application may include one or more of the situations related to other forms of phonemes mentioned above or not mentioned in this paragraph. The phoneme mentioned in this paragraph refers to the defined phoneme (defined in the encoding rule, or in the decoding rule, or defined in both the encoding rule definition and the decoding rule), or the value placed in the splicing information The information in is regarded as a collection of phonemes. There may be no or several other additional information (any other additional information such as timbre or rhythm information or mode information or operation information, etc.) in the phoneme in each situation mentioned above in this paragraph. For each situation mentioned above in this paragraph The phonemes containing other additional information and the phonemes containing other additional information can be processed according to these other additional information when decoding and merging, and the processing method is the same as that of other additional information mentioned in the previous paragraphs of this specification In a similar way.

本说明书各处述及音素时,如提及此音素里的发音时刻值,那默认此音素是含有音的发音时刻值的。When referring to phonemes throughout this manual, if the pronunciation time value in this phoneme is mentioned, it is assumed that this phoneme contains the pronunciation time value of the sound.

本说明书所称编码是将乐段的乐谱,按编码规则编码生成拼合信息的值,解码是编码的逆过程,是按解码规则来对拼合信息的值进行解码并播放出音乐来的。Encoding referred to in this manual refers to encoding the music score of the music section according to the encoding rules to generate the value of the combined information, and decoding is the inverse process of encoding, which decodes the value of the combined information according to the decoding rules and plays the music.

关于乐谱拼合,对于不同的编码规则,可能使用到不同的音素定义方式,编码方式,或乐段拼合信息的值的格式等,对于不同的解码规则,可能使用到不同的音素定义方式,或解码方式等。With regard to sheet music merging, for different encoding rules, different phoneme definition methods, encoding methods, or the format of the value of the segment merging information may be used, and for different decoding rules, different phoneme definition methods may be used, or decoding way etc.

针对不同应用场合的不同编码规则(比如用于简单乐谱书中的乐段编码和用于复杂乐谱书中的乐段编码就是不同应用场合),不同编码规则涉及定义使用的音素个数及种类可能不同;一个编码规则可对多个不同乐段分别编码各自生成乐段拼合信息的值。Different coding rules for different applications (for example, the coding of passages in simple music books and the coding of passages in complex music books are different applications), and different coding rules involve defining the number and types of phonemes used. Different; one encoding rule can encode the values for generating segment combination information for multiple different segments respectively.

针对不同应用场合的不同解码规则(比如用于表示简单乐谱书中的乐段的发音的拼合信息的值解码和用于表示复杂乐谱书中的乐段的发音的拼合信息的值解码就是不同应用场合),不同解码规则涉及定义使用的音素个数及种类可能不同;一个解码规则可对多个不同乐段拼合信息的值分别解码以播各自发音。Different decoding rules for different applications (for example, the value decoding of the combined information used to represent the pronunciation of a passage in a simple score book and the value decoding of the combined information used to represent the pronunciation of a passage in a complex score book are different applications. Occasions), different decoding rules involve the definition of the number and types of phonemes used may be different; one decoding rule can decode the values of a plurality of different music segment combination information to broadcast their respective pronunciations.

对乐段的编码是可以适用于一个声道或多个声道的乐段的,对乐段拼合信息的值的解码发音是可以适用于一个声道或多个声道的。The coding of the segment can be applied to the segment of one channel or multiple channels, and the decoding pronunciation of the value of the combination information of the segment can be applied to one channel or multiple channels.

显然,用一个编码规则编码时,同一个音素一般可用于不同乐段的编码(这里不同乐段的编码都是用此编码规则来作的),并且在用此编码规则来编码生成的不同乐段的拼合信息的值里一般是用相同的音素编号来表示同一个音素;用一个解码规则解码时,同一个音素一般可用于在对不同乐段拼合信息的值进行解码时的拼合处理(这里对不同乐段拼合信息的值进行解码都是按此解码规则来作的)。Apparently, when coding with one coding rule, the same phoneme can generally be used for coding different music sections (here, the coding of different music sections is all done with this coding rule), and when using this coding rule to code different music pieces generated Generally, the same phoneme number is used to represent the same phoneme in the value of the combination information of a segment; when decoding with a decoding rule, the same phoneme can generally be used for combination processing when decoding the values of the combination information of different music segments (here Decoding the values of the combined information of different music segments is done according to this decoding rule).

显然,按一编码规则对一乐段编码生成拼合信息的值时,一般是只用到此编码规则里定义的所有音素中的一部分音素,按一解码规则对一拼合信息的值进行解码时,一般是只用到此解码规则里定义的所有音素中的一部分音素。Obviously, when encoding a music segment according to a coding rule to generate the value of the combined information, generally only a part of phonemes in all the phonemes defined in the coding rule are used, and when decoding the value of a combined information according to a decoding rule, Generally, only some of the phonemes defined in this decoding rule are used.

如用一编码规则编码生成的某乐段拼合信息的值,可被一解码规则解码发音,那此解码规则与此编码规则可以是专门对应或一般对应的,专门对应是指此解码规则是专门用于对用此编码规则编码生成的各拼合信息的值进行解码发音的,一般对应是指此解码规则不是专门用于对用此编码规则编码生成的各拼合信息的值进行解码发音的但此解码规则至少可以对用此编码规则编码生成的某一拼合信息的值进行解码发音;如一解码规则与一编码规则是专门对应的,那此解码规则所用的解码数据与此编码规则所用的编码数据一般是相同或兼容的,如一解码规则与一编码规则是一般对应的,那此解码规则所用的解码数据与此编码规则所用的编码数据一般是相同或兼容的。If the value of a piece of combination information generated by encoding with an encoding rule can be decoded and pronounced by a decoding rule, then the decoding rule and the encoding rule can be specifically or generally corresponding, and the special correspondence means that the decoding rule is specialized It is used to decode and pronounce the value of each piece of information encoded with this encoding rule. Generally, it means that the decoding rule is not specially used to decode and pronounce the value of each piece of information encoded with this encoding rule. The decoding rule can at least decode and pronounce the value of a piece of information generated by encoding with this encoding rule; if a decoding rule corresponds specifically to an encoding rule, then the decoded data used by the decoding rule and the encoded data used by the encoding rule Generally, they are the same or compatible. If a decoding rule generally corresponds to an encoding rule, then the decoded data used by the decoding rule is generally the same or compatible with the encoded data used by the encoding rule.

对于一编码规则,根据其编码细则(比如定义使用的音素等),其可能可对任意乐段都可编码生成拼合信息的值,也可能只能对一些乐段可分别编码生成拼合信息的值,而对有些乐段不能分别编码生成拼合信息的值。For an encoding rule, according to its encoding rules (such as defining the phoneme used, etc.), it may be able to encode the value of the combination information for any segment, or it may only be able to encode the value of the combination information for some segments separately , while some music segments cannot be coded separately to generate the value of the splicing information.

在有些乐谱中,可能会出现力度,滑音等特殊记号,不过一般按音乐合成方式播放音乐的单片机或音乐芯片不具备处理这些特殊记号的功能,所以本说明书以下阐述的编码规则举例和解码规则举例里也没涉及这些记号,对乐段乐谱进行编码时忽略这些记号,只对主要乐调编码,解码播放时只把主要的乐调播出;如有些种类单片机芯片或作为发音器件的带喇叭的音乐芯片有处理这些记号的功能,那可以在编码规则和解码规则里处理这些记号,编码生成的乐段拼合信息的值也就记录有有关这些记号的信息(可把这些记号的信息归入其它附加信息),播放时可由单片机芯片对拼合信息的值解码并由单片机芯片或作为发音器件的带喇叭的音乐芯片播放带有力度,滑音等色彩的音乐。In some music scores, there may be special marks such as dynamics and portamento, but generally the single-chip microcomputer or music chip that plays music in the way of music synthesis does not have the function to process these special marks, so the examples of encoding rules and decoding rules described below in this manual There are no references to these marks, these marks are ignored when encoding the music score, only the main tune is encoded, and only the main tune is broadcast when decoding and playing; such as some types of single-chip microcomputer chips or as sounding devices with loudspeakers The music chip has the function of processing these marks, so these marks can be processed in the encoding rules and decoding rules, and the value of the segment combination information generated by the encoding also records information about these marks (the information of these marks can be classified into other Additional information), during playback, the value of the combined information can be decoded by the single-chip microcomputer chip and played by the single-chip microcomputer chip or a music chip with a speaker as a sounding device with strength, portamento and other colors.

一般可将常用的音素做成音素数据库(可以是:音素数据库里的每个记录项里存放一个音素的信息,比如音素编号,音素里各音的发音时刻值等,如音素包含其它附加信息,那其它附加信息一般也放在此音素的记录项里),按编码规则编码时一般要使用音素数据库,按解码规则解码时一般也要使用音素数据库,如某解码规则与某编码规则一般对应或专门对应,那按此解码规则解码时使用的音素数据库与按此编码规则编码时使用的音素数据库一般是相同或兼容的。Generally, commonly used phonemes can be made into a phoneme database (it can be: information of a phoneme is stored in each record item in the phoneme database, such as the phoneme number, the pronunciation time value of each sound in the phoneme, etc., as the phoneme contains other additional information, The other additional information is generally also placed in the record item of this phoneme), generally the phoneme database will be used when encoding according to the encoding rules, and the phoneme database will generally be used when decoding according to the decoding rules, such as a certain decoding rule and a certain encoding rule generally correspond or Specifically, the phoneme database used when decoding according to this decoding rule is generally the same or compatible with the phoneme database used when encoding according to this encoding rule.

定义音素来拼合音乐乐段的目的是可把音素数据库存放在拾音器内的存储器里,按解码规则来解码的解码程序和其它解码数据也存放在拾音器内的存储器里,可让各有声音乐读物整页面上诸个码字中的全部或部分码字中各个码字的值是与数个按编码规则来编码生成的乐段拼合信息的值有关的,用拾音器即可读码和解码播放出音乐,一个拾音器可通读各种有这种码字的有声音乐读物。The purpose of defining phonemes to combine music segments is to store the phoneme database in the memory in the pickup, and the decoding program and other decoding data that are decoded according to the decoding rules are also stored in the memory in the pickup, so that each audible music book can be integrated. The value of all or part of the codewords on the page is related to the value of the combination information generated by encoding according to the coding rules, and the code can be read and decoded to play music with a pickup , a pickup can read through various audio music books that have this code word.

解码程序及解码数据是按照解码规则来对乐段拼合信息的值进行解码的,解码规则是与对乐段进行编码得到此乐段拼合信息的值时所使用的编码规则专门对应或一般对应的。拾音器里解码所使用的解码数据一般应与编码时使用的编码数据相同或兼容。解码时是用解码规则按乐段的拼合信息的值解码,一般是得到解码结果的全部后并按解码结果播乐段发音;解码时可使用整步发音方式(即解码时是解码得到解码结果的全部后,再把解码结果拿去播乐段的全部发音或部分发音),也可使用分步发音方式(即解码时是每次得到解码结果的一部分,即把得到的解码结果的一部分拿去发音,照这样到解码结束即形成乐段的全部发音或部分发音,这里说的每次也即每步),也可使用除整步发音方式和分步发音方式以外的其它发音方式来播音乐;因为播放音乐时是要精确确定各个音在什么时刻发音的,所以解码采用分步发音方式时,一般在MCU里需要作记时,以便在各合适时刻作发各步音的动作。整步发音方式和分步发音方式和其它发音方式都是按音乐合成方式播音乐的。且解码涉及的解码结果一般是乐段的谱数据,还可能是其它形式的数据(也是按音乐合成方式播音乐)。The decoding program and the decoded data decode the value of the piece combination information according to the decoding rules. The decoding rules correspond specifically or generally to the encoding rules used when encoding the piece to obtain the value of the piece combination information. . The decoded data used for decoding in the pickup should generally be the same or compatible with the encoded data used for encoding. When decoding, use the decoding rules to decode according to the value of the combined information of the music section. Generally, after obtaining all the decoding results, the music section is pronounced according to the decoding result; when decoding, you can use the full-step pronunciation method (that is, when decoding, the decoding result is obtained by decoding After all the decoding results are taken to play the whole or partial pronunciation of the music section), the step-by-step pronunciation method can also be used (that is, a part of the decoding result is obtained each time when decoding, that is, a part of the obtained decoding result is taken Pronunciation, in this way until the end of decoding, the whole or partial pronunciation of the music section will be formed, and each time mentioned here is also every step), and other pronunciation modes other than the whole-step pronunciation mode and the step-by-step pronunciation mode can also be used to broadcast Music; because when playing music, it is necessary to accurately determine when each sound is pronounced, so when the decoding adopts a step-by-step pronunciation method, it is generally necessary to record the time in the MCU, so that the action of each step sound can be performed at each appropriate moment. The whole-step pronunciation mode, the step-by-step pronunciation mode and other pronunciation modes all play music according to the music synthesis mode. And the decoding result involved in the decoding is generally the spectral data of the music section, and may also be other forms of data (music is also played in a music synthesis manner).

采用分步发音方式时,在解码过程中得到的类音素,可能用于与别的类音素按顺序拼合形式形成解码结果的部分并按解码结果的部分发出音来(这句和上句说到的类音素一般是自定位的,形成的解码结果的部分一般是自定位的),也可能直接发出音来(比如是把在解码过程中得到的一个自定位的类音素当成分步发音方式时的某一步得到的解码结果的一部分并用它发音),也可能用于后面更多拼合处理,还可能是其它情况。When the step-by-step pronunciation method is adopted, the phonemes obtained in the decoding process may be used to form parts of the decoding results in order to form part of the decoding results with other phonemes in sequence (this sentence and the previous sentence said The phoneme-like is generally self-positioning, and the part of the decoding result formed is generally self-positioning), and it may also be pronounced directly (for example, when a self-positioning phoneme obtained in the decoding process is regarded as a step-by-step pronunciation method A part of the decoding result obtained in a certain step and use it to pronounce), may also be used for more stitching processing later, and may be other situations.

解码用整步发音方式时,可能是只用音素作一些拼合处理来得到解码结果(比如音素里已有节奏,调式,谱号等信息所以得到的解码结果里也有节奏,调式,谱号等信息),也可能是用音素作一些拼合处理得到某值,并把这个某值与一些其它附加信息(比如被编码乐段的所有声道各自的节奏,调式,谱号信息)合起来成为解码结果(本说明书后面的各解码举例里的整步发音方式就是这样),还可能是别的情况;按整步发音方式发音时,可以是MCU按解码结果播乐段发音,也可以是MCU把解码结果传给MCU外接的音乐芯片来播乐段发音。本说明书第83页述整步发音方式时提及的解码结果也可以是本段以上所述这些情况,本说明书第12,13页述按整步发音方式发音时提及的解码结果也可以是本段以上所述这些情况。When the whole-step pronunciation method is used for decoding, it may only use phonemes to do some splicing processing to obtain the decoding result (for example, there are rhythm, mode, clef and other information in the phoneme, so the obtained decoding result also has rhythm, mode, clef and other information ), it is also possible to use phonemes to do some splicing processing to get a certain value, and combine this certain value with some other additional information (such as the respective rhythm, mode, and clef information of all channels of the coded section) to become the decoding result (The full-step pronunciation method in each decoding example at the back of this manual is like this), and it may also be other situations; The result is sent to the music chip connected to the MCU to play the sound of the music segment. The decoding result mentioned when the full-step pronunciation method is mentioned on the 83rd page of this specification can also be these situations mentioned above in this paragraph, and the decoding result mentioned when the 12th and 13 pages of this specification describe the pronunciation by the whole-step pronunciation method can also be These situations are described above in this paragraph.

解码用分步发音方式时,可能每一步是只用音素作一些拼合处理来得到解码结果的部分(被编码乐段的节奏,调式,谱号等信息可以在解码开始时在拼合信息的值里的比如编码头里得到,并在解码开始时就根据得到的被编码乐段的所有声道各自的节奏,调式,谱号等信息作好MCU发音的参数设置或传给MCU外接的音乐芯片来作好发音的参数设置,这里解码结果不包括节奏,调式,谱号等信息),也可能每一步是用音素作一些拼合处理得到某值,并把这个某值与一些其它附加信息(比如从拼合信息的值里的编码头里得到的被编码乐段的所有声道各自的节奏,调式,谱号信息)合起来成为解码结果的部分,也可能每一步是用音素作一些拼合处理得到解码结果的部分,这些音素里本来就含有一些其它附加信息(比如节奏,调式,谱号信息),还可能是别的情况;按分步发音方式发音时,可以是每步MCU按解码结果的部分发音,也可以是每步MCU把解码结果的部分传给MCU外接的音乐芯片来发音。(本段说的解码结果的部分是被拿去用于分步发音方式的一步发音的)。本说明书第83,84页述分步发音方式时提及的解码结果也可以是本段以上所述这些情况,本说明书第12,13页述按分步发音方式发音时提及的解码结果也可以是本段以上所述这些情况。When the decoding uses the step-by-step pronunciation method, each step may only use the phonemes to do some stitching processing to obtain the decoding result (the rhythm, mode, clef and other information of the coded section can be included in the value of the stitching information at the beginning of decoding For example, it is obtained from the encoding header, and at the beginning of decoding, the parameters of the MCU pronunciation are set according to the rhythm, mode, clef and other information of all the channels of the encoded section or transmitted to the external music chip of the MCU. Set up the parameters of the pronunciation, the decoding result here does not include information such as rhythm, mode, clef, etc.), and it is also possible that each step uses phonemes to do some splicing processing to obtain a certain value, and combine this certain value with some other additional information (such as from The rhythm, mode, and clef information of all the channels of the coded segment obtained in the encoding header in the value of the combined information) are combined to become part of the decoding result, and each step may be decoded by doing some combining processing with phonemes In the part of the result, these phonemes originally contain some other additional information (such as rhythm, mode, and clef information), and there may be other situations; when pronouncing by step-by-step pronunciation, it can be the part of the decoding result of each step MCU Pronunciation can also be that the MCU sends the part of the decoding result to the music chip external to the MCU for pronunciation in each step. (the part of the decoding result mentioned in this paragraph is taken for the one-step pronunciation of the step-by-step pronunciation mode). The decoding result mentioned when the 83rd and 84th pages of this manual describe the step-by-step pronunciation method can also be these situations described above in this paragraph, and the decoding result mentioned when the 12th and 13th pages of this manual describe the pronunciation by the step-by-step pronunciation method can also be These could be the cases described above in this paragraph.

采用分步发音方式时,一般要确定每步做什么,这可按约定默认每步做什么,也可在拼合信息的值里比如拼合信息的值的编码头里放些标志BIT,标志每步做什么(这些标志BIT可看成是拼合涉及的操作信息,属于其它附加信息),也可以在解码时,先根据拼合信息的值里比如拼合信息的值的编码头里的信息,找到拼合信息的值里各声道的数据的位置,再查看各声道的数据,按其解码会产生哪些要拿去发音的被定位的音素及定位的类音素,以及它们发音占的时间,从而确定后面按分步发音方式发音时每步要作些什么,一般要使按每步拿去发音的数据发音而得到的不同声道的发音的持续时间相同。When using the step-by-step pronunciation method, it is generally necessary to determine what each step does. This can be done by default according to the agreement, or you can put some flags BIT in the value of the combined information, such as the encoding header of the value of the combined information, to mark each step. What to do (these flags BIT can be regarded as the operation information involved in the combination, which belongs to other additional information), or when decoding, first find the combination information according to the value of the combination information, such as the information in the encoding header of the value of the combination information The position of the data of each channel in the value of the value, and then check the data of each channel, according to its decoding, which localized phonemes and localized phonemes to be pronounced will be generated, as well as the time occupied by their pronunciation, so as to determine the following What should be done in each step when pronouncing by the step-by-step pronunciation mode, generally the duration of the pronunciation of different vocal channels obtained by pronouncing the data that is taken for pronunciation in each step is the same.

如用一编码规则对一乐段编码生成一拼合信息的值,如用一解码规则解码时,是只用这个拼合信息的值的一部分(比如前半部分)就可解码播出此乐段的部分的发音(这里‘此乐段的部分’是修饰‘发音’的),那这种情况可等同把此乐段的部分(即部分乐段)看成是另一个乐段,可把前面述的这个拼合信息的值的一部分看成是这另一乐段的拼合信息的值(本句和本句后面几句话成立的条件是:前面几句述的拼合信息的值的一部分要符合本说明书前面述的拼合信息的值的条件,比如是表示如何用多个音素来拼合本段前述部分乐段的发音的),这另一乐段的拼合信息的值也是由前述编码规则编码生成的,前面述的解码,就是用解码规则对这另一乐段的拼合信息的值进行解码播这另一乐段的发音。If an encoding rule is used to encode a piece of music to generate a value of combined information, if a decoding rule is used to decode it, only a part of the value of the combined information (such as the first half) can be used to decode and play the part of the piece of music Pronunciation (here 'part of this paragraph' is to modify the 'pronunciation'), then this situation can be equivalent to the part of this paragraph (that is, part of the paragraph) as another paragraph, the aforementioned A part of the value of this combination information is regarded as the value of the combination information of this other music section (the condition that this sentence and the following sentences of this sentence are established is: a part of the value of the combination information described in the previous sentences will meet this specification sheet The condition of the value of the above-mentioned combination information, such as indicating how to use multiple phonemes to combine the pronunciation of the preceding part of this section), the value of the combination information of this other section is also encoded by the aforementioned coding rules. The aforementioned decoding is to use the decoding rules to decode the value of the combined information of this other music section and broadcast the pronunciation of this other music section.

如解码时,是从数个码字里得到多个拼合信息的值,并按解码规则对这多个拼合信息的值分别解码得到各自的解码结果(可以是按同一个解码规则对这多个拼合信息的值各自解码,或是对这多个拼合信息的值各自按不同的解码规则来解码),最后把这些解码结果顺序拿去发音,这种情况可等同看成每次是对一个拼合信息的值进行解码以得到解码结果并把解码结果拿去发音。Such as when decoding, be to obtain the value of a plurality of mosaic information from several codewords, and decode the value of these a plurality of mosaic information respectively according to the decoding rule to obtain respective decoding results (it can be to use the same decoding rule for these multiple The values of the combined information are decoded separately, or the values of the multiple combined information are decoded according to different decoding rules), and finally these decoding results are sequentially pronounced. This situation can be regarded as a combined The value of the message is decoded to obtain the decoded result and the decoded result is used for pronunciation.

拼合信息的值也就是表示如何用音素来拼合乐段发音的拼合信息的值,这里的乐段在编码时和解码时也可被称为被编码乐段。The value of the combination information is also the value of the combination information indicating how to use phonemes to combine the pronunciation of the music section, and the music section here may also be called an encoded music section during encoding and decoding.

如提及时无特别指明有他意,本申请文件各处提及解码结果时,默认是指对乐段的拼合信息的值进行解码而得的解码结果的全部,此解码结果的全部本来是用于播此被编码乐段的全部发音的。If there is no special indication of other intentions when mentioned, when the decoding results are mentioned everywhere in this application document, the default refers to all the decoding results obtained by decoding the value of the combination information of the music section. The entire decoding results are originally used for Play the whole pronunciation of this coded music section.

本说明书前面说到的1型音素是含一个音的信息的音素,本说明书前面说到的2型音素,3型音素是含多个音的信息的音素。The type 1 phoneme mentioned earlier in this manual is a phoneme containing information of one sound, and the type 2 phoneme and type 3 phoneme mentioned earlier in this manual are phonemes containing information of multiple sounds.

数个音的组合也算音组合。The combination of several tones is also considered to be a combination of tones.

本申请文件所说的音组合的音的构成情况(这音组合里含多个音的信息),包括此音组合里的音的个数,此音组合里的各音的发音时刻值(各音的发音时刻值以音组合里最开始发音的音的时刻算起),各音的音高值,各音的音长值。The composition of the sound of the sound combination mentioned in this application document (the information of multiple sounds in this sound combination), including the number of sounds in this sound combination, the pronunciation time value of each sound in this sound combination (each The pronunciation time value of the sound is counted from the time of the first sound in the sound combination), the pitch value of each sound, and the sound length value of each sound.

本申请文件所说的未定位的音素里的音的构成情况(这未定位的音素里含多个音的信息),包括此未定位的音素里的音的个数,此未定位的音素里的这多个音各音的发音时刻值(各音的发音时刻值以这未定位的音素里最开始发音的音的时刻算起),这多个音各音互相之间的音程关系数据,这多个音各音的音长值。(当然本段说未定位的音素里的音的构成情况里有这些数据,是在从这未定位的音素里能得到这些数据的前提下)The composition of the sound in the unlocated phoneme mentioned in this application document (the information of multiple sounds in this unlocated phoneme), including the number of sounds in this unlocated phoneme, the number of sounds in this unlocated phoneme The pronunciation time value of each sound of these multiple sounds (the pronunciation time value of each sound is calculated from the time of the sound that is first pronounced in this unlocated phoneme), the interval relationship data between each sound of these multiple sounds, The sound length value of each sound of these multiple sounds. (Of course, this paragraph says that there are these data in the composition of the sound in the unlocated phoneme, under the premise that these data can be obtained from the unlocated phoneme)

本申请文件所说的定位(自定位或被定位)的音素里的音的构成情况(这定位的音素里含多个音的信息),包括此定位的音素里的音的个数,此定位的音素里的这多个音各音的发音时刻值(各音的发音时刻值以这定位的音素里最开始发音的音的时刻算起),这多个音各音的音高值,这多个音各音的音长值。(当然本段说定位的音素里的音的构成情况里有这些数据,是在从这定位的音素里能得到这些数据的前提下)The composition of the sound in the phoneme of the positioning (self-positioning or being positioned) mentioned in this application document (the information of multiple sounds in the phoneme of this positioning), including the number of sounds in the phoneme of this positioning, this positioning The pronunciation time value of each of these multiple tones in the phoneme (the pronunciation time value of each sound is calculated from the moment of the first pronunciation in the phoneme of this location), the pitch value of each of these multiple tones, this The sound length value of each sound of multiple sounds. (Of course, this paragraph says that there are these data in the composition of the sound in the positioned phoneme, under the premise that these data can be obtained from the positioned phoneme)

本申请文件所说的未定位的类音素里的音的构成情况(这未定位的类音素里含多个音的信息),包括此未定位的类音素里的音的个数,此未定位的类音素里的这多个音各音的发音时刻值(各音的发音时刻值以这未定位的类音素里最开始发音的音的时刻算起),这多个音各音互相之间的音程关系数据,这多个音各音的音长值。(当然本段说未定位的类音素里的音的构成情况里有这些数据,是在从这未定位的类音素里能得到这些数据的前提下)The composition of the sound in the unlocated phoneme mentioned in this application document (the information of multiple sounds in this unlocated phoneme), including the number of sounds in this unlocated phoneme, this unlocated The pronunciation time value of each of these multiple sounds in the class phoneme (the pronunciation time value of each sound is calculated from the moment of the sound that is first pronounced in this unlocated class phoneme), and the mutual relationship between each sound of these multiple sounds The interval relationship data of the multiple tones, the sound length value of each tone of these multiple tones. (Of course, this paragraph says that there are these data in the composition of the sound in the unlocated phoneme, under the premise that these data can be obtained from the unlocated phoneme)

本申请文件所说的定位(自定位或被定位)的类音素里的音的构成情况(这定位的类音素里含多个音的信息),包括此定位的类音素里的音的个数,此定位的类音素里的这多个音各音的发音时刻值(各音的发音时刻值以这定位的音素里最开始发音的音的时刻算起),这多个音各音的音高值,这多个音各音的音长值。(当然本段说定位的类音素里的音的构成情况里有这些数据,是在从这定位的类音素里能得到这些数据的前提下)The composition of the sound in the positioned (self-positioned or positioned) phoneme in this application document (the information of multiple sounds in this positioned phoneme), including the number of sounds in this positioned phoneme , the pronunciation time value of each of these multiple sounds in the phoneme of this location (the pronunciation time value of each sound is calculated from the moment of the sound that is first pronounced in the phoneme of this location), the sound of each sound of these multiple sounds High value, the sound length value of each sound of these multiple sounds. (Of course, this paragraph says that there are these data in the composition of the sound in the positioned phoneme, under the premise that these data can be obtained from the positioned phoneme)

本申请文件所说的音组合的音的构成情况(这音组合里只含一个音的信息),包括音组合里的这个音的音高值,这个音的音长值。The composition of the sound of the sound combination mentioned in this application document (the information of only one sound in this sound combination), includes the pitch value of this sound in the sound combination, and the sound length value of this sound.

本申请文件所说的未定位的音素里的音的构成情况(这未定位的音素里只含一个音的信息),包括此未定位的音素里的这个音的音长值。(当然本段说未定位的音素里的音的构成情况里有这些数据,是在从这未定位的音素里能得到这些数据的前提下)The composition of the sound in the unlocated phoneme mentioned in this application document (the information of only one sound in the unlocated phoneme) includes the sound length value of the sound in the unlocated phoneme. (Of course, this paragraph says that there are these data in the composition of the sound in the unlocated phoneme, under the premise that these data can be obtained from the unlocated phoneme)

本申请文件所说的定位(自定位或被定位)的音素里的音的构成情况(这定位的音素里只含一个音的信息),包括此定位的音素里的这个音的音高值,这个音的音长值。(当然本段说定位的音素里的音的构成情况里有这些数据,是在从这定位的音素里能得到这些数据的前提下)The composition of the sound in the localized (self-localized or localized) phoneme mentioned in this application document (this localized phoneme only contains information about one sound), including the pitch value of the sound in this localized phoneme, The duration value of this sound. (Of course, this paragraph says that there are these data in the composition of the sound in the positioned phoneme, under the premise that these data can be obtained from the positioned phoneme)

本申请文件所说的未定位的类音素里的音的构成情况(这未定位的类音素里只含一个音的信息),包括此未定位的类音素里的这个音的音长值。(当然本段说未定位的类音素里的音的构成情况里有这些数据,是在从这未定位的类音素里能得到这些数据的前提下)The composition of the sound in the unlocalized phoneme mentioned in this application document (the information of only one sound in this unlocated phoneme), includes the sound length value of this sound in this unlocated phoneme. (Of course, this paragraph says that there are these data in the composition of the sound in the unlocated phoneme, under the premise that these data can be obtained from the unlocated phoneme)

本申请文件所说的定位(自定位或被定位)的类音素里的音的构成情况(这定位的类音素里只含一个音的信息),包括此定位的类音素里的这个音的音高值,这个音的音长值。(当然本段说定位的类音素里的音的构成情况里有这些数据,是在从这定位的类音素里能得到这些数据的前提下)The composition of the sound in the localized (self-positioned or localized) phoneme mentioned in this application document (this localized phoneme only contains the information of one sound), including the sound of this sound in the localized phoneme High value, the sound length value of this sound. (Of course, this paragraph says that there are these data in the composition of the sound in the positioned phoneme, under the premise that these data can be obtained from the positioned phoneme)

以上9段提到的未定位的音素里,可以没有或有其它附加信息;以上9段提到的定位的音素里,可以没有或有其它附加信息;以上9段提到的未定位的类音素里,可以没有或有其它附加信息;以上9段提到的定位的类音素里,可以没有或有其它附加信息。The unlocated phonemes mentioned in the above 9 paragraphs may have no or other additional information; the located phonemes mentioned in the above 9 paragraphs may not have or have other additional information; the unlocated phonemes mentioned in the above 9 paragraphs may have other additional information In , there may be no or other additional information; in the localized phoneme mentioned in paragraph 9 above, there may be no or other additional information.

以下11段各处提到的未定位(或定位)的音素里,可以没有或有其它附加信息;以下11段各处提到的未定位(或定位)的类音素里,可以没有或有其它附加信息。In the unlocated (or located) phonemes mentioned in the following 11 paragraphs, there may be no or other additional information; in the unlocated (or located) phonemes mentioned in the following 11 paragraphs, there may be no or other additional information Additional Information.

本说明书各处说到的定位的音素,定位的类音素的上下位置值,是为了表示定位的音素,定位的类音素在五线谱上的上下位置,根据不同的应用需要,定位的音素,定位的类音素的上下位置值可使用各种不同的标准,本说明书各处说到的音的音高值,是为了表示音在五线谱上的上下位置,根据不同的应用需要,音的音高值可使用不同的标准,可以由定位的音素的上下位置值来得到定位的音素里音的音高值,可以由定位的类音素的上下位置值来得到定位的类音素里音的音高值,并且定位的音素或定位的类音素的上下位置值与音高值可使用相同或不同的计算标准,而且不同标准的音高值可根据约定互相转换。The positioned phoneme and the upper and lower position values of the positioned phoneme mentioned in this manual are to represent the positioned phoneme and the positioned phoneme’s upper and lower positions on the stave. According to different application needs, the positioned phoneme and the positioned Various standards can be used for the upper and lower position values of phonemes. The pitch value of the sound mentioned in this manual is to indicate the upper and lower positions of the sound on the stave. According to different application needs, the pitch value of the sound can be Using different criteria, the pitch value of the localized phoneme can be obtained from the upper and lower position values of the localized phoneme, and the pitch value of the localized phoneme-like sound can be obtained from the localized phoneme-like upper and lower position values, and The upper and lower position values and pitch values of a localized phoneme or a localized phoneme-like can use the same or different calculation standards, and pitch values of different standards can be mutually converted according to agreement.

比如一个定位的音素或定位的类音素里的各音的音高值,既可以按五线谱的此定位的音素或定位的类音素所处声道的调式是C大调及高音谱号(或是被编码乐段的此定位的音素或定位的类音素所处声道的实际调式及实际谱号)时来计算(确定)音的音调值(音调值作为音高值),也可以不管调式及谱号,而按五线谱上距离单位来计算(确定)音高值(一般要把五线谱上垂直方向的某个位置作为起点上下位置,计算某一音的音高值就是计算这某一音的上下位置比起点上下位置高或低多少个距离单位;距离单位的概念在本说明书后面有),这两种音高值可在解码时结合调式及谱号互相转换。For example, the pitch value of each tone in the phoneme of a location or the class phoneme of location, both can be C major and the treble clef (or the mode of the sound channel where the phoneme of this location or the class phoneme of location is located in the stave) The pitch value of the sound (the pitch value is used as the pitch value) can be calculated (determined) during the actual mode and the actual clef number of the located phoneme or the positioned phoneme of the coded section, or regardless of the mode and clef, and calculate (determine) the pitch value according to the distance unit on the stave (generally, a certain position in the vertical direction on the stave is used as the upper and lower positions of the starting point, and calculating the pitch value of a certain sound is to calculate the upper and lower positions of this certain sound How many distance units are the position higher or lower than the upper and lower positions of the starting point; the concept of the distance unit will be found later in this manual), and these two pitch values can be converted to each other in combination with the mode and clef during decoding.

对音素,类音素进行定位的值(比如上下位置值1,或解码过程中处理得到的某值),可以是按被编码乐段相应声道的实际的调式及谱号来计算(确定)用于定位的音(一般可以是音素或类音素里最先发音的所有音的最低音)的音调值(这个音调值就是进行定位的值),被定位后的音素(或类音素)里的各音的音高值则可根据用于对这音素(或类音素)进行定位的值(比如上下位置值1,或解码过程中处理得到的某值)以及这个音素(或类音素)里的各音互相之间的音程关系数据来确定;对涉及音素,类音素定位的值(比如上下位置值1,上下位置值2,或解码过程中处理得到的某值),也可以把五线谱上用于定位的音(一般可以是音素或类音素里最先发音的所有音的最低音)的上下位置比起点上下位置(起点上下位置比如是被定位的音所处声道的五线谱上五条线中最中间一线)高或低的距离单位的多少来作为定位的值,自定位的音素(或自定位的类音素)里的各音的音高值可根据此自定位的音素(或自定位的类音素)里的上下位置值2以及这个音素(或类音素)里的各音互相之间的音程关系数据来确定,被定位后的音素(或被定位后的类音素)里的各音的音高值则可根据用于对这音素(或类音素)进行定位的值(比如上下位置值1,或解码过程中处理得到的某值)以及这个音素(或类音素)里的各音互相之间的音程关系数据来确定(确定时还可能用到被编码乐段的相应声道的调式及谱号信息);对涉及音素,类音素定位的值(比如上下位置值1,上下位置值2,或解码过程中处理得到的某值),还可以是其它情况;本段和上一段提到的起点上下位置是指在五线谱垂直方向上自定义的一个上下位置。The value for positioning the phoneme and phoneme (such as the upper and lower position value 1, or a certain value obtained during the decoding process) can be calculated (determined) according to the actual mode and clef of the corresponding channel of the encoded music segment. Based on the pitch value of the localized sound (generally it can be the lowest tone of all the sounds pronounced first in the phoneme or phoneme) (this pitch value is the value for positioning), each in the localized phoneme (or phoneme) The pitch value of the sound can be based on the value used to locate this phoneme (or phoneme) (such as the upper and lower position value 1, or a certain value obtained during the decoding process) and each phoneme (or phoneme) in this phoneme (or phoneme) The interval relationship data between the tones is determined; for the value related to the phoneme and phoneme-like positioning (such as the upper and lower position value 1, the upper and lower position value 2, or a certain value obtained by processing in the decoding process), the staff can also be used for The upper and lower positions of the positioned sound (generally, it can be the lowest sound of all sounds pronounced first in the phoneme or phoneme) are higher than the upper and lower positions of the starting point (the upper and lower positions of the starting point are, for example, the highest among the five lines on the staff of the voice channel where the positioned sound is located) middle line) high or low distance unit as the positioning value, the pitch value of each sound in the self-positioning phoneme (or self-positioning class phoneme) can be based on this self-positioning phoneme (or self-positioning class phoneme) Phoneme) in the upper and lower position value 2 and the interval relationship data between each sound in this phoneme (or phoneme) to determine, the phoneme after being positioned (or the phoneme after being positioned) each tone in The high value can be based on the value used to locate this phoneme (or phoneme) (such as the upper and lower position value 1, or a certain value obtained during decoding) and the relationship between the sounds in this phoneme (or phoneme) (the mode and clef information of the corresponding channel of the coded section may also be used when determining); for the phoneme involved, the value of the phoneme-like positioning (such as the upper and lower position value 1, the upper and lower position value 2 , or a certain value obtained during the decoding process), it can also be other situations; the up and down positions of the starting point mentioned in this paragraph and the previous paragraph refer to a custom up and down position in the vertical direction of the staff.

如这些音高值不是按被编码乐段相应声道的实际的调式及谱号来计算(确定)的音的音调值,那在解码要发音时一般可把解码得到的这些音高值转成按被编码乐段相应声道的实际的调式及谱号来计算(确定)的音的音调值;本说明书后面各解码举例(解码举例1和解码举例2)里提及的音的实际的音高值,是按被编码乐段相应声道实际的调式(C大调)及谱号来计算(确定)的音的音调值。If these pitch values are not the pitch values of the sound calculated (determined) by the actual mode and clef of the corresponding channel of the coded music section, then these pitch values obtained by decoding can generally be converted into when decoding is to be pronounced Calculate (determine) the pitch value of the tone according to the actual mode and clef of the corresponding channel of the encoded music segment; The high value is the pitch value of the sound calculated (determined) according to the actual mode (C major) and clef of the corresponding channel of the coded music section.

本说明书各处说到把解码得到的定位的音素或定位的类音素拿去发音,如定位的音素或定位的类音素里的音的音高值不是按被编码乐段的相应声道(即此定位的音素或定位的类音素所处声道)的实际的调式及谱号来计算(确定)的音的音调值,那一般要把这些音高值转成按被编码乐段的相应声道(即此定位的音素或定位的类音素所处声道)的实际的调式及谱号来计算(确定)的音的音调值,再拿去发音。This instruction manual mentions that the phoneme of location or the phoneme of location obtained by decoding is used for pronunciation, such as the pitch value of the sound in the phoneme of location or the phoneme of location is not according to the corresponding channel of the coded music section (i.e. To calculate (determine) the pitch value of the sound by the actual mode and clef of the localized phoneme or the localized phoneme-like channel), then generally these pitch values should be converted into corresponding tones according to the coded section. Calculate (determine) the pitch value of the sound according to the actual mode and clef of the track (that is, the sound channel where the phoneme or phoneme of the location is located), and then use it for pronunciation.

本说明书所说的某音素的上下位置值1,是指放在拼合信息的值里,用于对这个音素进行定位的上下位置值;本说明书所说的某类音素的上下位置值1,是指放在拼合信息的值里,用于对这个类音素进行定位的上下位置值。The upper and lower position value 1 of a certain phoneme mentioned in this manual refers to the upper and lower position value placed in the combined information value for positioning this phoneme; the upper and lower position value 1 of a certain type of phoneme mentioned in this manual is Refers to the upper and lower position values placed in the value of the spliced information for positioning this type of phoneme.

本申请文件所说的对音素进行定位,是指按拼合信息的值里放的上下位置值来确定这个音素的上下位置值,或是按解码过程中处理得到的某值来确定这个音素的上下位置值。本申请文件所说的对类音素进行定位,是指按拼合信息的值里放的上下位置值来确定这个类音素的上下位置值,或是按解码过程中处理得到的某值来确定这个类音素的上下位置值。The positioning of the phoneme mentioned in this application document refers to determining the upper and lower position values of the phoneme according to the upper and lower position values in the value of the combined information, or determining the upper and lower position values of the phoneme according to a certain value obtained during the decoding process. position value. The positioning of the class phoneme mentioned in this application document refers to determining the upper and lower position values of the class phoneme according to the upper and lower position values in the value of the combined information, or determining the class phoneme according to a certain value obtained during the decoding process. The upper and lower position values of the phoneme.

本说明书所说的音素(或音组合或类音素或被定位的音素里)里含的多个音各音互相之间的音程关系数值(这音素或音组合或类音素或被定位的音素里只含这多个音的信息),包括这多个音中每两个音之间的音程关系值,两个音之间的音程关系值包括这两个音哪个音更高的信息以及这两个音音高的差值,音高的差值可以是根据不同标准来计量的,一般是计量各音之间有多少音步。The phoneme (or sound combination or quasi-phoneme or positioned phoneme) mentioned in this manual contains the interval relationship value between each tone of multiple sounds (in this phoneme or sound combination or quasi-phoneme or positioned phoneme) only contains the information of these multiple tones), including the interval relationship value between each two tones in the multiple tones, and the interval relationship value between the two tones includes the information of which of the two tones is higher and the two tones The difference in pitch, the difference in pitch can be measured according to different standards, generally how many steps there are between each sound.

本说明书各处所说的3个音互相之间的音程关系数据,比如是包括第2音比第1音高(或低)多少音步,第3音比第1音高(或低)多少音步,(而第3音比第2音高或低多少音步则可由前面两句所述内容推出);本说明书各处所说的音素里(或音组合或类音素或被定位的音素里)里含的多个音各音互相之间的音程关系数据的意思也按本段以上所述类推。The interval relationship data between the three tones mentioned in this manual, for example, includes how much the second tone is higher (or lower) than the first pitch, and how much the third tone is higher (or lower) than the first pitch step, (and how much the third tone is higher or lower than the second tone can be deduced from the content of the previous two sentences); in the phonemes mentioned in this manual (or sound combinations or phonemes or positioned phonemes) The meaning of the interval relationship data between each tone of a plurality of tones contained in it is also by analogy as described above in this paragraph.

如音素里记录的各音互相之间的音程关系数据中每两个音之间的音程关系值里音高的差值是按音步来计量的,如对此音素进行定位,用上下位置值1确定此音素在一个有确定调式及谱号的五线谱一声道上的上下位置值后(一般是确定音素里某一音比如最先发音的所有音中最低的音在有确定调式及谱号的五线谱这声道上的上下位置值),那可按此声道的确定调式及谱号以及用于定位此音素的上下位置值1(此上下位置值1就是本段前面述的上下位置值1)按此音素里各音之间的音程关系数据得到此被定位的音素音素里各音的音调值(也是音高值)。如把本段以上各处提到的‘音素’换成‘类音素’,那本段内容仍成立。For example, in the interval relationship data between each tone recorded in the phoneme, the pitch difference in the interval relationship value between every two tones is measured by footsteps. If the phoneme is positioned, use the upper and lower position values 1. After determining the upper and lower position values of this phoneme on a channel of a stave with a certain mode and clef (generally, it is determined that a certain tone in the phoneme, such as the lowest tone among all the sounds pronounced first, has a certain mode and clef The upper and lower position values on the channel of the stave), that can be determined according to the mode and clef of this channel and the upper and lower position value 1 used to locate this phoneme (this upper and lower position value 1 is the above-mentioned upper and lower position value in this paragraph 1) According to the interval relationship data between the tones in the phoneme, the pitch value (also the pitch value) of the tones in the located phoneme is obtained. If the "phoneme" mentioned above in this paragraph is replaced with "phoneme-like", then the content of this paragraph is still valid.

各音素上下位置的确定可以以各音素中的第1次发的音中的最低音为准(如果音素只含有一个音的信息,那这个音就是这个音素中的第1次发的音中的最低音)。各个类音素上下位置的确定可以以各个类音素中的第1次发的音中的最低音为准(如果类音素只含有一个音的信息,那这个音就是这个类音素中的第1次发的音中的最低音)。The determination of the upper and lower positions of each phoneme can be based on the lowest sound in the first sound in each phoneme (if the phoneme only contains information about one sound, then this sound is the first sound in the phoneme). lowest note). The determination of the upper and lower positions of each type of phoneme can be based on the lowest sound in the first sound of each type of phoneme (if the type of phoneme only contains information about one sound, then this sound is the first sound in this type of phoneme) the lowest of the tones).

一个未定位或定位的音素里含数个音的信息,那对于这数个音中的每个音都可说成是这个未定位或定位的音素里的音,一个未定位或定位的类音素里含数个音的信息,那对于这数个音中的每个音都可说成是这个未定位或定位的类音素里的音。An unlocalized or localized phoneme contains information of several tones, and each of these several tones can be said to be a sound in this unlocalized or localized phoneme, an unlocalized or localized phoneme Contains the information of several sounds, then for each sound in these several sounds, it can be said to be the sound in this unlocalized or localized phoneme.

本申请文件所说的节奏信息包括乐段的实际拍的节拍信息(比如把4分音符当成一拍,每小节4拍)和速度信息(指每秒多少拍)。The rhythm information mentioned in this application document includes the beat information of the actual beat of the music section (for example, a quarter note is regarded as a beat, 4 beats per measure) and speed information (referring to how many beats per second).

如类音素的值就是一个音素编号,而这音素编号对应的音素含数个音的信息,那这个类音素可看成就是含这数个音的信息。If the value of a phoneme is a phoneme number, and the phoneme corresponding to the phoneme number contains the information of several tones, then the phoneme can be regarded as containing the information of these several tones.

本说明书所说的符头,就是五线谱上那个小椭圆,如在某时刻发的一个音,其还带一个冠音,那五线谱上这个音和这个冠音都各自由不同的符头来表示(即共有两个小椭圆);本说明书所说的未定位或定位的音素里的音,都只相当于五线谱上一个符头(不能相当于五线谱上多个符头),本说明书所说的未定位或定位的类音素里的音,都只相当于五线谱上一个符头(不能相当于五线谱上多个符头),本说明书所说的音组合里的音,都只对应五线谱上一个符头(不能对应五线谱上多个符头)。The note head mentioned in this manual is the small ellipse on the staff, such as a sound pronounced at a certain moment, which also has a crown sound, and the sound and the crown sound on the staff are represented by different note heads ( That is, there are two small ellipses in total); the sound in the unlocated or positioned phonemes mentioned in this manual is only equivalent to one symbol head on the staff (can not be equivalent to multiple symbol heads on the staff), and the unlocated The sound in the phoneme-like positioning or positioning is only equivalent to one notehead on the staff (it cannot be equivalent to multiple noteheads on the staff), and the sound in the sound combination mentioned in this manual is only corresponding to one notehead on the staff (cannot correspond to multiple noteheads on the stave).

本说明书各处在阐述乐谱的拼合技术时,是以乐段是五线谱为例的,对于其它谱种,比如六线的吉他谱,简谱等,与五线谱是类似的,其它谱种涉及的拼合技术可参照五线谱的拼合技术。说明书附图8,9,11,12的五线谱默认为C大调,谱号为高音谱号,以4分音符为1拍,图15的五线谱为C大调,谱号为低音谱号,以4分音符为1拍,图16的五线谱为C大调,谱号为高音谱号,以4分音符为1拍。When explaining the combination technology of music scores in this manual, the musical notation is taken as an example. For other notation types, such as six-line guitar notation, simplified notation, etc., it is similar to stave notation. The combination technique involved in other notation types You can refer to the stitching technique of staves. The staves in Figures 8, 9, 11, and 12 of the accompanying manual are defaulted to C major, and the clef is a treble clef, with a quarter note as a beat. The staff in Figure 15 is in C major, and the clef is a bass clef, with A quarter note is a beat, and the stave in Figure 16 is a C major, and the clef is a treble clef, and a quarter note is a beat.

本说明书所说的音的实际音高值(在被编码乐段中的实际音高值)用音调值来计量,这个音调值的定义方式可有很多种,只要能表示音高即可,比如被编码乐段是钢琴谱,双声道(假设称为低音声道和高音声道),那乐谱上的任一音都可对应到钢琴的键盘上,可把钢琴键盘上最低音的那个键当成乐谱计量音高的起点,即五线谱里的一个音如对应到这个键,那这个音的音调值为0,然后从这个键开始,往右每过一个黑键音调值加1(这每加的1就对应1音步),每过一个白键音调值也加1(这每加的1就对应1音步),这样五线谱任一声道上任一小节的有确定调式及谱号的任一音对应到相应键后即可得到其音调值(这里只考虑这任一音所处位置的调式及谱号,而不管这任一声道上各处是否有改调式或谱号的情况);后面的解码举例1和解码举例2里涉及的音高值就是按以上7行这样计算的。(本说明书其它各处提到的音步也按本段所述的音步这么理解)The actual pitch value of the sound mentioned in this manual (the actual pitch value in the coded section) is measured by the pitch value. There are many ways to define the pitch value, as long as the pitch can be expressed, for example The coded section is a piano score, binaural (assumed to be called a bass channel and a treble channel), and any sound on the score can correspond to the keyboard of the piano, and the lowest key on the piano keyboard can be As the starting point for measuring the pitch of the music score, that is, if a sound in the stave corresponds to this key, then the pitch value of this sound is 0, and then starting from this key, the pitch value of each black key to the right is increased by 1 (this 1 just corresponds to 1 tone step), and every time a white key pitch value is also added by 1 (this every added 1 just corresponds to 1 tone step), so that any bar on any channel of the stave has any definite mode and clef After one tone corresponds to the corresponding key, its pitch value can be obtained (here only the mode and clef of the position of any tone are considered, regardless of whether there is any change of mode or clef on any channel) ; The pitch values involved in the following decoding example 1 and decoding example 2 are calculated according to the above 7 lines. (The footsteps mentioned elsewhere in this manual are also understood according to the footsteps described in this paragraph)

关于本说明书所说的音素,用于拼合的音素里须包含一个或多个音的信息,并且音素里可以有或没有其它附加信息,而定义音素的方式有多种,本说明书所说的把一个音组合定义成音素是定义音素的方式的一个特例,这样定义的音素里是没有其它附加信息的。Regarding the phonemes mentioned in this manual, the phonemes used for splicing must contain the information of one or more sounds, and there may or may not be other additional information in the phonemes, and there are many ways to define phonemes. Defining a combination of sounds as a phoneme is a special case of the way to define a phoneme, and there is no other additional information in the phoneme defined in this way.

本说明书各处述及距离单位时涉及的距离,是指五线谱垂直方向的距离。除非提及时有特别指明有他意,本说明书各处提及的某声道五线谱上的五条线,默认是指此声道五线谱上的通常见的那五条线;除非提及时有特别指明有他意,本说明书各处提及的某声道五线谱上的某线,默认是指此声道五线谱上的通常见的五条线中的一条线。The distances involved when referring to the distance units throughout this specification refer to the distances in the vertical direction of the staff. Unless otherwise specified when mentioned, the five lines on a staff of a certain channel mentioned throughout this manual refer to the usual five lines on the staff of this channel by default; unless otherwise specified when mentioned, A certain line on the stave of a certain channel mentioned in this manual refers to one of the five commonly seen lines on the staff of this channel by default.

本说明书各处所说的编码规则举例都是相同所指,本说明书各处所说的解码规则举例都是相同所指。The examples of encoding rules mentioned throughout this specification all refer to the same, and the examples of decoding rules mentioned throughout this specification all refer to the same.

如提及时无特别指明有他意也不能从提及处的上下文推出有他意,本申请文件描述乐段拼合技术时各处提及的乐段,默认是指被编码乐段。If there is no special indication of other intentions when mentioning, and it cannot be deduced from the context of the mention, the sections mentioned in this application document when describing the section combination technology refer to the encoded sections by default.

本说明书各处有提到‘音素所处声道’,这里说的音素是指从拼合信息的值里直接得到的音素,这种说法(‘音素所处声道’)的意思是指这音素被定位而成被定位的音素后,这被定位的音素所处的声道。There are references to "the channel where the phoneme is located" everywhere in this manual. The phoneme mentioned here refers to the phoneme directly obtained from the value of the combined information. This statement ('the channel where the phoneme is located') means that the phoneme After being localized to form a localized phoneme, the channel where the localized phoneme is located.

除非提及时有特别注明或从提及处的上下文可推出有他意,本说明书各处(包括后面编码规则举例的叙述,解码规则举例的叙述,编码举例的叙述,编码举例的叙述)提及的其前面没加‘未定位’,‘定位’,‘自定位’,‘被定位’这些字眼来修饰限定的‘音素’(而象‘被定位的这第2个音素’,这里的‘音素’的前面是有‘被定位’来修饰的;而象‘定位的单声道音素’,‘定位的多声道音素’,这些说法里的‘音素’的前面是有‘定位’来修饰的),默认是指定义的音素(编码规则里定义的,或解码规则里定义的,或既在编码规则定义也在解码规则里定义的),本说明书后面各编码举例和各解码举例里,提到的音素1,音素2,音素3,音素4,音素56(1),音素56(2),音素56(3),音素56(4),是指定义的音素(既在编码规则定义也在解码规则里定义的)。除非从提及处的上下文可推出有他意,本说明书各处提及的其前面有用‘未定位’或‘自定位’来修饰限定的‘音素’,默认是指定义的音素(编码规则里定义的,或解码规则里定义的,或既在编码规则定义也在解码规则里定义的);除非从提及处的上下文可推出有他意,本说明书各处提及的‘定位的音素’,默认是指定义的自定位的音素(编码规则里定义的,或解码规则里定义的,或既在编码规则定义也在解码规则里定义的)或解码时得到的被定位的音素。Unless there is a special note when mentioning or it can be inferred from the context of the mention, everywhere in this specification (including the description of the following encoding rules, the description of the decoding rules, the description of the encoding examples, the description of the encoding examples) mentioned The words 'unlocated', 'positioned', 'self-positioned', and 'positioned' are not added in front of it to modify the limited 'phoneme' (and like the second phoneme of 'positioned', the 'phoneme' here 'is modified by 'positioned' in front of it; and like 'positioned monophonic phoneme', 'positioned multi-channel phoneme', the front of 'phoneme' in these expressions is modified by 'positioned' ), the default refers to the defined phoneme (defined in the encoding rule, or in the decoding rule, or both in the encoding rule definition and in the decoding rule). To phoneme 1, phoneme 2, phoneme 3, phoneme 4, phoneme 56(1), phoneme 56(2), phoneme 56(3), phoneme 56(4), refers to the defined phonemes (both defined in the encoding rules and defined in the decoding rules). Unless it can be deduced from the context of the mention that there are other meanings, all the references in this manual are preceded by 'unlocated' or 'self-located' to modify the limited 'phoneme', and the default refers to the defined phoneme (defined in the coding rules or defined in the decoding rules, or defined in both the encoding rules and the decoding rules); unless other meanings can be deduced from the context of the reference, the 'positioned phoneme' mentioned throughout this specification defaults to Refers to the defined self-localized phoneme (defined in the encoding rule, or in the decoding rule, or defined in both the encoding rule definition and the decoding rule) or the localized phoneme obtained during decoding.

如是对一未定位或定位的音素进行定位而得到一被定位的音素,再有说这被定位的音素里有其它附加信息(比如音色,调式,操作信息等信息),那显然这被定位的音素里的这些其它附加信息是本来在前述一未定位或定位的音素里就有的。If an unlocated or located phoneme is located to obtain a located phoneme, and there are other additional information (such as timbre, mode, operation information, etc.) in the located phoneme, then obviously this is located These other additional information in the phoneme are originally in the aforementioned unlocalized or localized phoneme.

本说明书各处说到的根据定位的音素(或定位的类音素)里的一个音的上下位置值和各音互相之间的音程关系数据,来得到这个定位的音素(或定位的类音素)里的各音的音高值这类说法,举个例子,比如一个被定位的音素里含两个音的信息,由其中一个音的上下位置值及这被定位的音素所处声道上这被定位的音素那里的调式,谱号信息,而得到这个音的音调值(音高值),再由这被定位的音素里的两个音之间的音程关系值(也就是这音素里的各音之间的音程关系数据),得知另一个音比这个音高2个音步,那把前面得到的这个音的音调值加上2个音步就是这另一个音的音调值。According to the position value of a sound in the positioned phoneme (or positioned phoneme) and the interval relationship data between each sound, the positioned phoneme (or positioned phoneme) is obtained everywhere in this manual. For example, if a localized phoneme contains information about two tones, the value of the upper and lower positions of one of the tones and the location of the localized phoneme on the channel The tone value (pitch value) of the tone is obtained from the mode and clef information of the located phoneme, and then the interval relationship value between the two tones in the located phoneme (that is, the pitch value in the phoneme) Interval relationship data between each sound), know that another sound is 2 steps higher than this pitch, then adding 2 steps to the pitch value of this sound obtained before is the pitch value of this other sound.

本说明书所说的解码时按一定位的音素来发音,一般是按得到的这定位的音素里各音的音长值,各音的音高值,各音的乐段发音时刻值来发音,而且一般是按此音素的音素编号去音素数据库找到此音素的记录项,并从这记录项取出此音素里的音的信息,并结合此定位的音素所处声道上此定位的音素那里的调式,谱号等信息来得到这定位的音素里各音的音长值,各音的音高值,各音的乐段发音时刻值等信息;本说明书所说的解码时按一定位的类音素来发音,一般是按得到的这定位的类音素里各音的音长值,各音的音高值,各音的乐段发音时刻值来发音。本段各处提到的定位的音素里,可以没有或有其它附加信息;本段各处提到的定位的类音素里,可以没有或有其它附加信息。When decoding in this manual, it is pronounced according to a positioned phoneme. Generally, it is pronounced according to the sound length value of each sound in the positioned phoneme, the pitch value of each sound, and the pronunciation time value of each sound section. And generally go to the phoneme database according to the phoneme number of this phoneme to find the record item of this phoneme, and take out the information of the sound in this phoneme from this record item, and combine the information of this located phoneme on the channel where the located phoneme is located mode, clef and other information to obtain information such as the length value of each sound in the positioned phoneme, the pitch value of each sound, and the pronunciation time value of each sound segment; The phoneme is pronounced, generally by the sound length value of each sound in the class phoneme obtained, the pitch value of each sound, and the music section pronunciation time value of each sound. The localized phonemes mentioned in this paragraph may not have or have other additional information; the localized phonemes mentioned in this paragraph may have no or other additional information.

本申请文件描述音素时涉及的上下位置值,有不同的情况,比如在拼合信息的值里放上下位置值(此上下位置值不是放在音素包含的信息里的),或把上下位置值放在音素包含的信息里,还有对自定位或被定位的音素讨论其上下位置值(即比如讨论定位的音素里第1个音的上下位置值),所以本申请文件为区别起见,将放在拼合信息的值里(不是放在音素包含的信息里)的上下位置值称为上下位置值1,将放在音素包含的信息里的上下位置值称为上下位置值2,对自定位或被定位的音素讨论其上下位置值时将上下位置值称为上下位置值3,而如提到上下位置值(后面无数字标志)是表示可以是以上3种任一种情况的上下位置值。There are different situations for the upper and lower position values involved in the description of phonemes in this application document, such as putting the upper and lower position values in the value of the combined information (the upper and lower position values are not placed in the information contained in the phoneme), or putting the upper and lower position values in In the information contained in the phoneme, there is also a discussion about the upper and lower position values of the phoneme that is self-located or located (that is, the upper and lower position values of the first sound in the phoneme that is discussed). Therefore, for the sake of distinction, this application document will put The upper and lower position value in the value of the combined information (not in the information contained in the phoneme) is called the upper and lower position value 1, and the upper and lower position value placed in the information contained in the phoneme is called the upper and lower position value 2, for self-positioning or When discussing the upper and lower position values of the positioned phoneme, the upper and lower position values are referred to as the upper and lower position value 3, and as mentioned, the upper and lower position values (without a numeral sign behind) are the upper and lower position values that can be any of the above three situations.

本申请文件描述类音素时涉及的上下位置值,有不同的情况,比如在拼合信息的值里放上下位置值(此上下位置值不是放在类音素包含的信息里的),或把上下位置值放在类音素包含的信息里,还有对自定位或被定位的类音素讨论其上下位置值(即比如讨论定位的类音素里第1个音的上下位置值),所以本申请文件为区别起见,将放在拼合信息的值里(不是放在类音素包含的信息里)的上下位置值称为上下位置值1,将放在类音素包含的信息里的上下位置值称为上下位置值2,对自定位或被定位的类音素讨论其上下位置值时将上下位置值称为上下位置值3,而如提到上下位置值(后面无数字标志)是表示可以是以上3种任一种情况的上下位置值。This application document describes the upper and lower position values involved in the phoneme-like, and there are different situations, such as putting the upper and lower position values in the value of the combined information (the upper and lower position values are not placed in the information contained in the phoneme), or putting the upper and lower positions The value is placed in the information contained in the phoneme, and the upper and lower position values of the self-positioned or positioned phoneme (that is, the upper and lower position value of the first sound in the phoneme that discusses positioning), so the application document is For the sake of distinction, the upper and lower position values placed in the value of the combined information (not in the information contained in the phoneme) are called the upper and lower position value 1, and the upper and lower position values placed in the information contained in the phoneme are called the upper and lower positions Value 2, when discussing the upper and lower position values of the self-localized or positioned phonemes, the upper and lower position values are called the upper and lower position values 3, and as mentioned the upper and lower position values (there is no number mark behind) means that it can be any of the above three The upper and lower position values for one case.

一般情况,一个定位(自定位或被定位)的音素可以是定位在五线谱的垂直方向的任一位置;一般情况,一个定位(自定位或被定位)的类音素可以是定位在五线谱的垂直方向的任一位置。本段说的定位的音素里可以有或没有其它附加信息,本段说的定位的类音素里可以有或没有其它附加信息。In general, a phoneme that is positioned (self-positioned or positioned) can be positioned at any position in the vertical direction of the staff; in general, a phoneme that is positioned (self-located or positioned) can be positioned in the vertical direction of the staff any position. The localized phonemes mentioned in this paragraph may or may not have other additional information, and the localized phonemes mentioned in this paragraph may or may not have other additional information.

对于本说明书除本段和下一段以外提到的音素(此音素里含多个音的信息)里的任意某两个音之间的音高差值(音步作为计量单位),不论这音素被定位在五线谱上垂直方向哪个位置,前述某两个音之间的音高差值是不变的,(假设这种音素的种类为音素种类1);本说明书除本段和下一段以外述及未定位的音素里的各音互相之间的音程关系数据与被定位后的此音素里的各音互相之间的音程关系数据是相同的(在本段前述情况时);比如一个音素被定位在一个声道的五线谱上垂直方向一个位置,其第1音和第2音的音高差值为2个音步,这个音素被定位在一个声道的五线谱上垂直方向另一个位置,其第1音和第2音的音高差值仍要为2个音步;除了前面说的音素种类1外,还可以有一种音素(假设这种类为音素种类2),这种音素里的任意某两个音之间的音高差值(音步作为计量单位),在这音素被定位在五线谱上垂直方向不同位置时,前述某两个音之间的音高差值可能相同或不同;比如是定义音素(此音素里含多个音的信息)里的任意两个音之间的音高差值是以距离单位为计量标准而且距离单位数不变为准,比如这个音素被定位在一个声道的五线谱上垂直方向一个位置,其第1音和第2音的音高差值为1个距离单位,音高差值实际为1个音步,这个音素被定位在一个声道的五线谱上垂直方向另一个位置,其第1音和第2音的音高差值仍要为1个距离单位,但音高差值实际为2个音步,用这样的音素也是可用来作乐段拼合的;(本段以下说的‘这种音素’是指音素种类2),本申请文件所说的用音素来拼合乐段发音,所用的音素里可以全部或部分是这种音素,拼合时这种音素的用法与本段最开始几句说的音素的用法相似,比如这种音素里可有也可没有其它附加信息,有用这种音素拼合时涉及的音长,发音时刻值的计量及使用也可同于本说明书前面述的音素的音长,发音时刻值的计量及使用,对这种音素进行处理等后也可得到类音素,有用这种音素拼合时可作定位,顺序拼合,交织拼合,复制处理等处理,有用这种音素时按拼合信息的值解码时可采用整步发音方式或分步发音方式等发音方式发音,且解码时是分一个拼合步骤或多个拼合步骤来完成拼合的,有用这种音素拼合时用于拼合一乐段发音的音素中至少有一个音素是含有不同发音时刻值的,从本段第17行‘比如’到本段第23行‘不同发音时刻值的’为止述的这些内容的具体处理都可参考本说明书其它段的相关叙述,只是在使用时要注意音素里的各音互相之间的音程关系数据中的某两音(这里某两音是任意选出来讨论的)之间的音程关系值里的音高的差值,在此音素被定位在有某调式及谱号的五线谱某声道的垂直方向不同位置时,前述音高的差值有可能会改变。For the pitch difference between any two tones in the phonemes mentioned in this manual except this paragraph and the next paragraph (this phoneme contains information of multiple tones) (steps are used as the unit of measurement), regardless of the phoneme It is located at which position in the vertical direction on the stave, the pitch difference between the aforementioned two tones is constant, (assuming that the type of this phoneme is phoneme type 1); except for this paragraph and the next paragraph in this manual And the interval relationship data between each sound in the unlocated phoneme is the same as the interval relationship data between each sound in this phoneme after being positioned (in the case of the preceding paragraph); for example, a phoneme is Positioned at one position in the vertical direction on the staff of a channel, the pitch difference between the first and second sounds is 2 steps, this phoneme is located at another position in the vertical direction on the staff of a channel, its The pitch difference between the first tone and the second tone still needs to be 2 steps; in addition to the phoneme category 1 mentioned above, there can also be a kind of phoneme (assuming that this category is phoneme category 2), any phoneme in this phoneme The pitch difference between a certain two tones (step as the unit of measurement), when the phoneme is located at different positions in the vertical direction on the staff, the pitch difference between the aforementioned two tones may be the same or different; For example, the pitch difference between any two tones in a defined phoneme (the phoneme contains information about multiple tones) is based on the distance unit as the measurement standard and the number of distance units remains unchanged. For example, this phoneme is positioned at A position in the vertical direction on the stave of a channel, the pitch difference between the first tone and the second tone is 1 distance unit, and the pitch difference is actually 1 step. This phoneme is positioned in a channel At another position in the vertical direction on the stave, the pitch difference between the first note and the second note should still be 1 distance unit, but the pitch difference is actually 2 steps, and such phonemes can also be used as a paragraph Combined; ('this phoneme' mentioned below in this paragraph refers to the phoneme category 2), the use of phonemes in this application document to combine the pronunciation of the passage, can be all or part of this phoneme in the phonemes used, when combined The usage of this phoneme is similar to the usage of the phoneme mentioned in the first few sentences of this paragraph. For example, there may or may not be other additional information in this phoneme, the length of the sound involved in the combination of this phoneme, the measurement of the pronunciation time value and The use can also be the same as the length of the above-mentioned phoneme in this manual, the measurement and use of the pronunciation time value, and the phoneme can also be obtained after this phoneme is processed. Interweaving and splicing, copy processing and other processing, when this kind of phoneme is useful, it can be pronounced in a whole-step pronunciation method or a step-by-step pronunciation method when decoding according to the value of the spliced information, and the decoding is divided into one splicing step or multiple splicing steps. When the combination is completed, at least one of the phonemes used to combine the pronunciation of a piece of music has a different pronunciation time value, from the 17th line of this paragraph 'for example' to the 23rd line of this paragraph' different pronunciations For the specific processing of the above-mentioned content, you can refer to the relevant descriptions in other paragraphs of this manual, but you should pay attention to the interval relationship coefficient between each sound in the phoneme when using it. The pitch difference in the interval relation value between a certain two tones in the data (here the certain two tones are arbitrarily selected for discussion), where the phoneme is located in a certain channel of a stave with a certain mode and a clef At different positions in the vertical direction, the aforementioned difference in pitch may change.

类音素的情况与上段述的音素的情况类似;对于本说明书本段以外提到的类音素(此类音素里含多个音的信息)里的任意某两个音之间的音高差值(音步作为计量单位),不论这个类音素被定位在五线谱上垂直方向哪个位置,前述某两个音之间的音高差值是不变的,(假设这种类音素的种类为类音素种类1);本说明书本段以外述及未定位的类音素里的各音互相之间的音程关系数据与被定位后的此类音素里的各音互相之间的音程关系数据是相同的(在本段前述情况时);比如一个类音素被定位在一个声道的五线谱上垂直方向一个位置,其第1音和第2音的音高差值为2个音步,这个类音素被定位在一个声道的五线谱上垂直方向另一个位置,其第1音和第2音的音高差值仍要为2个音步;除了前面说的类音素种类1外,还可以有一种的类音素(假设这种类为类音素种类2),这种的类音素里的任意某两个音之间的音高差值(音步作为计量单位),在这类音素被定位在五线谱上垂直方向不同位置时,前述某两个音之间的音高差值可能相同或不同;比如是定义类音素(此类音素里含多个音的信息)里的任意两个音之间的音高差值是以距离单位为计量标准而且距离单位数不变为准,比如这个类音素被定位在一个声道的五线谱上垂直方向一个位置,其第1音和第2音的音高差值为1个距离单位,音高差值实际为1个音步,这个类音素被定位在一个声道的五线谱上垂直方向另一个位置,其第1音和第2音的音高差值仍要为1个距离单位,但音高差值实际为2个音步,作乐段拼合时也可涉及到这种类音素;(本段以下说的‘这种类音素’是指类音素种类2),本申请文件所说的用音素来拼合乐段发音时,所实际涉及的类音素里可以全部或部分是这种类音素,拼合时这种类音素的用法与本段最开始几句说的类音素的用法相似,比如这种类音素里可有也可没有其它附加信息,拼合有涉及这种类音素时拼合时涉及的音长,发音时刻值的计量及使用也可同于本说明书前面述的类音素的音长,发音时刻值的计量及使用,对这种类音素进行处理等后也可得到类音素,有涉及这种类音素时拼合时可作定位,顺序拼合,交织拼合,复制处理等处理,按拼合信息的值解码时如有涉及这种类音素那仍可采用整步发音方式或分步发音方式等发音方式发音,且解码时是分一个拼合步骤或多个拼合步骤来完成拼合的,(以上6句话说的情况都可以在拼合时有用到上一段所述的音素种类1和音素种类2中的两种或任一种),从本段第18行‘比如’到本段第24行‘拼合的’为止述的这些内容的具体处理都可参考本说明书其它段的相关叙述,只是在使用时要注意类音素里的各音互相之间的音程关系数据中的某两音(这里某两音是任意选出来讨论的)之间的音程关系值里的音高的差值,在此类音素被定位在有某调式及谱号的五线谱某声道的垂直方向不同位置时,前述音高的差值有可能会改变。(本段所说的类音素可为未定位的或自定位的或被定位的)The situation of the phoneme is similar to that of the phoneme mentioned in the previous paragraph; for the phoneme mentioned outside this paragraph of this specification (this type of phoneme contains the information of multiple sounds), the pitch difference between any two tones (step as the unit of measurement), no matter where the phoneme is located in the vertical direction on the staff, the pitch difference between the aforementioned two tones is constant, (assuming that the phoneme type is the phoneme type 1); The interval relationship data between each sound in the non-located phoneme in this section of this manual is the same as the interval relationship data between each sound in this type of phoneme after being positioned (in During the aforementioned situation of this paragraph); For example, a phoneme is positioned at a position in the vertical direction on the staff of a vocal tract, and the pitch difference between the first and second sounds is 2 steps, and this phoneme is positioned at At another position in the vertical direction on the stave of one channel, the pitch difference between the first tone and the second tone should still be 2 steps; in addition to the aforementioned phoneme-like category 1, there can also be a kind of phoneme-like (assuming that this class is phoneme-like category 2), the pitch difference (step as a unit of measurement) between any two tones in this kind of phoneme is different in vertical direction when this type of phoneme is positioned on the stave position, the pitch difference between the aforementioned two tones may be the same or different; for example, define the pitch difference between any two tones in a phoneme (such a phoneme contains information about multiple tones) The distance unit is used as the measurement standard and the number of distance units does not change. For example, this phoneme is positioned at a position in the vertical direction on the stave of a channel, and the pitch difference between the first tone and the second tone is 1 The unit of distance, the pitch difference is actually 1 step, this type of phoneme is positioned at another position in the vertical direction on the stave of a channel, and the pitch difference between the first and second tones should still be 1 distance unit, but the pitch difference is actually 2 steps, and this type of phoneme can also be involved when making music pieces; When it is said that phonemes are used to combine the pronunciation of a passage, all or part of the phonemes actually involved may be this kind of phoneme. The usage of this kind of phoneme is similar to the usage of the phoneme mentioned in the first few sentences of this paragraph, such as There may or may not be other additional information in this type of phoneme. When the combination involves this type of phoneme, the sound length involved in the combination, the measurement and use of the pronunciation time value can also be the same as the sound length and pronunciation of the above-mentioned phoneme in this manual. The measurement and use of the time value, after processing this type of phoneme, the phoneme can also be obtained. When this type of phoneme is involved, it can be used for positioning, sequence combination, interleaving combination, copy processing, etc., and decode according to the value of the combination information If this type of phoneme is involved, it can still be pronounced by the whole-step pronunciation mode or the step-by-step pronunciation mode, and the decoding is completed in one or more combination steps, (the situation in the above 6 sentences Both can use phoneme type 1 and phoneme type 2 described in the previous paragraph or any one of them when assembling), from the 18th line of this paragraph 'such as' to the 24th line of this paragraph 'stitched' The specific processing of these contents so far can refer to the relevant narrations of other paragraphs in this manual, but when using, pay attention to certain two tones in the interval relationship data between each sound in the class phoneme (here certain two tones are arbitrary The pitch difference in the value of the interval relationship between the selected and discussed), when such phonemes are positioned at different positions in the vertical direction of a certain channel of a stave with a certain mode and clef, the aforementioned pitch difference Subject to change. (The phonemes mentioned in this paragraph can be unlocated or self-located or positioned)

对任一乐段,其不同声道的节奏一般是相同的,也可能不同;如述及时无特别指明有他意,本说明书各处述及有关被编码乐段的节奏的内容时,默认是以被编码乐段不同声道的节奏相同为例的,本说明书后面编码举例1和编码举例2和解码举例1和解码举例2都是这样;对于乐段里不同声道的节奏不同这样的情况,编解码的原理与乐段里不同声道的节奏相同时的编解码原理本质相同,乐段里不同声道的节奏相同时的编解码处理可参考乐段里不同声道的节奏相同时的编解码处理。For any piece of music, the rhythms of different channels are generally the same, and may also be different; if it is stated and there is no special intention to indicate otherwise, when the content of the rhythm of the coded piece is mentioned in this manual, the default is For example, if the rhythms of different channels of the coded section are the same, this is the case for encoding example 1 and encoding example 2, decoding example 1 and decoding example 2 in the following instructions; The principle of encoding and decoding is essentially the same as the principle of encoding and decoding when the rhythms of different channels in a segment are the same. Decoding processing.

对一乐段,其各声道的节奏,调式,谱号一般是不变的(即比如一个乐段的各声道的调式总为C大调),除非提及时有指明有他意,本说明书前面各处描述乐段拼合技术时,默认提及的乐段,其各声道的节奏,调式,谱号是以不变为例的,本说明书前面各处描述乐段拼合技术时,提及的‘被编码乐段的节奏,调式,谱号信息’,就是指被编码乐段的所有声道各自的节奏,调式,谱号信息且任一声道中的节奏,调式,谱号都是不变的,(下面几段提到的被编码乐段的节奏,调式,谱号信息的意思,在被编码乐段里的节奏,调式,谱号是不变的情况时,也是按以上几句所述理解),本说明书前面各处描述乐段拼合技术时,提及到确定被定位的音素(或定位的类音素)里音的上下位置值,各音的音高值,各音互相之间的音程关系值等时,要根据被定位的音素(或定位的类音素)所处声道位置的调式及谱号来确定这类说法,这类说法中是以涉及的乐段中的节奏,调式,谱号是不变的为例的,本说明书前面各处描述乐段拼合技术时,提及的‘有某调式及谱号的五线谱某声道’,是以五线谱这某声道的节奏,调式,谱号是不变的为例的;对于有变的乐段(即乐段中有声道的节奏,调式,谱号有变),可对其分成多个小段,每个小段里的各声道的节奏,调式,谱号是不变的,再对各小段进行编码(对各小段的编码处理方式可参考本说明书其它段所述编码处理方式),将各小段的编码结果合起来就是拼合信息的值,解码时对拼合信息的值里的各小段的数据分别解码以播乐段发音即可(对各小段的解码处理方式可参考本说明书其它段所述解码处理方式);对这种被编码乐段的节奏,调式,谱号是变的的情况,其编解码处理仍可参照本说明书前面所述没有变节奏,调式,谱号的乐段涉及的编解码处理,只是相应的,要把本说明书前面各处描述乐段拼合技术时,提及的‘被编码乐段的节奏,调式,谱号信息’,理解成是包含被编码乐段的所有声道各自涉及的所有节奏,调式,谱号信息(包括乐段开始时各声道各自的节奏,调式,谱号信息,以及以后什么时候变,怎么变的信息),(下面几段提到的被编码乐段的节奏,调式,谱号信息的意思,在被编码乐段里的节奏,调式,谱号是有变的情况时,也是按以上几句所述理解),且要把本说明书前面各处描述乐段拼合技术时,提及到的确定被定位的音素(或定位的类音素)里音的上下位置值,各音的音高值,各音互相之间的音程关系值等时,要根据被定位的音素(或定位的类音素)所处声道的调式及谱号来确定这类说法,理解成这类说法中涉及的乐段中有声道的节奏,调式,谱号是变的,实际确定时一般可根据被定位的音素(或定位的类音素)所处当前位置(一般是所处五线谱相应声道的那小节里)的调式及谱号,另还要把本说明书前面各处描述乐段拼合技术时,提及的‘有某调式及谱号的五线谱某声道’,理解成五线谱这某声道的节奏,调式,谱号是变的。For a piece of music, the rhythm, mode and clef of each channel are generally unchanged (that is, for example, the mode of each channel of a piece of music is always C major), unless otherwise specified when mentioned, this manual When describing the segment combination technology in the previous sections, the rhythm, mode, and clef of the segment mentioned by default, the rhythm, mode, and clef of each channel are assumed to be unchanged. The "rhythm, mode, and clef information of the coded segment" refers to the respective rhythm, mode, and clef information of all channels of the encoded segment, and the rhythm, mode, and clef in any channel are unchanged (the meaning of the rhythm, mode and clef information of the coded music section mentioned in the following paragraphs, when the rhythm, mode and clef in the coded music section are unchanged, it is also according to the above few sentences understanding), when describing the segment combination technology in the front of this manual, it is mentioned that the upper and lower position values of the sounds in the positioned phoneme (or positioned phoneme), the pitch value of each sound, and the relationship between each sound When the interval relation value of the phoneme (or localized phoneme) is located, the mode and clef of the position of the vocal tract should be used to determine this type of statement. In this type of statement, the rhythm in the involved section is The modes and clefs are constant as an example. When describing the technique of segment combination in this manual, the "a certain channel of the stave with a certain mode and clef" mentioned is based on the rhythm of a certain channel of the stave. , mode, and clef are constant as an example; for a section that changes (that is, the rhythm, mode, and clef that have a vocal track in the section change), it can be divided into multiple sub-sections, and each sub-section The rhythm, mode, and clef of each sound channel are unchanged, and then each subsection is encoded (for the encoding processing method of each subsection, please refer to the encoding processing method described in other sections of this manual), and the encoding results of each subsection are combined. It is the value of the combined information. When decoding, the data of each subsection in the value of the combined information is decoded separately to play the pronunciation of the music section (for the decoding processing method of each subsection, please refer to the decoding processing method described in other paragraphs of this manual); For the situation that the rhythm, mode and clef of this coded section are changed, its codec processing can still refer to the codec processing involved in the music section without changing rhythm, mode and clef as mentioned earlier in this manual. Correspondingly, the "rhythm, mode, and clef information of the coded paragraph" mentioned in the description of the segment combination technology in the previous parts of this manual should be understood as including all the channels involved in the coded segment. All rhythm, mode, and clef information (including the rhythm, mode, and clef information of each channel at the beginning of the section, and when and how to change the information later), (the coded sections mentioned in the following paragraphs The meaning of the rhythm, mode, and clef information in the coded section, when the rhythm, mode, and clef in the coded section are changed, it is also understood as described in the above sentences), and the descriptions in the previous parts of this manual When segment combination technology is mentioned, when it is mentioned to determine the upper and lower position values of the tones in the positioned phoneme (or positioned phoneme), the pitch value of each tone, the interval relationship value between each tone, etc., it should be based on The mode and clef of the vocal channel where the located phoneme (or located phoneme-like) is located to determine Define this kind of statement, it is understood that the rhythm, mode, and clef of the vocal tract in the passage involved in this kind of statement are changed, and the actual determination can generally be made according to the current position of the positioned phoneme (or positioned phoneme). The mode and clef at the location (generally in the bar corresponding to the channel of the stave), and the "a certain stave with a certain mode and a certain clef" mentioned in the description of the music segment combination technology in the previous parts of this manual. Vocal channel' is understood as the rhythm, mode, and clef of a certain channel of the stave.

对于本说明书各处说到的放在音素里的作为其它附加信息的节奏,调式,谱号信息,因为本说明书前面各处描述乐段拼合技术时,默认提及的被编码乐段的节奏,调式,谱号是以不变为例的,所以相应的,涉及放在音素里的节奏,调式,谱号信息,一般可以是用于表示被编码乐段总的节奏,调式,谱号信息(比如解码时按约定根据某一个音素里的节奏,调式,谱号信息就知道被编码乐段总的节奏,调式,谱号信息),也可以是表示这音素本身涉及有关的节奏,调式,谱号信息,然后解码拼合时的定位,生成类音素等都与此有关,并且用于拼合一乐段发音的相同声道的各音素里的节奏,调式,谱号信息互相应是一致的,解码得到的相同声道的各定位的音素,类音素里的节奏,调式,谱号信息互相应是一致的,并且各声道的每个定位的音素,或类音素里的节奏,调式,谱号信息就相当于是被编码乐段相应声道的节奏,调式,谱号信息,解码时按发音顺序按解码得到的各声道的各定位的音素,类音素里的节奏,调式,谱号信息及各音的信息发音,即可得到乐段发音,除以上两种情况外,还可以是别的情况;而对于上一段所说的对一乐段,其有声道的节奏,调式,谱号是变的这种情况,就更复杂一些,比如解码时根据拼合信息的值里的一个音素里的节奏,调式,谱号信息,直接就按约定改变乐段某声道的节奏,调式或谱号,或是其它情况。本段以上所说的被编码乐段总的节奏,调式,谱号信息,是指被编码乐段一直用这节奏,调式,谱号信息而不改,且这节奏,调式,谱号信息可以是数个声道的信息,即被编码乐段是多少个声道,这节奏,调式,谱号信息就是几个声道的信息。前面述的‘各音素里的节奏,调式,谱号信息互相应是一致的’,按这样理解:比如各音素包含3个音素(音素1,音素2,音素3),那前几句引号里的说法就是指音素1里的节奏信息与音素2里的节奏信息与音素3里的节奏信息相同,音素1里的调式信息与音素2里的调式信息与音素3里的调式信息相同,音素1里的谱号信息与音素2里的谱号信息与音素3里的谱号信息相同;前面与‘各音素里的节奏,调式,谱号信息互相应是一致的’类似的说法也按前面几句所述类推理解。本说明书所说的被编码乐段的节奏,调式,谱号是不变的,是指被编码乐段的任一声道里的节奏,调式,谱号都是不变的(不变是针对在同一声道内前后不变);本说明书所说的被编码乐段的节奏,调式,谱号是变的,是指被编码乐段的全部或部分声道中的任一声道里,节奏,调式,谱号中的一个或多个有变(变是针对在同一声道内前后有变)。For the rhythm, mode, and clef information that are placed in the phonemes as other additional information mentioned in this manual, because the rhythm of the coded segment mentioned by default when describing the segment combination technology in the previous part of this manual, Mode and clef are exemplified as unchanged, so correspondingly, the rhythm, mode, and clef information involved in the phoneme can generally be used to represent the overall rhythm, mode, and clef information of the coded segment ( For example, when decoding, according to the rhythm, mode, and clef information in a certain phoneme, the general rhythm, mode, and clef information of the coded section can be known), or it can also indicate that the phoneme itself involves related rhythm, mode, and spectrum number information, and then decode the location when stitching together, generate phonemes, etc. are all related to this, and the rhythm, mode, and clef information in each phoneme of the same channel used to stitch together the sound of a piece of music should be consistent with each other, decoding The obtained phonemes of the same channel, the rhythm, mode, and clef information in the phoneme should be consistent with each other, and the phonemes of each location in each channel, or the rhythm, mode, and clef in the phoneme The information is equivalent to the rhythm, mode, and clef information of the corresponding channel of the encoded music segment. When decoding, the phonemes of each location of each channel obtained by decoding in the order of pronunciation, the rhythm, mode, and clef information in the phoneme-like The pronunciation of the information of each sound can be used to obtain the pronunciation of the passage. In addition to the above two situations, it can also be other situations; and for the one-to-one passage mentioned in the previous paragraph, it has the rhythm, mode, and clef of the vocal tract. This situation is more complicated. For example, when decoding, according to the rhythm, mode, and clef information in a phoneme in the value of the combined information, directly change the rhythm, mode or spectrum of a certain channel of the piece according to the agreement. number, or otherwise. The general rhythm, mode, and clef information of the coded section mentioned above in this paragraph means that the coded section has been using the rhythm, mode, and clef information without changing, and the rhythm, mode, and clef information can be It is the information of several channels, that is, how many channels are the coded section, and the rhythm, mode, and clef information are the information of several channels. The above-mentioned 'rhythm, mode, and clef information in each phoneme should be consistent with each other' can be understood in this way: for example, each phoneme contains 3 phonemes (phoneme 1, phoneme 2, phoneme 3), then in the first few quotation marks The term means that the rhythm information in phoneme 1 is the same as the rhythm information in phoneme 2 and the rhythm information in phoneme 3, the mode information in phoneme 1 is the same as the mode information in phoneme 2 and phoneme 3, and the mode information in phoneme 1 is the same as that in phoneme 3. The clef information in phoneme 2 is the same as the clef information in phoneme 3; the previous statement similar to 'the rhythm, mode, and clef information in each phoneme should be consistent with each other' is also according to the previous few Sentence described by analogy. The rhythm, mode, and clef of the coded music section mentioned in this manual are invariable, which means that the rhythm, mode, and clef in any channel of the coded music section are all invariant (the constant is for the same The front and back of the channel remain unchanged); the rhythm, mode, and clef of the coded section mentioned in this manual are changed, which means that in any channel in all or part of the channels of the coded section, the rhythm, mode, One or more changes in the clef (changes are for forward and backward changes within the same channel).

对于本申请文件所说的用音素来拼合乐段发音,还有一些极端情况(本段和以下3段是极端情况的举例),比如编码规则和与其专门对应的解码规则里定义的音素都是自定位的(任一音素里可以有或没有音色,操作信息这样的其它附加信息,这里的操作信息比如是指示按怎样的步骤解码),用此编码规则对一乐段编码生成的拼合信息的值里的前面(也可以是拼合信息的值里的后面等地方)放有被编码乐段的节奏,调式,谱号信息,再按乐段的不同声道分别各自定义一些不同的数据块(如乐段声道只有一个那数据块也只有一个),每个数据块里包含对应声道发音的所有音素(自定位的音素)各自的音素编号(每个数据块里各音素的音素编号按音素发音先后顺序排列,不需再放用于对音素进行定位的上下位置值1),拼合信息的值里包含所有这些数据块,以上各声道的音素编号也可在拼合信息的值里按其它方式放,只要能从拼合信息的值里取出各声道的音素编号即可,按此解码规则解码时,先按在拼合信息的值里得到的被编码乐段的节奏,调式,谱号信息来设置被编码乐段开始发音时的所有声道各自的节奏,调式,谱号等参数(如被编码乐段的节奏,调式,谱号是变的,那根据前述被编码乐段的节奏,调式,谱号信息在后面解码过程中还要再设置被编码乐段的某些声道的节奏,调式,谱号等参数),再按分步发音方式(下面每步解码做什么可根据约定或根据音素里的操作信息来定),从这拼合信息的值里每步取出乐段所有声道中每个声道的一个音素编号,将取出的乐段所有声道中每个声道的一个音素编号都直接拿去发音(按音素编号在音素数据库里找到其对应的记录项并从这记录项里取出音的信息比如各音的音高值,音长值,发音时刻值等信息来发音),照此方式解码结束即完成播放乐段的全部发音或部分发音,这种情况,每步取出音素编号并拿去发音并没作什么处理(没作顺序拼合,交织拼合,复制处理,定位等处理),这种情况如乐段是多个声道的话,每步里取得的不同声道的音素编号对应的音素的发音持续时间是相同的,这种情况,可把按分步发音方式每步取数个音素编号并拿去发音,由这些步的发音形成乐段的全部或部分发音本身看成是根据音素来作处理。本申请文件所说的用音素来拼合乐段发音是包括本段以上所述情况的。本段所述的可从一个拼合信息的值里直接得到的音素(实际上直接得到的是一些音素的音素编号)中,至少有一个音素中是包含有不同发音时刻值的;本段各处提及的乐段是指被编码乐段。本段各处提及的乐段的所有声道可以是一个或多个声道。本段所说这种情况里被编码乐段的节奏,调式,谱号可以是不变的,也可以是变的(如是变的,那在拼合信息的值里放的被编码乐段的节奏,调式,谱号信息里就可包含何时变,怎么变的信息)。There are also some extreme cases (this paragraph and the following 3 paragraphs are examples of extreme cases) of using phonemes to combine the pronunciation of the passage mentioned in this application document, such as the phonemes defined in the coding rules and the corresponding decoding rules. Self-positioning (any phoneme may or may not have timbre, other additional information such as operation information, where the operation information is, for example, indicating the steps to decode), use this coding rule to code a musical segment to generate the combined information The front of the value (or the back of the value of the combined information) puts the rhythm, mode, and clef information of the coded section, and then defines some different data blocks according to the different channels of the section ( If there is only one audio channel of the music section, there is only one data block), and each data block contains the respective phoneme numbers of all phonemes (self-positioned phonemes) pronounced by the corresponding channel (the phoneme numbers of each phoneme in each data block are pressed by The pronunciation of the phonemes is arranged in order, and there is no need to put the upper and lower position values 1) for positioning the phonemes. The value of the combined information contains all these data blocks. The phoneme numbers of the above channels can also be pressed in the value of the combined information. Play in other ways, as long as the phoneme number of each channel can be extracted from the value of the combined information. When decoding according to this decoding rule, first follow the rhythm, mode, and clef of the coded segment obtained in the value of the combined information information to set the respective rhythm, mode, clef and other parameters of all channels when the coded section starts to pronounce (as the rhythm of the coded section, the mode, and the clef change, then according to the rhythm of the aforementioned coded section , mode, and clef information in the subsequent decoding process, and then set the rhythm, mode, clef and other parameters of some channels of the coded section), and then follow the step-by-step pronunciation method (what to do in each step of decoding below can be based on According to the agreement or according to the operation information in the phoneme), a phoneme number of each channel in all channels of the segment is taken out from the value of the combined information at each step, and each channel in all channels of the extracted segment is All the phoneme numbers are taken directly for pronunciation (according to the phoneme number, find its corresponding record item in the phoneme database and take out the information of the sound from this record item, such as the pitch value of each sound, the sound length value, the pronunciation time value and other information Pronunciation), after decoding in this way, all or part of the pronunciation of the playback section is completed. In this case, the phoneme number is taken out in each step and pronounced without any processing (no sequence stitching, interweaving stitching, copy processing , positioning, etc.), in this case, if the music section has multiple channels, the pronunciation duration of the phonemes corresponding to the phoneme numbers of different channels obtained in each step is the same. The pronunciation method takes several phoneme numbers in each step and takes them for pronunciation, and the pronunciation of all or part of the passage formed by the pronunciation of these steps is regarded as being processed according to the phonemes. The use of phonemes in the said application documents to combine the pronunciation of the music section includes the above-mentioned situation in this paragraph. Among the phonemes that can be directly obtained from the value of a combined information described in this paragraph (in fact, the phoneme numbers of some phonemes are obtained directly), at least one phoneme contains different pronunciation time values; everywhere in this paragraph References to sections refer to encoded sections. All channels of passages mentioned throughout this paragraph may be one or more channels. In the case mentioned in this paragraph, the rhythm, mode, and clef of the coded section can be unchanged or changed (if it is changed, the rhythm of the coded section placed in the value of the combined information , mode, information on when to change and how to change).

再比如编码规则和与其专门对应的解码规则里定义的音素都是自定位的(任一音素里都有节奏,调式,谱号信息,任一音素里可以有或没有音色,操作信息这样的其它附加信息,这里的操作信息比如是指示按怎样的步骤解码),用此编码规则对一乐段编码生成拼合信息的值时,按乐段的不同声道分别各自定义一些不同的数据块(如乐段声道只有一个那数据块也只有一个),每个数据块里包含对应声道发音的所有音素(这一些自定位的音素中各音素里都包括节奏,调式,谱号信息)各自的音素编号(每个数据块里各音素的音素编号按音素发音先后顺序排列,不需再放用于对音素进行定位的上下位置值1),拼合信息的值里包含所有这些数据块,以上各声道的音素编号也可在拼合信息的值里按其它方式放,只要能从拼合信息的值里取出各声道的音素编号即可,按此解码规则解码时,按分步发音方式(解码时每步做什么可根据约定或根据音素里的操作信息来定),从这拼合信息的值里每步取出乐段所有声道中每个声道的一个音素编号,将取出的乐段所有声道中每个声道的一个音素编号都直接拿去发音(按音素编号在音素数据库里找到其对应的记录项并从这记录项里取出音的信息比如各音的音高值,音长值,发音时刻值等信息来发音,且发音时要按记录项里的节奏,调式,谱号信息设置此音素所处声道当前发音位置的节奏,调式,谱号等发音参数并把音素发出音来),照此方式解码结束即完成播放乐段的全部发音或部分发音,这种情况,每步取出音素编号并拿去发音并没作什么处理(没作顺序拼合,交织拼合,复制处理,定位等处理),这种情况如乐段是多个声道的话,每步里取得的不同声道的音素编号对应的音素的发音持续时间是相同的,这种情况,可把按分步发音方式每步取数个音素编号并拿去发音,由这些步的发音形成乐段的全部或部分发音本身看成是根据音素来作处理。本申请文件所说的用音素来拼合乐段发音是包括本段以上所述情况的。本段所述的可从一个拼合信息的值里直接得到的音素(实际上直接得到的是一些音素的音素编号)中,至少有一个音素中是包含有不同发音时刻值的;本段各处提及的乐段是指被编码乐段。本段各处提及的乐段的所有声道可以是一个或多个声道。本段所说这种情况里被编码乐段的节奏,调式,谱号可以是不变的,也可以是变的。Another example is that the phonemes defined in the encoding rules and the corresponding decoding rules are all self-positioning (any phoneme has rhythm, mode, and clef information, and any phoneme can have or not have timbre, operation information, etc. Additional information, the operation information here is, for example, indicating how to decode), when using this encoding rule to encode a music section to generate the value of the combined information, define some different data blocks according to the different channels of the music section (such as There is only one piece of music, and there is only one data block), and each data block contains all the phonemes of the corresponding channel pronunciation (these self-positioned phonemes include rhythm, mode, and clef information in each phoneme). Phoneme numbers (the phoneme numbers of each phoneme in each data block are arranged in the order of pronunciation of the phonemes, and there is no need to put the upper and lower position values 1 for locating the phonemes), and the value of the combined information includes all these data blocks. The phoneme number of sound channel also can be put in other ways in the value of mosaic information, as long as can take out the phoneme number of each sound channel from the value of mosaic information, when decoding according to this decoding rule, press the step-by-step pronunciation mode (decoding What to do at each step can be determined according to the agreement or according to the operation information in the phoneme), from the value of the combined information, take out a phoneme number of each channel in all channels of the music section at each step, and use all the taken out music sections A phoneme number of each channel in the channel is directly used for pronunciation (find the corresponding record item in the phoneme database according to the phoneme number and take out the information of the sound from this record item, such as the pitch value of each sound, the sound length Value, pronunciation time value and other information to pronounce, and the rhythm, mode, clef information in the record item should be set according to the rhythm, mode, clef and other pronunciation parameters of the current pronunciation position of the channel where the phoneme is located and the phoneme will be emitted Tone), after decoding in this way, the whole or part of the pronunciation of the playback section is completed. In this case, the phoneme number is taken out in each step and pronounced without doing any processing (no sequence stitching, interweaving stitching, copy processing , positioning, etc.), in this case, if the music section has multiple channels, the pronunciation duration of the phonemes corresponding to the phoneme numbers of different channels obtained in each step is the same. The pronunciation method takes several phoneme numbers in each step and takes them for pronunciation, and the pronunciation of all or part of the passage formed by the pronunciation of these steps is regarded as being processed according to the phonemes. The use of phonemes in the said application documents to combine the pronunciation of the music section includes the above-mentioned situation in this paragraph. Among the phonemes that can be directly obtained from the value of a combined information described in this paragraph (in fact, the phoneme numbers of some phonemes are obtained directly), at least one phoneme contains different pronunciation time values; everywhere in this paragraph References to sections refer to encoded sections. All channels of passages mentioned throughout this paragraph may be one or more channels. In the situation mentioned in this paragraph, the rhythm, mode, and clef of the coded section can be constant or variable.

再比如编码规则和与其专门对应的解码规则里定义的音素都是自定位的(任一音素里可以有或没有音色,操作信息这样的其它附加信息,这里的操作信息比如是指示按怎样的步骤解码),用此编码规则对一乐段编码生成的拼合信息的值里的前面(也可以是拼合信息的值里的后面等地方)放有被编码乐段的节奏,调式,谱号信息,再按乐段的不同声道分别各自定义一些不同的数据块(如乐段声道只有一个那数据块也只有一个),每个数据块里包含对应声道发音的所有音素(自定位的音素)各自的音素编号(每个数据块里各音素的音素编号按音素发音先后顺序排列,不需再放用于对音素进行定位的上下位置值1),拼合信息的值里包含所有这些块,以上各声道的音素编号也可在拼合信息的值里按其它方式放,只要能从拼合信息的值里取出各声道的音素编号即可,解码时,先按在拼合信息的值里得到的被编码乐段的节奏,调式,谱号信息来设置被编码乐段开始发音时的所有声道各自的节奏,调式,谱号等参数(如被编码乐段的节奏,调式,谱号是变的,那根据前述被编码乐段的节奏,调式,谱号信息在后面解码过程中还要再设置被编码乐段的某些声道的节奏,调式,谱号等参数),再按分步发音方式(下面每步解码做什么可根据约定或根据音素里的操作信息来定),每步从这拼合信息的值里取出每个声道的一些音素编号,将取出的这些音素编号拿去发音(不用对音素作定位),照此方式解码结束即完成播放乐段的全部发音或部分发音,且其中某步从这拼合信息的值里顺序取出的那些音素编号中至少包含一个声道(假设这声道为某声道)的多个音素编号,这种情况这某步里可看成有把前述某声道的前述多个音素编号先作顺序拼合(组拼形式),再把得到的结果与其它声道的音素编号拿去发音(对于结果中的每个音素编号,按音素编号在音素数据库里找到其对应的记录项并从这记录项里取出音的信息比如各音的音高值,音长值,发音时刻值等信息来发音),这种情况如乐段是多个声道的话,每步里,不同声道按取得的数个音素编号来发音所持续的时间(数个音素发音时间的和)是相同的。本申请文件所说的用音素来拼合乐段发音是包括本段以上所述情况的。本段所述的可从一个拼合信息的值里直接得到的音素(实际上直接得到的是一些音素的音素编号)中,至少有一个音素中是包含有不同发音时刻值的;本段各处提及的乐段是指被编码乐段。本段各处提及的乐段的所有声道可以是一个或多个声道。本段所说这种情况里被编码乐段的节奏,调式,谱号可以是不变的,也可以是变的(如是变的,那在拼合信息的值里放的被编码乐段的节奏,调式,谱号信息里就可包含何时变,怎么变的信息)。Another example is that the phonemes defined in the encoding rules and the corresponding decoding rules are all self-positioning (any phoneme can have or not have timbre, other additional information such as operation information, and the operation information here is, for example, indicating what steps to follow decoding), using this encoding rule to encode the rhythm, mode, and clef information of the encoded music section in the front of the value of the pieced information generated by encoding a piece of music (it can also be in the back of the value of the pieced information, etc.), Then define some different data blocks according to the different channels of the music segment (if there is only one channel in the music segment, then there is only one data block). ) respective phoneme numbers (the phoneme numbers of each phoneme in each data block are arranged in the order of pronunciation of the phonemes, and there is no need to put the upper and lower position values 1 for locating the phonemes), and the value of the combined information includes all these blocks, The phoneme numbers of the above channels can also be placed in other ways in the value of the combined information, as long as the phoneme numbers of each channel can be taken out from the value of the combined information. When decoding, first press the value of the combined information to get The rhythm, mode, and clef information of the coded section are used to set the respective rhythm, mode, clef and other parameters of all channels when the coded section starts to sound (such as the rhythm, mode, and clef of the coded section are change, then according to the rhythm, mode, and clef information of the aforementioned encoded section, the rhythm, mode, clef and other parameters of some channels of the encoded section should be set in the subsequent decoding process), and then press Step-by-step pronunciation method (the decoding of each step below can be determined according to the agreement or according to the operation information in the phoneme), each step takes out some phoneme numbers of each channel from the value of the combined information, and takes the phoneme numbers taken out Pronunciation (without positioning the phonemes), and after decoding in this way, the entire pronunciation or partial pronunciation of the playback section is completed, and at least one channel is included in the phoneme numbers sequentially extracted from the value of the combined information in a certain step (assuming that this channel is a plurality of phoneme numbers of a certain channel), in this case, in this certain step, it can be regarded as that the aforementioned multiple phoneme numbers of the aforementioned certain channel are first combined in sequence (combined form), and then put together Get the result and the phoneme numbers of other channels to pronounce (for each phoneme number in the result, find its corresponding record item in the phoneme database according to the phoneme number and take out the information of the sound from this record item, such as the phoneme number of each sound Pitch value, sound length value, pronunciation time value and other information to pronounce), in this case, if the piece has multiple channels, in each step, different channels are pronounced according to the number of phonemes obtained. (the sum of several phoneme pronunciation times) is the same. The use of phonemes in the said application documents to combine the pronunciation of the music section includes the above-mentioned situation in this paragraph. Among the phonemes that can be directly obtained from the value of a combined information described in this paragraph (in fact, the phoneme numbers of some phonemes are obtained directly), at least one phoneme contains different pronunciation time values; everywhere in this paragraph References to sections refer to encoded sections. All channels of passages mentioned throughout this paragraph may be one or more channels. In the case mentioned in this paragraph, the rhythm, mode, and clef of the coded section can be unchanged or changed (if it is changed, the rhythm of the coded section placed in the value of the combined information , mode, information on when to change and how to change).

如编码规则和与其专门对应的解码规则里定义的音素都是自定位的(任一音素里可以有或没有音色,操作信息这样的其它附加信息,这里的操作信息比如是指示按怎样的步骤解码),用此编码规则对一乐段编码生成拼合信息的值时,按乐段的不同声道分别各自定义一些不同的数据块(如乐段声道只有一个那数据块也只有一个),每个数据块里包含对应声道发音的所有音素(自定位的音素)各自的音素编号(每个数据块里各音素的音素编号按音素发音先后顺序排列,不需再放用于对音素进行定位的上下位置值1),拼合信息的值里包含所有这些数据块,以上各声道的音素编号也可在拼合信息的值里按其它方式放,只要能从拼合信息的值里取出各声道的音素编号即可,按此解码规则解码时,按整步发音方式(解码时各步骤做什么可根据约定或根据音素里的操作信息来定),从这拼合信息的值里取出乐段所有声道中每个声道的所有这些音素编号再一起拿去用于发音(不用对音素作定位,按音素编号在音素数据库里找到其对应的记录项并从这记录项里取出音的信息比如各音的音高值,音长值,发音时刻值等信息来发音),以得到乐段的全部发音或部分发音,这种情况,可看成最后是把乐段所有声道中每个声道的所有这些自定位的音素进行顺序拼合(组拼形式)得到某一结果,再把此某一结果用于后面发音;本段所说情况可以是在拼合信息的值里的前面(也可以是拼合信息的值里的后面等地方)放有被编码乐段的节奏,调式,谱号信息,解码开始就用这些被编码乐段的节奏,调式,谱号信息先设置被编码乐段开始发音时的所有声道各自的节奏,调式,谱号等参数,再处理得到前述某一结果并按此某一结果发音(这种情况时被编码乐段的节奏,调式,谱号可以是不变的,也可以是变的,如是变的,那按这些被编码乐段的节奏,调式,谱号信息在后面按前述某一结果发音的过程中还需设置某些声道的节奏,调式,谱号等参数),或把这些被编码乐段的节奏,调式,谱号信息与前述某一结果组成另一结果并按此另一结果发音(这种情况时被编码乐段的节奏,调式,谱号可以是不变的,也可以是变的),也可以是在拼合信息的值里的每个音素里有节奏,调式,谱号信息,按前述某一结果解码发音时,对各音素,按音素里的这些其它附加信息来设置各音素所处声道当前发音位置的节奏,调式,谱号等参数并把音素发出音来(这种情况时被编码乐段的节奏,调式,谱号可以是不变的,也可以是变的),也可以是在被编码乐段最先发音的某个声道的一音素里有节奏,调式,谱号信息(这里的节奏,调式,谱号信息也正好是被编码乐段的所有声道各自的节奏,调式,谱号信息),按前述某一结果解码发音时,按这个音素里的这些其它附加信息来设置整个被编码乐段的所有声道各自的节奏,调式,谱号等发音参数并把各音素按这些发音参数发出音来(这种情况时被编码乐段的节奏,调式,谱号是不变的)。本申请文件所说的用音素来拼合乐段发音是包括本段以上所述情况的。本段各处提及的乐段的所有声道可以是一个或多个声道。For example, the phonemes defined in the encoding rules and the corresponding decoding rules are self-positioning (any phoneme may or may not have timbre, and other additional information such as operation information, where the operation information is, for example, indicating what steps to decode ), when using this encoding rule to encode a music section to generate the value of the combined information, define some different data blocks respectively according to the different channels of the music section (if there is only one music section channel, then there is only one data block). Each data block contains the respective phoneme numbers of all the phonemes (self-positioned phonemes) pronounced by the corresponding vocal tract (the phoneme numbers of each phoneme in each data block are arranged in the order of pronunciation of the phonemes, and do not need to be used for positioning the phonemes. The upper and lower position values of 1), the value of the mosaic information contains all these data blocks, and the phoneme numbers of the above channels can also be placed in other ways in the value of the mosaic information, as long as each channel can be taken out from the value of the mosaic information The phoneme number is enough. When decoding according to this decoding rule, according to the whole-step pronunciation method (what to do in each step during decoding can be determined according to the agreement or according to the operation information in the phoneme), take out all the pieces from the value of the combined information All these phoneme numbers of each channel in the channel are used together for pronunciation (no need to locate the phoneme, find the corresponding record item in the phoneme database according to the phoneme number and take out the sound information from this record item, such as The pitch value of each sound, the sound length value, the pronunciation time value and other information to pronounce), to obtain the whole pronunciation or part of the pronunciation of the music section, in this case, it can be regarded as the final sound of each sound in all the channels of the music section All these self-positioned phonemes of the Tao are sequentially combined (combined form) to obtain a certain result, and then this certain result is used for subsequent pronunciation; the situation mentioned in this paragraph can be in front of the value of the combined information (also can be It is the back of the value of the combined information, etc.) put the rhythm, mode, and clef information of the encoded section, and use the rhythm, mode, and clef information of the encoded section to set the encoded section first. The parameters such as the respective rhythms, modes, and clefs of all vocal channels during pronunciation are processed to obtain a certain result above and pronounced according to this certain result (in this case, the rhythm of the coded section, the mode, and the clef can be different Change, it can also be changed, if it is changed, then according to the rhythm, mode and clef information of these coded sections, the rhythm and mode of certain channels need to be set in the process of pronunciation according to the aforementioned result , clef and other parameters), or the rhythm of these coded sections, mode, clef information and the aforementioned certain result form another result and pronounce according to this other result (in this case, the rhythm of the coded section, The mode and clef can be constant or variable), or there can be rhythm, mode and clef information in each phoneme in the value of the combined information. When decoding the pronunciation according to one of the aforementioned results, the Each phoneme, according to these other additional information in the phoneme, set the rhythm, mode, clef and other parameters of the current pronunciation position of the channel where each phoneme is located and sound the phoneme (in this case, the rhythm and mode of the coded section , the clef can be constant or variable), or it can be a phoneme of a certain channel that is first pronounced in the coded section There are rhythm, mode, and clef information (the rhythm, mode, and clef information here are exactly the rhythm, mode, and clef information of all the channels of the coded section). When decoding the pronunciation according to one of the aforementioned results, According to these other additional information in this phoneme, the respective rhythms of all sound channels of the entire coded section are set, the pronunciation parameters such as mode, clef and each phoneme are sounded according to these pronunciation parameters (in this case, the coded music The rhythm, mode, and clef of the section are unchanged). The use of phonemes in the said application documents to combine the pronunciation of the music section includes the above-mentioned situation in this paragraph. All channels of passages mentioned throughout this paragraph may be one or more channels.

以上4段各处提及的‘拼合信息的值里的前面’,是指处于拼合信息的值的内部的,比如拼合信息的值分3个部分,最开始是部分1,接着是部分2,再接着是部分3,那部分1就是放在拼合信息的值里的前面的。The "before the value of the combined information" mentioned in the above 4 paragraphs refers to the inside of the value of the combined information. For example, the value of the combined information is divided into 3 parts, the first part is part 1, followed by part 2, Then there is part 3, and that part 1 is placed in front of the value of the combined information.

显然编码规则和解码规则可以有许多种,按照不同的编码规则得到的乐段拼合信息的值的格式一般是不同的,可按照本说明书后面叙述的调式的处理,大小调处理,节奏的处理,音素的定义,对于对重复的音组合的处理,乐段拼合信息的值的结构的定义等来制定一个编码规则举例和一个解码规则举例,此作为举例的编码规则(编码规则举例)包括这些细则:对调式的处理,大小调处理,节奏的处理,音素的定义,对音素赋予音素编号(这几条都按本说明书后面相关所述处理),建立音素数据库,对重复的音组合的编码(也按照本说明书后面所述来编码),编码生成的乐段拼合信息的值的结构的定义也按照本说明书后面所述,等;后面的各编码举例(编码举例1和编码举例2)是按照此编码规则来作的;与此编码规则专门对应的此解码规则举例包括这些细则:对调式的处理,大小调处理,节奏的处理,音素的定义,对音素赋予音素编号(这几条都按本说明书后面相关所述处理),建立音素数据库(此音素数据库与编码规则用的音素数据库相同或兼容),对乐段拼合信息的值的分析解码(也按照本说明书后面所述的乐段拼合信息的值的结构的定义来解码),按本说明书后面所述的重复音组合的结构对乐段拼合信息的值中重复音组合涉及的信息进行解码,等;后面的各解码举例(解码举例1和解码举例2)是按照此解码规则来进行的。本段说的此作为举例的解码规则与本段说的此作为举例的编码规则是专门对应的。Obviously there can be many kinds of encoding rules and decoding rules. The format of the value of the piece combination information obtained according to different encoding rules is generally different. It can be processed according to the mode, major and minor, and rhythm described later in this manual. The definition of phonemes, for the processing of repeated sound combinations, the definition of the structure of the value of the music segment combination information, etc. to formulate an example of encoding rules and an example of decoding rules, this encoding rule (example of encoding rules) as an example includes these detailed rules : the processing of the mode, the processing of the major and minor scales, the processing of the rhythm, the definition of the phonemes, the phonemes are given phoneme numbers (these are all processed according to the relevant descriptions in the back of this manual), the phoneme database is set up, and the coding of repeated sound combinations ( Also encode according to the description later in this manual), the definition of the structure of the value of the music piece combination information generated by encoding is also according to the description later in this manual, etc.; the following coding examples (coding example 1 and coding example 2) are according to This encoding rule is done; This decoding rule specially corresponding to this encoding rule includes these details for example: the processing of mode, the processing of major and minor scales, the processing of rhythm, the definition of phonemes, and assigning phoneme numbers to phonemes (these several are all according to This instruction manual is related to the described processing later), set up a phoneme database (this phoneme database is the same as or compatible with the phoneme database used by the coding rules), and analyze and decode the value of the music segment combination information (also according to the music segment combination described later in this specification sheet The definition of the structure of the value of the information to decode), according to the structure of the repeating sound combination described later in this manual, the information related to the repeating sound combination in the value of the music section stitching information is decoded, etc.; each decoding example in the back (decoding example 1 and decoding example 2) are carried out according to this decoding rule. The decoding rule mentioned in this paragraph as an example corresponds specifically to the encoding rule mentioned in this paragraph as an example.

调式的处理:按常规,各个不同乐曲的调式经常是不同的,比如C大调,G大调等,对于一个处在五线谱上某线上的单音来说,在各个不同调式下,其音的高低差别不大,一般就是高半度,或低半度而已,而对于两个相邻的音,如两音之间的音程为3度,那在各种不同的调式下,两音之间的音程也只有大3度和小3度的区别。所以在对某乐段编码前,可先看其调式,然后在此调式下,确定各音被组合起来后可由哪些音素来分别表示,再做编码。比如两个相邻的音,两音的音程为3度(后音的音高高于前音),如把此两音的组合定义成音素,那在不同调式下,这两音的组合应该可定义成两个不同音素(每音素含两音),一个音素里两音的音程为大3度,另一个音素里两音的音程为小3度。Handling of modes: Conventionally, the modes of different pieces of music are often different, such as C major, G major, etc. For a single tone on a certain line on the stave, in each mode, its tone There is not much difference between the height and the low, which is generally half a degree higher or lower. For two adjacent tones, if the interval between the two tones is 3 degrees, then in various modes, the difference between the two tones The interval between them is only a major 3rd degree and a minor 3rd degree difference. Therefore, before encoding a certain music section, one can first look at its mode, and then under this mode, determine which phonemes can be used to represent each sound after being combined, and then encode. For example, for two adjacent tones, the interval between the two tones is 3 degrees (the pitch of the back tone is higher than that of the front tone). If the combination of these two tones is defined as a phoneme, then under different modes, the combination of these two tones should be It can be defined as two different phonemes (each phoneme contains two tones), the interval of two tones in one phoneme is a major 3 degree, and the interval of two tones in another phoneme is a minor 3 degree.

大小调的处理:对于每一种相同的调式音级的标记形式,它既可能是大调,又可能是小调,这两种调的五线谱各条线上的音高情况相同,只是两种调使用各自不同的和弦方式,以构成不同的音乐的色彩;对其处理,既可以对大小调区别对待,也可不区别对待,即只认大调式,不管小调式,小调式情况下的各音素划分同于大调式处理,本说明书作为举例的编码规则和解码规则以按后种情况不区别对待大小调处理为例。Handling of major and minor scales: For each marking form of the same mode of sound level, it may be a major scale or a minor scale. Different ways of chords are used to form different colors of music; to deal with it, the major and minor scales can be treated differently or not, that is, only the major mode is recognized, regardless of the minor mode, and the division of each phoneme in the case of the minor mode The same as the processing of the major mode, the coding rules and decoding rules used in this manual as an example take the processing of major and minor modes as an example in the latter case without distinction.

节奏的处理:可以在拼合信息的值的编码头里记录节奏的信息,比如每分钟多少拍,是把几分音符当成一拍,每小节里多少拍。解码时,根据拼合信息的值的编码头里的信息,设置音乐播放的速度,节拍等。本编码规则举例和解码规则举例是把4分音符定义成一拍。(本段说的拍是指实际拍)Rhythm processing: Rhythm information can be recorded in the encoding header of the combined information value, such as how many beats per minute, how many cents of notes are regarded as one beat, and how many beats per measure. When decoding, according to the information in the encoding header of the value of the combined information, set the speed and beat of the music playback. The coding rule example and the decoding rule example are to define quarter notes as one beat. (The shooting mentioned in this paragraph refers to the actual shooting)

对于本编码规则举例和解码规则举例涉及编解码的任一乐段,默认在表示这乐段的五线谱里是不改变调式,也不改变节奏,也不改变谱号的。For any musical section involving encoding and decoding in this encoding rule example and decoding rule example, the mode, rhythm, and clef will not be changed by default in the stave representing this musical section.

以下叙述中提到音组合时,都是默认音组合涉及的拍是把4分音符当成一拍的。When the sound combination is mentioned in the following description, the beats involved in the default sound combination are regarded as a quarter note as a beat.

除非特别指明有他意或明显可看出有他意,以下叙述中提到音组合时,都是默认一个音组合中的所有音都是处于同一个声道的。Unless otherwise specified or clearly seen, when referring to sound combinations in the following descriptions, it is assumed that all the sounds in a sound combination are in the same channel.

音素的定义:Phoneme definition:

本编码规则举例和解码规则举例涉及定义的音素都是以音素是音组合为例的,说音素是某音组合(比如说音素是围绕音,音素是单音重复),其意思是把音组合定义成音素。本编码规则举例和解码规则举例涉及定义的音素(包括以下叙述中涉及定义的音素),都是以4分音符为1抽象拍,如被编码乐段不是以4分音符为1实际拍,那仍可用这些音素来做拼合,只是解码拼合时要对拍的标准作转换。This encoding rule example and the decoding rule example relate to the phoneme of definition and all take the phoneme as an example of a sound combination, say that a phoneme is a certain sound combination (for example, a phoneme is a surrounding sound, and a phoneme is a monophonic repetition), which means that the sound combination Defined as phonemes. This encoding rule example and the decoding rule example involve the defined phonemes (including the phonemes defined in the following descriptions), all of which take quarter notes as an abstract beat. If the coded section is not an actual beat with a quarter note, then These phonemes can still be used for splicing, but the beat standard needs to be converted when decoding the splicing.

考量乐曲的构成,可以发现,乐曲是可以由一系列音组合排列组成的(这里‘一系列’是修饰‘音组合’的),音组合可以是一小段音阶(如图9),或者将一个同样的音重复多次(如图8),或者一个和弦的分解(如图10),等;本说明书作为举例的编码规则和解码规则将这些音组合(这里‘这些’是修饰‘音组合’的)各自定义为音素,音素甚至可以是单音。Considering the composition of the melody, it can be found that the melody can be composed of a series of tone combinations (here 'series' is modified 'tone combination'), and the tone combination can be a small scale (as shown in Figure 9), or a The same tone is repeated multiple times (as shown in Figure 8), or the decomposition of a chord (as shown in Figure 10), etc.; this manual is used as an example of encoding rules and decoding rules to combine these sounds (here 'these' is a modified 'sound combination' ) are defined as phonemes, which can even be monophonic.

音素还可以是围绕音,一般由4个音组成。还有可以统计一下各种乐曲经常使用的音组合,也可定义成音素,比如钢琴的左手伴奏的音比较简单而且有规律,容易归纳统计一些音组合并把这一些音组合各自定义成音素(这里‘一些’是修饰‘音组合’的),可在不同乐曲的乐段编码时使用。A phoneme can also be a surround sound, which generally consists of 4 sounds. In addition, you can count the frequently used sound combinations of various pieces of music, which can also be defined as phonemes. For example, the left-hand accompaniment of the piano has relatively simple and regular sounds. It is easy to summarize and count some sound combinations and define these sound combinations as phonemes ( Here 'some' is used to modify the 'sound combination'), which can be used in the encoding of different music sections.

总之,音素可有这些类:一类音素里的各音素是把单音重复(音组合)定义成音素,比如单音重复2次,3次,4次都可分别定义成不同的音素,这些不同音素属于这一类;一类音素里的各音素是把和弦分解(音组合)定义成音素,比如对于大三和弦,小三和弦,等的分解,就可分别定义成不同的音素,这些不同音素属于这一类;一类音素里的各音素是把音阶(音组合)定义成音素,比如3个音的音阶,5个音的音阶,可分别定义成不同音素,这些不同音素属于这一类;一类音素里的各音素是把围绕音(音组合)定义成音素,可把不同的围绕音分别定义成不同音素,这些不同音素属于这一类;一类音素里的各音素是把单音定义成音素,可把不同音长的单音分别成不同的音素,这些不同音素属于这一类;还可有其它类的音素。In a word, phonemes can have these classes: each phoneme in a class of phonemes is to define monophonic repetition (sound combination) as phoneme, such as monophonic repetition 2 times, 3 times, 4 times can be defined as different phonemes respectively, these Different phonemes belong to this category; each phoneme in a class of phonemes defines chord decomposition (sound combination) as phonemes, such as the decomposition of major triads, minor triads, etc., which can be defined as different phonemes respectively. Phonemes belong to this category; each phoneme in a class of phonemes defines scales (sound combinations) as phonemes, such as scales with 3 tones and scales with 5 tones, which can be defined as different phonemes respectively, and these different phonemes belong to this category. class; each phoneme in a class of phonemes is to define the surround sound (sound combination) as a phoneme, and different surround sounds can be defined as different phonemes respectively, and these different phonemes belong to this class; each phoneme in a class of phonemes is to define Monotones are defined as phonemes, and monotones of different sound lengths can be divided into different phonemes, and these different phonemes belong to this category; there may also be other categories of phonemes.

休止符也可被定义成一类音素(这里‘一类’是修饰‘音素’的),对于不同长度的休止符,可分别定义成不同音素,这些不同音素属于这一类,休止符音素可看成是自定位的音素,休止符音素里含有一个音的信息(这个音的音长值和音高值),但这个音的音高值为某个表示不发音的值。Rests can also be defined as a class of phonemes (here 'a class' modifies 'phonemes'). For rests of different lengths, they can be defined as different phonemes respectively. These different phonemes belong to this class. Rests phonemes can be regarded as self- The phoneme of positioning, the rest phoneme contains information of a sound (sound length value and pitch value of this sound), but the pitch value of this sound is a certain value indicating silence.

对于两个音组合,如组成前一个音组合的音的个数与组成后一个音组合的音的个数相同,且前一个音组合里的各音互相之间的音程关系数据与后一个音组合里的各音互相之间的音程关系数据相同,前一个音组合中各音的发音时刻值与后一个音组合中各音的发音时刻值一一对应相同,但前一个音组合中各音的音长与后一个音组合中各音的音长不一一对应相同,那这两个音组合也是被定义成两个不同音素。For two sound combination, the number of the sound that forms the previous sound combination is identical with the number of the sound that forms the following sound combination, and the interval relationship data between each sound in the previous sound combination is the same as that of the latter sound combination. The interval relationship data between each sound in the combination is the same, the pronunciation time value of each sound in the previous sound combination is the same as the pronunciation time value of each sound in the next sound combination, but the pronunciation time value of each sound in the previous sound combination is the same. The length of the sound is not the same as the sound length of each sound in the latter sound combination, so these two sound combinations are also defined as two different phonemes.

对于钢琴等乐谱,各音一般伴有冠音(与根音同时发音),如三度音程冠音,八度音程冠音,如图11是两个带八度冠音的示例音;如有数个不同时刻发音的音,那对这数个音加一些冠音,这数个音和这些冠音就组成音组合1,或对这数个音加另外一些冠音,这数个音和这些另外这些冠音就组成音组合2,这里音组合1和音组合2可被分开定义成两个不同的音素。For music scores such as pianos, each sound is generally accompanied by a crown (pronounced simultaneously with the root), such as the crown of the third and the crown of the octave, as shown in Figure 11. Tones pronounced at different times, then add some crowns to these several tones, these several tones and these crowns constitute sound combination 1, or add some other crowns to these several tones, these several tones and these In addition, these crown sounds form the sound combination 2, where the sound combination 1 and the sound combination 2 can be separately defined as two different phonemes.

对于有些情况,定义的音素里面的音,可能同一时间点激发了多个音,且这多个音是音高各不同的音。For some cases, the sound in the defined phoneme may stimulate multiple tones at the same time point, and these multiple tones are sounds with different pitches.

另外,几个不同的单音,各音的音长相同,组合起来后,其可能处在五线谱的不同高低位置,如果把处在五线谱不同高低位置的其各音互相之间的音程关系数据相同的音组合分别定义成不同音素,由这些音素来拼合成乐段发音,原理上也可行,另也可以改进一下,将其各音互相之间的音程关系数据相同的音组合定义为一个音素。本段有关‘其各音互相之间的音程关系数据相同的音组合’这种说法,这里相同是在不同音组合之间作比较,比如其各音互相之间的音程关系数据相同的两个音组合,是指其中一个音组合里各音互相之间的音程关系数据与另一个音组合里各音互相之间的音程关系数据相同,下段有类似说法(‘其3个音互相之间的音程关系数据相同的音组合’,‘其3个音互相之间的音程关系数据不同的音组合’)也按这样类推理解。In addition, several different monotones have the same sound length. After they are combined, they may be in different high and low positions on the staff. The combination of sounds is defined as different phonemes respectively, and it is also feasible in principle to combine these phonemes into the pronunciation of a passage. In addition, it can be improved, and the combination of sounds with the same interval relationship data between each tone is defined as a phoneme. This paragraph is concerned with the statement of 'sound combinations with the same interval relationship data between each tone', the same here is a comparison between different tone combinations, such as two tones with the same interval relationship data between each tone Combination means that the interval relationship data between the tones in one tone combination is the same as the interval relationship data between the tones in the other tone combination. There is a similar statement in the next paragraph ('the intervals between the three tones The same tone combination of relational data', 'the sound combination of its 3 tone interval relational data difference') also understand by analogy like this.

比如可看图10,其简谱读法为‘35.1’(其中的小点表示低8度),如把此音组合(3个音)都向上移4个距离单位(本说明书有对距离单位的解释),其简谱读法变为‘725’,这3个音互相之间的音程关系数据与‘35.1’情况时3个音互相之间的音程关系数据相同,而如是将‘35.1’的3个音都向上移2个距离单位(本申请文件有对距离单位的解释),其简谱读法为‘57.3’,这3个音互相之间的音程关系数据与‘35.1’情况时3个音互相之间的音程关系数据不同,比如‘57.3’中5与3之间是小三度,而原来‘35.1’中3与1之间是大三度;可以穷举此音组合(‘35.1’3个音)移位到五线谱上不同高低位置而形成的各个音组合(每次移动,这3个音都向上或都向下移且都移同样多个距离单位),考察这各个音组合和此音组合(‘35.1’3个音),其中把其3个音互相之间的音程关系数据相同的音组合定义为相同的音素,把其3个音互相之间的音程关系数据不同的音组合定义为不同的音素。比如‘35.1’定义成音素A(‘725’看成是音素A移位所得,不用再另定义成别的音素),‘57.3’定义成音素B,因其3个音互相之间的音程关系数据与音素A的3个音互相之间的音程关系数据不同。For example, see Figure 10, the numbered notation reads as '35.1' (the dots represent 8 degrees lower), such as moving this sound combination (3 tones) upwards by 4 distance units (this manual has a description of the distance unit Explanation), its numbered notation reading becomes '725', and the interval relationship data between the three tones is the same as the interval relationship data between the three tones in the '35.1' situation, and in this way the 3 tones of '35.1' Each sound is moved up by 2 distance units (this application document has an explanation to the distance unit), and its numbered notation reading method is '57.3', and the interval relationship data between these 3 sounds is the same as the '35.1' situation. Interval relationship data between each other is different, for example, between 5 and 3 in '57.3' is a minor third, while in the original '35.1', between 3 and 1 is a major third; you can exhaustively enumerate this sound combination ('35.1'3 each note) shifted to different high and low positions on the stave (each time the three notes move up or down and all move the same number of distance units), investigate the combination of each note and this Tone combination ('35.1' 3 tones), wherein the tone combination with the same interval relationship data between its 3 tones is defined as the same phoneme, and the tone combination with different interval relationship data between its 3 tones defined as different phonemes. For example, '35.1' is defined as phoneme A ('725' is regarded as the transposition of phoneme A, and there is no need to define it as another phoneme), and '57.3' is defined as phoneme B because of the interval relationship between the three tones The data is different from the interval relation data among the three tones of the phoneme A.

以上两段说到的用于定义成音素的音组合中,各音的音长是相同的,对于各音的音长是不同的时,其定义音素的方式与以上两段所述类似,比如有两个音,第一个音占8分之3拍,第2音占8分之1拍,第2音的音高高于第1音,这两个音处在五线谱上不同上下位置时(这两个音之间在五线谱垂直方向相距的距离单位恒为2)这两个音有两个音程情况(两音之间是大三度,两音之间是小三度),那就可相应定义两个音素,一个音素含两个音的信息,第一个音占8分之3拍,第2音占8分之1拍,第2音的音高高于第1音,两音之间是大三度,另一个音素含两个音的信息,第一个音占8分之3拍,第2音占8分之1拍,第2音的音高高于第1音,两音之间是小三度。In the combination of sounds used to define phonemes mentioned in the above two paragraphs, the sound length of each sound is the same. When the sound length of each sound is different, the way of defining the phoneme is similar to that described in the above two paragraphs, for example There are two tones, the first tone occupies 3/8 beats, the second tone occupies 1/8 beat, the pitch of the second tone is higher than the first tone, when these two tones are in different upper and lower positions on the stave (The distance unit between the two tones in the vertical direction of the staff is always 2) These two tones have two intervals (there is a major third between the two tones, and a minor third between the two tones), then you can Correspondingly define two phonemes, one phoneme contains information of two tones, the first tone occupies 3/8 beats, the second tone occupies 1/8 beat, the pitch of the second tone is higher than the first tone, and the two tones There is a major third in between, and another phoneme contains information of two tones, the first tone occupies 3/8 of a beat, the second tone occupies 1/8 of a beat, the pitch of the 2nd tone is higher than the 1st tone, Between the two tones is a minor third.

对于其它的音的构成的情况(比如一段音阶),可类似定义处理。For other situations of sound composition (such as a section of musical scale), it can be similarly defined and processed.

而对于完全是单音重复组合的音素,因各音之间无音程值,所以一单音重复组合在五线谱上各上下位置时的情况都应统一定义为同一个音素(如乐段中有此音素,那编码时要在拼合信息的值里记录此音素的音素编号值和此音素在五线谱上的上下位置值1)。And for the phoneme that is monophonic repetition combination completely, because there is no interval value between each sound, so the situation when a monophonic repetition combination is in each upper and lower position on the staff all should be defined as the same phoneme (as there is this in the musical section Phoneme, when encoding, the phoneme number value of this phoneme and the upper and lower position values of this phoneme on the stave should be recorded in the value of the splicing information 1).

对于单音,可把1分之1拍的单音,2分之1拍的单音,4分之1拍的单音,8分之1拍的单音,16分之1拍的单音,4分之3拍的单音,8分之3拍的单音,16分之3拍的单音,等各自定义成不同音素(本编码规则举例和解码规则举例是把4分音符定义成一拍),单音音素里一般只记录一个音长值即可;单音也算是音组合的特例。For monophonic notes, you can use monophonic notes of 1/1 beat, 1/2 beats, 1/4 beats, 1/8 beats, 1/16 beats , the single tone of 3/4 beat, the single tone of 3/8 beat, the single tone of 3/16 beat, etc. are respectively defined as different phonemes (this coding rule example and decoding rule example are to define the quarter note as a Beat), generally only one sound length value can be recorded in a monophonic phoneme; a monophonic phoneme is also a special case of a tone combination.

一般的,一个音素中可能含有一个或多个音的信息,含多个音的信息时,这些音可能是在一个或多个时间点上发音,对于任一时间点,如有发多个音,那这多个音是不同音高的音,并且这多个音各自持续的时间全相同或不同。Generally, a phoneme may contain information of one or more tones. When information of multiple tones is included, these tones may be pronounced at one or more time points. For any time point, if there are multiple tones , then these multiple tones are sounds of different pitches, and the respective durations of these multiple tones are all the same or different.

因为音组合的个数很多,难以预计有多少,所以一般不用对每种音组合都对应定义成音素,而是根据需要来定义,比如如编码规则和解码规则涉及编解码的乐段只是一本读物里有限个乐段,统计对这些乐段各自作拼合发音所涉及用到的所有音组合(有限个音组合),比如拼合这些乐段中的乐段1的发音要用到音组合1和音组合2,拼合这些乐段中的乐段2的发音要用到音组合1和音组合3,那这里的所有音组合就要包含音组合1和音组合2和音组合3,对这所有音组合中的各音组合都各自定义成音素,拼合此读物里任一乐段时,用这些音素中的全部或部分音素即可,再比如如编码规则和解码规则涉及编解码的乐段可以是任意乐段,那仍可定义一些单音重复,和弦分解,音阶,围绕音,单音等常用音素,在拼合时可选择使用,而如对于乐段中无法用音素编号(或音素编号及上下位置值等)来表示的陌生音组合,则可以使用修改的方式,比如拼合时用一个音素,对此音素先作定位再对其进行修改(比如增减音,调整某音的音高,修改音的音长值等),修改得到的就是陌生音组合里音的信息,这样一来就能完成任意乐段的拼合,本编码规则举例和解码规则举例是以上所述的可以对任意乐段进行编解码,当然本段所说的任意乐段,是指用编解码规则对其编解码时其它方面也能顺利编解码的,比如如一乐段的发音要涉及力度,滑音,而编码规则,解码规则不能处理力度,滑音,那此乐段就不属于此编解码规则能编解码处理的任意乐段。Because there are many sound combinations, it is difficult to predict how many there are. Therefore, it is generally not necessary to define each sound combination as a phoneme, but to define it according to the needs. For example, the coding rules and decoding rules involving codecs are just a book. There are a limited number of passages in the reading material, and statistics are made of all the sound combinations (limited number of sound combinations) involved in the joint pronunciation of these passages. For example, the pronunciation of passage 1 in these passages needs to use the sound combination 1 and the sound Combination 2, the pronunciation of the passage 2 in these passages needs to use the sound combination 1 and the sound combination 3, then all the sound combinations here will include the sound combination 1 and the sound combination 2 and the sound combination 3, for all the sound combinations Each sound combination is defined as a phoneme. When assembling any passage in this reading, all or part of these phonemes can be used. For example, if the coding rules and decoding rules involve encoding and decoding, the passages can be any passages. , that can still define some commonly used phonemes such as monotone repetition, chord decomposition, scale, surround sound, monotone, etc., which can be selected for use when assembling, and for example, phoneme numbers (or phoneme numbers and upper and lower position values, etc. ) to represent the combination of unfamiliar sounds, then you can use a modified method, such as using a phoneme when assembling, and then positioning this phoneme and then modifying it (such as increasing or decreasing the sound, adjusting the pitch of a certain sound, modifying the sound of the sound Long value, etc.), the modified information is the information of the sound in the combination of unfamiliar sounds, so that the combination of any music section can be completed. The example of this encoding rule and the decoding rule are the above-mentioned examples that can encode and decode any music section. , of course, the arbitrary section mentioned in this paragraph refers to the codec that can be coded and decoded in other aspects smoothly when it is coded and decoded by the codec rules. If velocity and portamento are handled, then this section does not belong to any section that can be processed by this codec rule.

对于以上述的本编码规则举例和解码规则举例里涉及定义的音素,如是把一个音定义成的音素,这音素里包括这个音的音长值(音素里无别的信息),这也属于前面述的1型音素,如是把同时发音的多个音定义成的音素,这音素里包括这多个音各自的音长值,还包括这多个音各音互相之间的音程关系数据(音素里无别的信息),这也属于前面述的2型音素,如是把其中至少含有不同时发音的两个音的多个音定义成的音素,这音素里包括这多个音各自的发音时刻值,这多个音各自的音长值,这多个音各音互相之间的音程关系数据(音素里无别的信息),这也属于前面述的3型音素。For the phonemes that are defined in the examples of the encoding rules and the decoding rules mentioned above, if a phoneme is defined as a sound, the phoneme includes the sound length value of the sound (no other information in the phoneme), which also belongs to the front The type 1 phoneme described above is a phoneme that defines multiple sounds that are pronounced at the same time. This phoneme includes the respective sound length values of these multiple sounds, and also includes the interval relationship data (phoneme) between each of these multiple sounds. There is no other information in it), which also belongs to the above-mentioned type 2 phoneme, such as a phoneme defined by multiple sounds containing at least two sounds that are not pronounced at the same time, and this phoneme includes the respective pronunciation moments of these multiple sounds value, the respective sound length values of these multiple sounds, the interval relationship data between each of these multiple sounds (there is no other information in the phoneme), and this also belongs to the aforementioned 3-type phoneme.

音素可用音素编号来表示,凡是可从拼合信息的值里直接得到的音素编号,一般需在拼合信息的值里记载其音素的上下位置值1。各音素的上下位置值1的确定以各音素中的第1次发的音中的最低音为准,即音素的上下位置值1就是音素中的第1次发的音中的最低音的上下位置值(后面的编码举例和解码举例也是这样的)。各个类音素的上下位置值1的确定以各个类音素中的第1次发的音中的最低音为准,即类音素的上下位置值1就是类音素中的第1次发的音中的最低音的上下位置值(后面的编码举例和解码举例也是这样的),以上两句说的类音素是含有数个音的信息的类音素。(本段各处说到的上下位置值,上下位置值1都是指在五线谱上的,本段说的第1次发的音也可能是静音,即休止符对应的音)。本编码规则举例和解码规则举例里一个音的上下位置值,是指这个音比这个音所处声道的五线谱五条线中最中间一线高或低多少距离单位。A phoneme can be represented by a phoneme number. For any phoneme number that can be directly obtained from the value of the combined information, the upper and lower position values of the phoneme need to be recorded as 1 in the value of the combined information. The determination of the upper and lower position value 1 of each phoneme is based on the lowest sound in the first sound in each phoneme, that is, the upper and lower position value 1 of the phoneme is the upper and lower of the lowest sound in the first sound in the phoneme Position value (the same is true for the following encoding and decoding examples). The determination of the upper and lower position value 1 of each class phoneme is based on the lowest sound in the first sound of each class phoneme, that is, the upper and lower position value 1 of the class phoneme is the first sound in the class phoneme. The upper and lower position values of the lowest tone (the following coding examples and decoding examples are also like this), the phonemes mentioned in the above two sentences are phonemes containing information of several tones. (The upper and lower position values mentioned everywhere in this paragraph, the upper and lower position value 1 all refer to the staff, and the first sound mentioned in this paragraph may also be mute, that is, the sound corresponding to the rest). The upper and lower position values of a sound in this encoding rule example and decoding rule example refer to the distance unit higher or lower than the middle line of the five lines of the staff of the sound channel where the sound is located.

举一个有关音素定义的实例(这些音组合主要是由两个基本的音组成的,第2音的音高比第1音的音高高,这两个音各自还带一个冠音):Give an example of phoneme definition (these sound combinations are mainly composed of two basic sounds, the pitch of the second sound is higher than that of the first sound, and each of these two sounds has a crown sound):

音组合1:第1音与第2音之间为大3度,这两个音都带3度音程冠音,第1音及其冠音的音长都是1拍,第2音及其冠音的音长都是1拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 1: There is a major 3rd between the 1st tone and the 2nd tone. Both of these two tones have a 3rd-degree crown sound. The sound length of the crown sound is 1 beat, and the pronunciation time of the second sound is 1 beat slower than that of the first sound (these sounds also have a definite pitch value)

音组合2:第1音与第2音之间为小3度,这两个音都带3度音程冠音,第1音及其冠音的音长都是1拍,第2音及其冠音的音长都是1拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 2: There is a minor 3rd between the 1st tone and the 2nd tone. The sound length of the crown sound is 1 beat, and the pronunciation time of the second sound is 1 beat slower than that of the first sound (these sounds also have a definite pitch value)

音组合3:第1音与第2音之间为大3度,这两个音都带3度音程冠音,第1音及其冠音的音长都是1拍,第2音及其冠音的音长都是半拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 3: There is a major 3rd between the 1st tone and the 2nd tone. The sound length of the crown sound is half a beat, and the second sound is pronounced one beat slower than the first sound (these sounds also have a definite pitch value)

音组合4:第1音与第2音之间为小3度,这两个音都带3度音程冠音,第1音及其冠音的音长都是1拍,第2音及其冠音的音长都是半拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 4: There is a minor 3rd between the 1st tone and the 2nd tone. The sound length of the crown sound is half a beat, and the second sound is pronounced one beat slower than the first sound (these sounds also have a definite pitch value)

音组合5:第1音与第2音之间为大3度,这两个音都带8度音程冠音,第1音及其冠音的音长都是1拍,第2音及其冠音的音长都是1拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Sound combination 5: between the 1st tone and the 2nd tone is a major 3rd degree, both of these two tones have an 8th interval crown sound, the sound length of the 1st tone and its crown sound is 1 beat, the 2nd tone and its crown sound The sound length of the crown sound is 1 beat, and the pronunciation time of the second sound is 1 beat slower than that of the first sound (these sounds also have a definite pitch value)

音组合6:第1音与第2音之间为小3度,这两个音都带8度音程冠音,第1音及其冠音的音长都是1拍,第2音及其冠音的音长都是1拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 6: There is a minor 3rd between the first and second tones, and both of these two tones have an 8-degree crown. The sound length of the crown sound is 1 beat, and the pronunciation time of the second sound is 1 beat slower than that of the first sound (these sounds also have a definite pitch value)

音组合7:第1音与第2音之间为大3度,这两个音都带8度音程冠音,第1音及其冠音的音长都是1拍,第2音及其冠音的音长都是半拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 7: There is a major 3rd between the 1st tone and the 2nd tone. Both of these two tones have an 8th interval crown sound. The sound length of the crown sound is half a beat, and the second sound is pronounced one beat slower than the first sound (these sounds also have a definite pitch value)

音组合8:第1音与第2音之间为小3度,这两个音都带8度音程冠音,第1音及其冠音的音长都是1拍,第2音及其冠音的音长都是半拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 8: There is a minor 3 degree between the 1st sound and the 2nd sound. The sound length of the crown sound is half a beat, and the second sound is pronounced one beat slower than the first sound (these sounds also have a definite pitch value)

音组合9:第1音与第2音之间为大3度,这两个音都带3度音程冠音,第1音及其冠音的音长都是半拍,第2音及其冠音的音长都是1拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 9: There is a major 3rd between the 1st tone and the 2nd tone. Both of these two tones have a 3rd-degree crown sound. The sound length of the crown sound is 1 beat, and the pronunciation time of the second sound is 1 beat slower than that of the first sound (these sounds also have a definite pitch value)

音组合10:第1音与第2音之间为小3度,这两个音都带3度音程冠音,第1音及其冠音的音长都是半拍,第2音及其冠音的音长都是1拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 10: There is a minor 3rd between the 1st tone and the 2nd tone. Both of these two tones have a 3rd-degree crown sound. The sound length of the crown sound is 1 beat, and the pronunciation time of the second sound is 1 beat slower than that of the first sound (these sounds also have a definite pitch value)

音组合11:第1音与第2音之间为大3度,这两个音都带3度音程冠音,第1音及其冠音的音长都是半拍,第2音及其冠音的音长都是半拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 11: There is a major 3rd between the 1st tone and the 2nd tone. Both of these two tones have a 3rd-degree crown sound. The sound length of the crown sound is half a beat, and the second sound is pronounced one beat slower than the first sound (these sounds also have a definite pitch value)

音组合12:第1音与第2音之间为小3度,这两个音都带3度音程冠音,第1音及其冠音的音长都是半拍,第2音及其冠音的音长都是半拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 12: There is a minor 3rd between the 1st tone and the 2nd tone. The sound length of the crown sound is half a beat, and the second sound is pronounced one beat slower than the first sound (these sounds also have a definite pitch value)

音组合13:第1音与第2音之间为大3度,这两个音都带8度音程冠音,第1音及其冠音的音长都是半拍,第2音及其冠音的音长都是1拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 13: There is a major 3rd between the 1st tone and the 2nd tone. Both of these two tones have an 8th interval crown sound. The sound length of the crown sound is 1 beat, and the pronunciation time of the second sound is 1 beat slower than that of the first sound (these sounds also have a definite pitch value)

音组合14:第1音与第2音之间为小3度,这两个音都带8度音程冠音,第1音及其冠音的音长都是半拍,第2音及其冠音的音长都是1拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 14: There is a minor 3rd between the first and second tones, and both of these two tones have an 8-degree crown. The sound length of the crown sound is 1 beat, and the pronunciation time of the second sound is 1 beat slower than that of the first sound (these sounds also have a definite pitch value)

音组合15:第1音与第2音之间为大3度,这两个音都带8度音程冠音,第1音及其冠音的音长都是半拍,第2音及其冠音的音长都是半拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Sound combination 15: between the 1st tone and the 2nd tone is a major 3rd degree, both of these two tones have an 8th interval crown sound, the sound length of the 1st tone and its crown sound is half a beat, the 2nd tone and its The sound length of the crown sound is half a beat, and the second sound is pronounced one beat slower than the first sound (these sounds also have a definite pitch value)

音组合16:第1音与第2音之间为小3度,这两个音都带8度音程冠音,第1音及其冠音的音长都是半拍,第2音及其冠音的音长都是半拍,第2音发音时刻比第1音发音时刻慢1拍,(这些音也都是有确定的音高值的)Tone combination 16: There is a minor 3rd between the 1st tone and the 2nd tone. Both of these two tones have an 8th interval crown sound. The sound length of the crown sound is half a beat, and the second sound is pronounced one beat slower than the first sound (these sounds also have a definite pitch value)

以上16种音组合可被分别定义为16个不同音素(任一音素里不包含任一音的音高值),可各自赋予一个音素编号,共16个音素编号,每个音素里包含两个音各自的发音时刻值(以第1音为时刻起点),两个音各自的音长值,两个音互相之间的音程关系数据(这里两个音互相之间的音程关系数据就是两个音之间的音程关系值),音素里再无别的信息。(以上说到的拍都是把4分音符定义成一拍)The above 16 kinds of tone combinations can be defined as 16 different phonemes (any phoneme does not contain the pitch value of any tone), and each phoneme number can be assigned to a total of 16 phoneme numbers, each phoneme contains two The respective pronunciation time values of the sounds (with the first sound as the starting point of the moment), the respective sound length values of the two sounds, and the interval relationship data between the two sounds (the interval relationship data between the two sounds here is two Interval relationship between tones), there is no other information in the phoneme. (The beats mentioned above all define quarter notes as one beat)

在乐谱中时常可见临时的升号,降号,重升号,重降号,还原号,这些符号的作用仅是把符号所作用的音组合里的各音互相之间的音程关系临时改变了,但如音素数据库数据完备的话,那改变音程关系后的这几个音形成的新的音组合,对应定义的音素(此音素是未定位的)应该与在音素数据库里记录的某音素(此音素是未定位的)相同,所以编码时,用音素数据库里的此某音素的音素编号再加上一个此某音素的上下位置值1来表示这新的音组合即可,或者改变音程关系后的这几个音形成的这新的音组合,对应定义的自定位的音素应该与在音素数据库里记录的某自定位的音素相同,所以编码时,用音素数据库里的此某自定位的音素的音素编号来表示这新的音组合即可;比如(假设以下提到的抽象拍和乐段的实际拍都是以4分音符为1拍):乐段的五线谱一声道上有一含有两个音的音组合(两音各占半拍;这里的拍是指乐段的实际拍)有被定义成一未定位的音素(音素里的两音各占半拍;这里和下面3句提到的拍都是抽象拍,其值等于乐段的实际拍),后音的音高比前音的音高高,两音之间音程为大3度,后音比前音慢半拍,在音素数据库里记录有按这个音组合定义的未定位的音素(此音素对应有一个音素编号),然后如这乐段后面五线谱这声道上(调式,谱号都不变),出现一含有两个音的音组合,其相当于是把本段第1句提到的那个音组合复制一遍(各音的音高,音长都对应相同)并把其中后一个音赋予临时降号,这样两个音的音程就成为小3度,不是原来大3度的音素了,但在前述音素数据库里记录有的另一音素(未定位的)是按这个音组合来定义的即这另一音素里的两音之间音程为小3度且两音各占半拍且后音的音高比前音的音高高且后音比前音慢半拍(本句说的拍是指抽象拍),(此音素也对应有另一个音素编号),所以编码时,使用这另一个音素的音素编号加上一个此另一个音素的上下位置值1来表示此赋予了临时降号的音组合即可。所以在做音素数据库的时候,不必考虑临时升号,降号等符号,只需数据库里尽量把各种音组合里各音之间不同的音程关系情况都考虑到,并定义足够多的音素,那在乐谱编码时,可根据音组合里实际的临时升号,降号等来决定编码使用哪个音素。Temporary sharps, flats, double sharps, double flats, and restorations are often seen in music scores. The function of these symbols is only to temporarily change the interval relationship between the tones in the sound combination on which the symbols act. , but if the phoneme database data is complete, the new sound combination formed by these several tones after changing the interval relationship should correspond to a defined phoneme (this phoneme is unlocated) and a certain phoneme recorded in the phoneme database (this The phoneme is unlocated), so when encoding, use the phoneme number of this certain phoneme in the phoneme database plus a value of 1 for the upper and lower position of this certain phoneme to represent the new sound combination, or after changing the interval relationship The new sound combination formed by these several sounds, the corresponding defined self-positioned phoneme should be the same as a certain self-positioned phoneme recorded in the phoneme database, so when encoding, use this certain self-positioned phoneme in the phoneme database The phoneme number of the music section can be used to represent the new sound combination; A sound combination of two tones (two tones each occupy half a beat; the beat here refers to the actual beat of the passage) is defined as an unlocated phoneme (two tones in a phoneme each occupy a half beat; mentioned here and in the following three sentences The beats are abstract beats, whose value is equal to the actual beat of the passage), the pitch of the back tone is higher than that of the front tone, the interval between the two tones is 3 degrees larger, and the back tone is half a beat slower than the front tone. The phoneme database records the unlocated phoneme defined according to this sound combination (this phoneme corresponds to a phoneme number), and then on the track of the stave (mode and clef are unchanged) at the back of the music section, a phoneme containing two The sound combination of a sound is equivalent to copying the sound combination mentioned in the first sentence of this paragraph (the pitch of each sound and the sound length are all corresponding to the same) and the latter sound is given to the temporary flat number, so that the two The interval of the sound becomes a small 3 degree, not the original 3 degree phoneme, but another phoneme (unlocated) recorded in the aforementioned phoneme database is defined according to this sound combination, that is, the phoneme in this other phoneme The interval between the two tones is 3 degrees smaller and the two tones each occupy half a beat and the pitch of the back tone is higher than that of the front tone and the back tone is half a beat slower than the front tone (the beat in this sentence refers to the abstract beat), (This phoneme also corresponds to another phoneme number), so when encoding, use the phoneme number of this other phoneme plus the upper and lower position value 1 of this other phoneme to represent that this has given the sound combination of the temporary flat number. Therefore, when building a phoneme database, you don’t need to consider symbols such as temporary sharps and flats. You only need to take into account the different interval relationships between the sounds in various sound combinations in the database, and define enough phonemes. When the music score is encoded, it can be determined which phoneme to use for encoding according to the actual temporary sharps, flats, etc. in the sound combination.

对于每个不同的音素,都赋予一个不同的音素编号(比如可用二进制编号来表示各音素),在编码时,可用音素编号来表示各音素。各音素编号的长度可以一致(本说明书作为举例的编码规则和解码规则采用以一致的方式为例),也可不一致长度,可以统计在平常的乐曲中,各音素出现的频率,经常出现的可以赋予短编号,不常出现的赋予长编号,比如可以使用哈夫曼编号方式或其它编号方式,这样一来可以达到压缩数据长度的作用。For each different phoneme, a different phoneme number is given (for example, each phoneme can be represented by a binary number), and each phoneme can be represented by a phoneme number during encoding. The length of each phoneme number can be consistent (this instruction manual uses the same method as an example for the coding rules and decoding rules as an example), and can also be inconsistent in length. In ordinary music, the frequency of each phoneme's occurrence can be counted. Frequently occurring ones can be Assign short numbers, and assign long numbers that do not appear frequently, for example, you can use Huffman numbering or other numbering methods, so that the effect of compressing the data length can be achieved.

另外对于不同体系的音乐,其使用的音乐手法,和弦种类都具有区域性,比如欧洲古典音乐里的各音乐家的作品,其音乐手法,和弦种类很相似,而越剧的各个作品的音乐手法互相也相似,但欧洲古典音乐与越剧的音乐手法相差较大,所以可以针对不同的音乐体系,分别定义不同的音素数据库(发相同音的音素,在不同的音素数据库里一般需根据使用频率情况来赋予不同的音素编号),以方便编码和提高压缩效率,一般可在乐段拼合信息的值前面开一个或多个BIT表示是使用哪种音素数据库,而在与拾音器有关的存储器集里可把各个音素数据库都存进去,解码时根据乐段拼合信息的值前面的区别信息,调对应的音素数据库解码;也可把常见的各不同音乐体系统一当成一个总音乐体系,各音乐体系(此各音乐体系是指上句说的不同音乐体系中的各音乐体系)使用的音乐手法和和弦种类都属于此总音乐体系,按统一的编码规则来编码,按统一的解码规则作解码处理;本说明书作为举例的编码规则是以针对总音乐体系的乐段来编码为例的,本说明书作为举例的是以针对总音乐体系的乐段的拼合信息的值来解码为例的,以下乐段拼合信息的值编解码举例涉及的乐段也是以是总音乐体系里的乐段为例的。而权利要求书中提到的编码规则和解码规则可以是作了使用哪种音乐体系的限制的,也可以是没有作使用哪种音乐体系的限制的。In addition, for different systems of music, the musical techniques and chord types used are all regional. For example, the musical techniques and chord types of the works of various musicians in European classical music are very similar. It is also similar, but the music techniques of European classical music and Yue opera are quite different, so different phoneme databases can be defined for different music systems (phonemes that pronounce the same sound generally need to be classified according to the frequency of use in different phoneme databases) give different phoneme numbers), to facilitate coding and improve compression efficiency, generally one or more BITs can be opened in front of the value of the piece combination information to indicate which phoneme database is used, and in the memory set related to the pickup, the Each phoneme database is all stored in, and when decoding, according to the difference information in front of the value of the music segment combination information, the corresponding phoneme database is decoded; also the common different music systems can be unified as a total music system, and each music system (these each The music system refers to the different music systems mentioned in the previous sentence. The musical techniques and chord types used belong to this general music system, and are encoded according to the unified coding rules and decoded according to the unified decoding rules; this manual As an example, the encoding rule is based on the encoding of the music section of the overall music system. This manual is based on the decoding of the value of the combination information of the music section of the overall music system as an example. The following section combination information The passages involved in the value codec example are also taken as examples in the general music system. The encoding rules and decoding rules mentioned in the claims may or may not restrict which music system to use.

音素定义好后,对各音素赋予音素编号,把各音素集合起来做成音素数据库,以供编码使用,解码时也要用音素数据库;本说明书述的编码规则举例和解码规则举例和编码举例和解码举例涉及用的音素数据库里每个记录项记录一个音素的信息(对于是把一个音定义成的音素,这里说的一个音素的信息包括音素编号,这个音的音长值等信息,对于是把同时发音的多个音定义成的音素,这里说的一个音素的信息包括音素编号,这多个音各自的音长值,还包括这多个音各音互相之间的音程关系数据等信息,对于是把其中至少含有不同时发音的两个音的多个音定义成的音素,这里说的一个音素的信息包括音素编号,这多个音各自的发音时刻值,各自的音长值,这多个音各音互相之间的音程关系数据等信息)。After the phonemes are defined, assign phoneme numbers to each phoneme, and assemble each phoneme to form a phoneme database for encoding, and also use the phoneme database when decoding; the examples of encoding rules, decoding rules, encoding examples and The decoding example involves the information of a phoneme recorded in each record item in the phoneme database used (for a phoneme defined as a phoneme, the information of a phoneme mentioned here includes the phoneme number, the sound length value of the sound, etc. A phoneme that defines multiple sounds that are pronounced at the same time. The information of a phoneme mentioned here includes the phoneme number, the respective sound length values of the multiple sounds, and the interval relationship data between the multiple sounds. , for a phoneme that defines a plurality of sounds that contain at least two sounds that are not pronounced at the same time, the information of a phoneme mentioned here includes the phoneme number, the respective pronunciation time values of these multiple sounds, and the respective sound length values, Information such as interval relationship data between each tone of these multiple tones).

显然同一个音素,一般可用于不同乐段的发音拼合;即编码生成表示如何用音素来拼合一乐段发音的拼合信息的值时,涉及使用的音素(这里说的音素就是上句说的‘如何用音素来拼合一乐段发音’中提到的音素)中可使用某一音素,编码生成表示如何用音素来拼合另一乐段发音的拼合信息的值时,涉及使用的音素(这里说的音素就是上句说的‘如何用音素来拼合另一乐段发音’中提到的音素)中也可使用此某一音素(此‘此某一音素’是指前面提到的‘某一音素’),对一乐段拼合信息的值解码时,用来拼合此乐段发音的音素中可使用某一音素,对另一乐段拼合信息的值解码时,用来拼合这乐段发音的音素中也可使用此某一音素。Obviously, the same phoneme can generally be used for the pronunciation combination of different sections; that is, when encoding generates the value of the combination information indicating how to use phonemes to combine the pronunciation of a section, it involves the phoneme used (the phoneme mentioned here is exactly what the previous sentence said' How to use phonemes to join together the phoneme mentioned in the pronunciation of a passage' can use a certain phoneme, when encoding generates the value of the joining information representing how to use phonemes to join together the pronunciation of another passage, the phoneme used (here said The phoneme is the phoneme mentioned in the above sentence "how to use phonemes to combine the pronunciation of another passage") and this certain phoneme can also be used (this "this certain phoneme" refers to the aforementioned "a certain phoneme") Phoneme'), when decoding the value of a segment’s combination information, a certain phoneme can be used in the phonemes used to combine the pronunciation of this segment, and when decoding the value of another segment’s combination information, it can be used to combine the pronunciation of this segment This phoneme can also be used in the phoneme of .

本说明书的编码规则举例和解码规则举例所涉及的被定位的音素,主要是以被定位的音素里的音在同一个声道为例的,即一个被定位的音素内的各音是属于同一个声道的(可称这种被定位的音素为被定位的单声道音素),实际上一个被定位的音素内的各音也可以是分属于多个声道的(可称这种被定位的音素为被定位的多声道音素);可全用被定位的单声道音素来拼合乐段发音(本说明书作为举例的编码规则和解码规则是以此为例的),也可以用数个被定位的多声道音素与数个被定位的单声道音素来拼合乐段发音,也可全用被定位的多声道音素来拼合乐段发音,拼合时被定位的多声道音素的用法与被定位的单声道音素的用法类似,可参考本说明书述的被定位的单声道音素的用法。(本申请文件编解码举例涉及的每个被定位的音素都是这样,即被定位的音素里的各音是属于同一个声道的;本申请文件编解码举例涉及的每个定位的类音素里的各音是属于同一个声道的)The localized phonemes involved in the examples of coding rules and decoding rules in this manual mainly take the tones in the localized phoneme as an example in the same channel, that is, each sound in a localized phoneme belongs to the same channel. One channel (this kind of positioned phoneme can be called a positioned monophonic phoneme), in fact, each sound in a positioned phoneme can also belong to multiple channels (this kind of positioned phoneme can be called a positioned monophonic phoneme). The positioned phoneme is a positioned multi-channel phoneme); the positioned monophonic phoneme can be used to stitch together the pronunciation of the segment (the encoding rule and decoding rule used as an example in this manual are taken as an example), and it can also be used Several positioned multi-channel phonemes and several positioned monophonic phonemes are used to combine the pronunciation of the passage, or all the positioned multi-channel phonemes can be used to combine the pronunciation of the passage, and the positioned multi-channel phonemes The usage of the phoneme is similar to that of the positioned monophonic phoneme, and reference can be made to the usage of the positioned monophonic phoneme described in this manual. (every positioned phoneme involved in the codec example of the application document is like this, that is, each sound in the positioned phoneme belongs to the same channel; Each sound in the sound belongs to the same channel)

音素数据库及其它解码用的数据可存放在拾音器内的存储器里以供解码。并且解码用的音素数据库与编码用的音素数据库一般应是相同或兼容的;以下编码举例用的音素数据库和解码举例用的音素数据库是相同的。The phoneme database and other data for decoding can be stored in memory within the pickup for decoding. In addition, the phoneme database used for decoding and the phoneme database used for encoding should generally be the same or compatible; the phoneme database used for the following encoding examples and the phoneme database used for decoding examples are the same.

音乐乐段的其它构成特点:Other compositional features of musical passages:

重复:即把前面一段乐曲,紧接着重复一遍,如图12是把被定位的音素56(此音素含4个音)重复2次的示意图。Repeat: promptly repeat the previous piece of music, as shown in Figure 12, which is a schematic diagram of repeating the positioned phoneme 56 (this phoneme contains 4 tones) twice.

有点变化的重复:把前面一段乐曲,重复一遍并有对重复得到的某些音稍做修改。Repetition with some changes: Repeat the previous piece of music and slightly modify some of the repeated sounds.

移动音程的重复:把前面一段乐曲,重复一遍并有对重复得到的向上或向下移动一个或多个音程(模进)。Repetition of moving intervals: Repeat the previous piece of music and move up or down one or more intervals (modulus) obtained by the repetition.

这些特点,可以利用起来达到压缩编码信息量的作用;对乐段中的音组合的重复情况,在拼合时可采用复制的方式来得到重复的音组合。(以上几行所指一段乐曲,一般不是指一个乐段,而是没有事先固定的长度的,比如可以把前面1个小节的音作复制,或把前面3个小节的音作复制,或把前面1个被定位的音素作复制等)These characteristics can be utilized to achieve the effect of compressing the amount of encoded information; for the repetition of sound combinations in the music section, duplication can be used to obtain repeated sound combinations when assembling. (A piece of music referred to in the above lines generally does not refer to a piece of music, but does not have a fixed length in advance. For example, the sound of the previous 1 measure can be copied, or the sound of the first 3 measures can be copied, or the sound of the first 3 measures can be copied. The previous 1 positioned phoneme is copied, etc.)

对乐段的编码是可以适用于一个声道或多个声道的乐段的,对乐段拼合信息的值的解码发音是可以适用于一个声道或多个声道的。The coding of the segment can be applied to the segment of one channel or multiple channels, and the decoding pronunciation of the value of the combination information of the segment can be applied to one channel or multiple channels.

拼合时可使用顺序拼合,交织拼合等组拼形式(顺序拼合,交织拼合的处理方式可参考本说明书前面所述),还可使用对定位的音素进行修改,复制等拼合相关处理形式(对定位的音素进行修改,复制的处理方式可参考本说明书前面所述)。When stitching, you can use sequential stitching, interweaving stitching and other grouping forms (the processing methods of sequential stitching and interweaving stitching can refer to the previous description of this manual), and you can also use related processing forms such as modifying and copying the positioned phonemes (for positioning Phonemes can be modified, and the copying process can refer to the previous description of this manual).

本编码规则对乐段编码而得到的乐段拼合信息的值的结构的定义可如下:The definition of the structure of the value structure of the segment combination information obtained by encoding the segment in this encoding rule can be as follows:

前面是编码头信息:含有调式及谱号信息,节奏信息等信息,(另还可以含有音色信息但这不是必需的)。The front is encoding header information: information such as mode and clef information, rhythm information, (can also contain timbre information, but this is not necessary).

接着的数据由各种段组合而成,这些段一般有:音素段,控制段,其中控制段一般有改变声道段,复制控制段,模进复制控制段,有点变化复制段,修改段,交织拼合段,结束标志段,等,控制段里还可包括调式改变段等。The following data is composed of various segments. These segments generally include: phoneme segment, control segment, among which the control segment generally includes channel change segment, copy control segment, modular copy control segment, a little change copy segment, modification segment, The interweaving and splicing segment, the end marker segment, etc., and the control segment may also include a mode change segment and the like.

各段含有的信息可为:The information contained in each paragraph may be:

音素段:0001,音素个数,接着是各音素的上下位置值1和音素编号。Phoneme segment: 0001, the number of phonemes, followed by the upper and lower position value 1 of each phoneme and the phoneme number.

改变(编码)声道段:0010,新声道号码。Change (code) channel segment: 0010, new channel number.

复制控制段:0011,需复制的前面被定位的音素的个数,复制次数。Copy control segment: 0011, the number of previously positioned phonemes to be copied, and the number of times of copying.

模进复制段:0100,需复制的前面被定位的音素的个数,模进的音步的值。Modulus copy section: 0100, the number of previously positioned phonemes to be copied, and the value of the modulo step.

有点变化的复制段:0110,需复制的前面被定位的音素的个数,需改几个音,第几个音需变化,变化后的音的音高的值。Copy section with some changes: 0110, the number of previously positioned phonemes to be copied, how many tones need to be changed, the number of tones to be changed, and the value of the pitch of the changed tones.

修改段:1000,用于修改的音素的上下位置值1,用于修改的音素的音素编号,修改的具体内容。Modification section: 1000, for the upper and lower position value of the modified phoneme 1, for the phoneme number of the modified phoneme, and the specific content of the modification.

交织拼合段:1001,用于交织拼合的各音素的上下位置值1和音素编号。Interweaving and splicing segment: 1001, used for the upper and lower position values 1 and phoneme numbers of each phoneme in the interweaving and splicing.

结束标志段:0111,结束标志。End flag segment: 0111, end flag.

以上象0111这样的数字表示的是控制段的起始标志。The above numbers like 0111 represent the start flag of the control segment.

以上9行涉及的复制,每次复制是针对一个声道里的被定位的音素作复制(而不是一次对多个声道里的被定位的音素作复制),比如复制控制段里有需复制的前面被定位的音素的个数,这里的‘前面被定位的音素’都是指当前指定或默认的那个声道里的被定位的音素,当然这里的声道可以是声道1,或声道2等。The copying involved in the above 9 lines, each copying is to copy the positioned phonemes in one channel (instead of copying the positioned phonemes in multiple channels at one time), for example, there is a need to copy in the copy control section The number of the previously positioned phonemes, the "previously positioned phonemes" here refers to the positioned phonemes in the currently specified or default channel, of course the channel here can be channel 1, or Road 2 and so on.

修改段里的修改具体内容是这样的:The specific content of the modification in the modification section is as follows:

开始两个BIT表示是作什么修改:00表示删除音,01表示增加音,10表示对音进行上下移动,(本编码规则和本解码规则和编码举例和解码举例仅以对音进行上下移动来作为对被定位的音素进行修改的例子),开始两个BIT为10时,接着3个BIT表示在哪个发音时刻位置作音的上下移动(假设可用于修改的被定位的音素中最多只有8个发音时刻位置,用3个BIT来表示是从左数第几个发音时刻位置,比如图16的音素只有两个发音时刻位置,这两个发音时刻位置分别由000,001来表示),接着2个BIT表示此发音时刻位置上从最下面一个音算起第几个音要被上下移动(假设可用于修改的被定位的音素中每个发音时刻位置最多有4个音,用2个BIT来表示是哪个音),接着5个BIT表示在此发音时刻位置上这个要被移动的音往上移动(抬高)或往下移动(降低)几个音步(这5个BIT的最高位BIT为0表示往上,为1表示往下,这5个BIT的低4位表示移动的音步的个数)。本段说的发音时刻位置是针对具体的音素来说的,音素里的所有音可能在一个时刻发音(这个时刻就是这个音素里唯一的发音时刻位置),也可能是分开在多个不同时刻发音(这些不同时刻就是这个音素里的各发音时刻位置)。The first two BITs indicate what modification is made: 00 means to delete the sound, 01 means to add the sound, and 10 means to move the sound up and down. As an example of modifying the positioned phoneme), when the first two BITs are 10, the next 3 BITs indicate the up and down movement of the sound at which pronunciation position (assuming that there are only 8 at most of the positioned phonemes that can be used for modification Pronunciation time position, using 3 BITs to represent the pronunciation time position from the left, for example, the phoneme in Figure 16 has only two pronunciation time positions, and these two pronunciation time positions are represented by 000, 001 respectively), then 2 A BIT indicates that the number of sounds from the bottommost sound on the pronunciation moment position will be moved up and down (assuming that there are at most 4 sounds in each pronunciation moment position in the positioned phoneme that can be used for modification, use 2 BITs to Indicate which sound it is), and then 5 BITs indicate that the sound to be moved moves up (raised) or moves down (lowers) several steps at the position of the pronunciation moment (the highest BIT of these 5 BITs 0 means up, 1 means down, the lower 4 bits of these 5 BITs represent the number of moving steps). The pronunciation time position mentioned in this paragraph is for a specific phoneme. All sounds in a phoneme may be pronounced at one time (this time is the only pronunciation time position in this phoneme), or they may be pronounced separately at different times (These different moments are exactly the positions of each pronunciation moment in this phoneme).

关于本申请文件各处提及的距离单位这个概念的定义,五线谱上五条线中相邻两条线之间的距离是2个距离单位,相邻两条线中间的空与其上或下相邻的线的距离是1个距离单位,依此类推,两条线之间间隔一条线和两个空,那这两条线的距离是2+2=4个距离单位,其它情况依此类推,五线谱的五条线的上面或下面有加的加线时也是按这样算,计算距离单位时是不管五线谱上的调式及谱号如何的。比如一个音的符头(就是那个椭圆形)处于五线谱的五条线的最中间一线和最中间一线的紧邻上面一线之间,那这个音的符头与五线谱的五条线的最中间一线的距离是1个距离单位,如这个音的符头处于五线谱的五条线的最中间一线的紧邻上面一线上,那这个音的符头与五线谱的五条线的最中间一线的距离是2个距离单位。With regard to the definition of the concept of distance units mentioned throughout this application document, the distance between two adjacent lines among the five lines on the stave is 2 distance units, and the space between adjacent two lines is adjacent to its upper or lower The distance between the lines is 1 distance unit, and so on, if there is a line and two spaces between two lines, then the distance between these two lines is 2+2=4 distance units, and so on in other cases, This is also the case when there are added lines above or below the five lines of the staff, and the distance unit is calculated regardless of the mode and clef on the staff. For example, the note head of a sound (that is, the ellipse) is between the middle line of the five lines of the staff and the line immediately above the middle line, then the distance between the note head of this note and the middle line of the five lines of the staff is 1 distance unit, if the note head of this note is on the middle line of the five lines of the staff immediately adjacent to the upper line, then the distance between the note head of this note and the middle line of the five lines of the staff is 2 distance units.

本说明书编码举例,解码举例里提到的每个音素的上下位置值1由5个BIT来表示,5个BIT的最高那个BIT表示要被定位的音素的第1个音是要放在这要被定位的音素要处于的声道的五线谱的五条线的最中间一条线上方还是下方,如是上方,那最高那个BIT为0,如是下方,那最高那个BIT为1,后面4个BIT表示这要被定位的音素的第1个音与这要被定位的音素要处于的声道的五线谱的五条线的最中间一条线距离多少距离单位(比如这要被定位的音素的第1个音要放在五条线的最上面一条线上,那距离就是4个距离单位),如这要被定位的音素的第1个音是要放在这要被定位的音素要处于的声道的五线谱的五条线的最中间一条线上,那这个音素的上下位置值1就是00000,而且计算这要被定位的音素里的第1个音(也是要放在五线谱上的第1个音)的位置与这要被定位的音素要处于的声道的五线谱的五条线的最中间一条线距离多少距离单位是只看被编码乐段的五线谱上要放的这音素的第1个音的符头(就是那个小椭圆形)所处的五线谱上的位置与这要被定位的音素要处于的声道的五线谱的五条线的最中间一条线距离多少距离单位,而不管被定位后的音素的第1个音是否带升号,降号,重升号,重降号,还原号。本说明书各处算声道1,声道2的音素的上下位置值1都是这样算的。The encoding example in this manual, the upper and lower position value 1 of each phoneme mentioned in the decoding example is represented by 5 BITs, and the highest BIT of the 5 BITs indicates that the first sound of the phoneme to be positioned should be placed here Whether the phoneme to be positioned is above or below the middle line of the five-line staff of the vocal channel, if it is above, the highest BIT is 0, if it is below, the highest BIT is 1, and the last 4 BITs indicate that What is the distance unit between the first sound of the phoneme to be located and the middle line of the five lines of the staff of the channel where the phoneme to be located is located (for example, the first sound of the phoneme to be located should be placed On the top line of the five lines, the distance is 4 distance units), for example, the first sound of the phoneme to be positioned is to be placed on the five lines of the staff of the channel where the phoneme to be positioned is located line, the upper and lower position value of this phoneme is 00000, and the position of the first sound in the phoneme to be positioned (also the first sound to be placed on the staff) is calculated and this How much is the distance from the middle line of the five-line staff of the vocal channel where the phoneme to be located is? The unit of distance is only the notehead of the first sound of the phoneme to be placed on the staff of the coded section (that is, the small ellipse) on the staff and the distance unit between the position on the stave where the phoneme to be positioned is located and the middlemost line of the five lines of the staff of the channel where the phoneme is to be located, regardless of the first sound of the phoneme after being positioned Whether with sharps, flats, double sharps, double flats, or reversions. In this manual, the upper and lower position values of the phonemes of channel 1 and channel 2 are calculated in this way.

本编码规则举例和解码规则举例和编码举例和解码举例里所说的音素的第1个音,是指音素中的第1时刻发的所有音中音高最低的音,如音素中的第1时刻发的所有音就只有一个音,那这个音就是此音素的第1个音。比如图16中左边那个音就是这个音素第1时刻发音的音,而且第1时刻只发了这一个音,这个音就是此音素的第1个音;再比如把图11那样的4个音的组合定义成一个音素,那这个音素第1时刻发的音就是左边两个音(即一个音和它的冠音),这两个音中音高最低的那个就是此音素的第1个音;图13中上面声道最左边的那个含一个音的信息的音素,显然其第1音就是唯一那个音。同理,本编码规则举例和解码规则举例和编码举例和解码举例里所说的类音素的第1个音,是指类音素中的第1时刻发的所有音中音高最低的音。The first sound of the phoneme mentioned in this encoding rule example and decoding rule example and encoding example and decoding example refers to the sound with the lowest pitch among all the sounds pronounced at the first moment in the phoneme, such as the first sound in the phoneme All the sounds produced at all times have only one sound, and this sound is the first sound of this phoneme. For example, the sound on the left in Figure 16 is the sound pronounced at the first moment of this phoneme, and only this sound is pronounced at the first moment, and this sound is the first sound of this phoneme; The combination is defined as a phoneme, and the first sound of this phoneme is the two tones on the left (that is, a sound and its crown sound), and the lowest pitch of these two tones is the first sound of this phoneme; In Fig. 13, the phoneme containing the information of one tone on the leftmost side of the upper channel, obviously its first tone is the only tone. In the same way, the first sound of the quasi-phoneme mentioned in this encoding rule example and decoding rule example and encoding example and decoding example refers to the sound with the lowest pitch among all the sounds issued at the first moment in the class phoneme.

同样,后面解码举例2里提到的自定位的类音素里的最先发音的所有音中最低音的上下位置值,也是由5个BIT来表示,5个BIT的最高那个BIT表示这个音是处于这自定位的类音素所处声道的五线谱的五条线的最中间一条线上方还是下方,如是上方,那最高那个BIT为0,如是下方,那最高那个BIT为1,后面4个BIT表示这个音与这自定位的类音素所处声道的五线谱的五条线的最中间一条线距离多少距离单位(比如这个音处于五条线的最上面一条线上,那距离就是4个距离单位),如这个音是处于这自定位的类音素所处声道的五线谱的五条线的最中间一条线上,那这个音的上下位置值就是00000;后面解码举例2里提到的用于交织拼合的第1个音素里的最先发音的所有音中最低音的上下位置值与后面解码举例2里提到的自定位的类音素里的最先发音的所有音中最低音的上下位置值的计算是相同的。Similarly, the upper and lower position values of the lowest tone among all the first sounds in the self-positioning phoneme mentioned in the decoding example 2 later are also represented by 5 BITs, and the highest BIT of the 5 BITs indicates that the sound is It is above or below the middle line of the five-line staff of the vocal channel where the self-positioned phoneme is located. If it is above, the highest BIT is 0; if it is below, the highest BIT is 1, and the last 4 BITs indicate How many distance units are there between this sound and the middlemost line of the five lines of the staff of the vocal channel where the self-localized phoneme is located (for example, if the sound is on the top line of the five lines, the distance is 4 distance units), If this sound is on the middle line of the five lines of the staff of the vocal channel where the self-positioned phoneme is located, then the upper and lower position values of this sound are 00000; Calculation of the upper and lower position values of the lowest tone of all the first sounds in the first phoneme and the upper and lower position values of the lowest sound of all the first sounds in the self-positioned phoneme mentioned in the decoding example 2 later Are the same.

编码得到的乐段拼合信息的值的数据这样布置:先是编码头,接着放拼合此被编码乐段中声道1的所有发音的那些相关信息(比如用音素段记录此声道的所有音素的信息,或用音素段记录此声道的一些音素的信息且用复制控制段记录此声道的一些对音素作定位及复制的信息;音素段可有多个,其它有些种控制段中的每一种控制段也可在一拼合信息的值中出现多次,比如重复控制段,修改段,或交织拼合段等都可在一拼合信息的值中多次出现),如还有别的声道那接着放拼合此被编码乐段中别的声道的所有发音的那些相关信息(比如用音素段记录此声道的所有音素的信息,或用音素段记录此声道的一些音素的信息且用复制控制段记录此声道的一些对音素作定位及复制的信息;音素段可有多个,其它有些种控制段中的每一种控制段也可在一拼合信息的值中出现多次,比如重复控制段,修改段,或交织拼合段等都可在一拼合信息的值中多次出现),照这样如还有更多声道那接着放拼合此被编码乐段中这些更多声道中的各声道的所有发音的那些相关信息;乐段拼合信息的值的最后是结束标志段。The data of the value of the piece of information that is encoded is arranged like this: First, the encoding header, and then put together those related information of all the pronunciations of channel 1 in the coded section (such as recording the information of all phonemes of this channel with phoneme segments, Or use the phoneme segment to record some phoneme information of this channel and use the copy control segment to record some phoneme positioning and copy information of this channel; there can be multiple phoneme segments, and there are other types of control segments for each The control segment can also appear multiple times in the value of a mosaic information, such as repeating the control segment, modifying the segment, or interleaving the mosaic segment, etc., can appear multiple times in the value of a mosaic information), if there are other channels Then put together those relevant information of all the pronunciations of other channels in this coded section (for example, record the information of all phonemes of this channel with phoneme segments, or record the information of some phonemes of this channel with phoneme segments and use The copy control section records some phoneme positioning and copying information of this channel; there can be multiple phoneme sections, and each control section in some other types of control sections can also appear multiple times in the value of a combined information. Such as repeating the control section, modifying the section, or interweaving and merging sections, etc. can appear multiple times in the value of a merging information), in this way, if there are more sound channels, then put and merging these more voices in this coded music section The relevant information of all the pronunciations of each channel in the channel; the end of the value of the segment stitching information is the end marker segment.

上段说的不同声道的发音的起始时间都相同,如有的声道在开始几实际拍没发音,那也要用休止符音素来充填这些没发音的实际拍,然后休止符仍看成是发音的只是发的是静音。The starting time of the pronunciation of different vocal channels mentioned in the previous paragraph is the same. If some vocal channels are not pronounced in the first few beats, then the rest phonemes should be used to fill these unpronounced actual beats, and then the rests are still regarded as pronunciation It's just that it's muted.

声道个数没有限定,一般为2个,4个,8个等。The number of channels is not limited, generally 2, 4, 8 and so on.

对于钢琴谱,如使用两声道,那声道号为1可表示右手主旋律,声道号为2可表示左手伴奏旋律,For the piano score, if two channels are used, the channel number is 1 to represent the main melody of the right hand, and the channel number is 2 to represent the accompaniment melody of the left hand.

对于一些复杂乐曲,有时还需定义那些冠音各自由哪个声道演奏,不过对于简单乐曲不带冠音的,无需定义。For some complex music, sometimes it is necessary to define which channel the crowns are played from, but for simple music without crowns, no definition is required.

本说明书述及的编码规则举例和解码规则举例里涉及定义的音素里是不包括任一音的音高值的,后面编码举例和解码举例涉及的可从拼合信息的值里直接得到的音素也是这样的(音素里是不包括任一音的音高值)。本说明所说的把图9这样的音组合定义成音素,音素里实际是只包含这4个音各自的发音时刻值,以及这4个音各音互相之间的音程关系数据,以及这4个音每个音的音长这些信息,而不包括这4个音中任一音的音高值,所以图9只是这个音素被放在五线谱上时的一个特例(这个音素还可放到五线谱的垂直方向的其它位置),其它把图8这样的音组合定义成音素,把图10这样的音组合定义成音素,把图11这样的音组合定义成音素都照图9这样理解;本说明书述及图12,图13,图14,图15,图16的五线谱上的音素,也是指音素被放在五线谱上时的特例,这些音素里本身是不包括任一音的音高值的,比如图13声道1的最右边的音素56(含2个音的信息),此音素里本身是不包括任一音的音高值的,五线谱的垂直方向的多个不同位置都可放此音素,而图13声道1中最右边放的这个音素56只是这个音素放在五线谱上的一个特例。The phonemes defined in the encoding rule examples and decoding rule examples mentioned in this manual do not include the pitch value of any tone, and the phonemes that can be directly obtained from the value of the combined information involved in the following encoding examples and decoding examples are also Like this (the pitch value of any tone is not included in the phoneme). What this description says defines the combination of sounds as shown in Figure 9 as a phoneme, and the phoneme actually only includes the respective pronunciation time values of these 4 tones, and the interval relationship data between each of these 4 tones, and these 4 tones. These information of the sound length of each sound of each sound, do not include the pitch value of any sound in these 4 sounds, so Figure 9 is only a special case when this phoneme is placed on the staff (this phoneme can also be placed on the staff other positions in the vertical direction), other the sound combinations like Fig. 8 are defined as phonemes, the tone combinations like Fig. 10 are defined as phonemes, and the tone combinations like Fig. 11 are defined as phonemes and are all understood as shown in Fig. 9; Referring to Fig. 12, Fig. 13, Fig. 14, Fig. 15, the phoneme on the stave of Fig. 16 also refers to the special case when the phoneme is placed on the stave, in these phonemes itself does not comprise the pitch value of any sound, For example, the rightmost phoneme 56 (information containing 2 tones) of Fig. 13 sound channel 1, itself does not include the pitch value of any tone in this phoneme, and multiple different positions in the vertical direction of the staff can put this phoneme, and the phoneme 56 placed on the far right in channel 1 of Figure 13 is just a special case of this phoneme being placed on the stave.

解码时按上述各段定义对应解码。When decoding, define corresponding decoding according to the above paragraphs.

解码时可用整步发音方式,即在解码时,根据乐段拼合信息的值,得到所有解码结果后,再把解码结果拿去发音而得到乐段的全部发音(或根据需要只根据所有解码结果播乐段的部分音);在实际中,对于整步发音方式,解码结果也可由MCU转成乐段谱数据(解码结果也可能是别的可转化成音乐声的数据形式),再由MCU(或MCU外接的音乐芯片等)将乐段谱数据用音乐合成方式使喇叭播放此乐段的发音(如是用MCU外接的音乐芯片来播音乐,那一般应是由MCU通过连接线将乐段谱数据传给MCU外接的音乐芯片,由MCU外接的音乐芯片将乐段谱数据用音乐合成方式使喇叭播放此乐段的发音),所以拾音器里的MCU(或MCU外接的音乐芯片等)一般需要有音乐合成功能。对于整步发音方式,解码结果也可以是由MCU转成除乐段谱数据外的别的可转化成音乐声的数据形式,再由MCU(或MCU外接的音乐芯片等)按这些形式的解码结果使喇叭播放此乐段的发音(如是用MCU外接的音乐芯片来播音乐,那一般应是由MCU通过连接线将这些形式的解码结果传给MCU外接的音乐芯片,由MCU外接的音乐芯片按这些形式的解码结果使喇叭播放此乐段的发音),相应的,拾音器里的MCU(或MCU外接的音乐芯片等)一般也是按音乐合成方式来按这些形式的解码结果播音乐。When decoding, the full-step pronunciation method can be used, that is, when decoding, according to the value of the music piece combination information, after obtaining all the decoding results, then take the decoding results to pronounce and get the whole pronunciation of the music section (or only according to all the decoding results as needed) In practice, for the full-step pronunciation method, the decoding result can also be converted into music spectrum data by the MCU (the decoding result may also be other data forms that can be converted into musical sounds), and then the MCU (or a music chip external to the MCU, etc.) use the music synthesis method to make the music section spectrum data play the pronunciation of this section (if the music chip external to the MCU is used to play music, it should generally be the music section by the MCU through the connection line) The spectrum data is transmitted to the music chip external to the MCU, and the music chip external to the MCU synthesizes the music spectrum data to make the speaker play the pronunciation of this section), so the MCU in the pickup (or the music chip external to the MCU, etc.) generally Music synthesis is required. For the full-step pronunciation method, the decoding result can also be converted by the MCU into other data forms that can be converted into musical sounds except the music spectrum data, and then decoded by the MCU (or an external music chip, etc.) according to these forms As a result, the speaker plays the pronunciation of this section (if the music chip external to the MCU is used to play music, then generally the MCU should transmit the decoding results of these forms to the music chip external to the MCU through the connection line, and the music chip external to the MCU According to the decoding results of these forms, the loudspeaker plays the pronunciation of this section), correspondingly, the MCU in the pickup (or the music chip external to the MCU, etc.) generally also plays music according to the decoding results of these forms according to the music synthesis method.

解码时除了可用整步发音方式外,也可用分步发音方式,即在解码过程中,每次(即每步)得到解码结果的一部分,就把解码结果的这一部分拿去发音,照这样一直解码下去直到解码结束,即完成解码乐段的全部发音(或完成播放乐段的部分发音,如是播放乐段的部分发音,那一般是分次把非全部解码结果的各部分拿去发音,以形成乐段的部分发音);并且每次(每步)得到的解码结果的一部分也可转化成乐段的谱数据的一部分等形式的数据并拿去发音;并且显然分步发音方式也可以是分两步发音,以播放被编码乐段的全部或部分发音,比如在解码过程中,第1步先解码得到解码结果的前半部分并将其拿去发音,第2步再解码得到解码结果的后半部分并将其拿去发音,这样前后两部分发音即形成解码发音(这里是被编码乐段的全部发音)。因为播放音乐时是要精确确定各个音在什么时刻发音的,所以解码采用分步发音方式时,一般在MCU里需要作记时,以便在各合适时刻作发各步音的动作。In addition to the full-step pronunciation method, a step-by-step pronunciation method can also be used during decoding, that is, in the decoding process, each time (that is, each step) a part of the decoding result is obtained, this part of the decoding result is used for pronunciation, and it continues like this Decoding continues until the end of decoding, that is, completes the entire pronunciation of the decoded section (or completes the partial pronunciation of the playback section, if it is the partial pronunciation of the playback section, then generally the parts that are not all decoding results are pronounced in stages, so as to Form part of the pronunciation of the music section); and a part of the decoding result obtained each time (each step) can also be converted into a part of the spectral data of the music section and other forms of data and take it for pronunciation; and obviously the step-by-step pronunciation method can also be Pronunciation in two steps to play all or part of the encoded passage. For example, in the decoding process, the first half of the decoding result is first decoded and used for pronunciation, and the second step is decoded to obtain the decoding result. The second half is taken and pronounced, so that the two parts before and after the pronunciation form the decoded pronunciation (here is the whole pronunciation of the coded section). Because when playing music, it is necessary to accurately determine when each sound is pronounced, so when the decoding adopts a step-by-step pronunciation method, it is generally necessary to record the time in the MCU, so that the action of each step sound can be performed at each appropriate moment.

编码举例1:Coding example 1:

如图13,假设是一个钢琴曲的乐段,两个声道,调式是C大调。各音不带冠音。As shown in Figure 13, it is assumed that it is a passage of a piano piece, two channels, and the mode is C major. Each sound has no crown sound.

音素的上下位置值1,以音素所处声道的五线谱的五条线中最中间一线为基准,共5BIT,最高位BIT表示上下。The upper and lower position value of the phoneme is 1, based on the middle line among the five lines of the staff of the vocal channel where the phoneme is located, a total of 5 BITs, and the highest bit BIT indicates up and down.

主声道涉及3个不同的音素56(这3个音素分别含1,1,2个音),副声道涉及1个音素56(此音素含4个音),而且被定位的这个音素被重复了3次。The main channel involves 3 different phonemes 56 (these 3 phonemes contain 1, 1, and 2 tones respectively), and the sub-channel involves 1 phoneme 56 (this phoneme contains 4 tones), and the positioned phoneme is Repeated 3 times.

假设主声道涉及的3个音素的音素编号已分别定义为0010010110,1001101001,0100001100,假设副声道涉及的1个音素的音素编号已定义为0110001010,Assume that the phoneme numbers of the three phonemes involved in the main channel have been defined as 0010010110, 1001101001, and 0100001100 respectively, and that the phoneme numbers of one phoneme involved in the sub-channel have been defined as 0110001010,

假设主声道(声道1)涉及的3个音素按从左至右顺序被称为音素1,音素2,音素3,副声道(声道2)涉及的那个音素被称为音素4,Assume that the three phonemes involved in the main channel (channel 1) are called phoneme 1, phoneme 2, and phoneme 3 from left to right, and the phoneme involved in the secondary channel (channel 2) is called phoneme 4.

编码的各段数据:Encoded segments of data:

先做出编码头信息:First make the encoding header information:

节拍信息:0010(即每小节占4拍,4分音符为1拍,这里的拍是指乐段的实际拍)Beat information: 0010 (that is, each measure occupies 4 beats, and a quarter note is 1 beat. The beat here refers to the actual beat of the passage)

速度信息:10000000(128拍/分钟,这里的拍是指乐段的实际拍)Tempo information: 10000000 (128 beats/minute, the beat here refers to the actual beat of the music section)

声道个数:0010(2个声道)Number of channels: 0010 (2 channels)

声道1调式信息:0000(C大调)Channel 1 mode information: 0000 (C major)

声道1谱号信息:01(高音谱号,00,01,10,11分别表示低,高,中,次中音谱号)Channel 1 clef information: 01 (treble clef, 00, 01, 10, 11 represent low, high, middle, tenor clef respectively)

声道2调式信息:0000(C大调)Channel 2 mode information: 0000 (C major)

声道2谱号信息:00(低音谱号)Channel 2 Clef Information: 00 (bass clef)

接着编码:Then code:

转到声道1:0010,000Go to channel 1: 0010,000

音素段:0001,0010(有3个音素)Phoneme segment: 0001, 0010 (with 3 phonemes)

音素1的上下位置值1:00000(最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)The upper and lower position value of phoneme 1: 00000 (the highest BIT is 0, which means it is above the middle line of the five lines of the vocal staff, and 1, which means it is below)

音素1的音素编号:0010010110The phoneme number of phoneme 1: 0010010110

音素2的上下位置值1:00010(最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)The upper and lower position value of phoneme 2 is 1: 00010 (the highest BIT is 0, which means it is above the middle line of the five lines of the staff in this channel, and 1, which means it is below)

音素2的音素编号:1001101001The phoneme number of phoneme 2: 1001101001

音素3的上下位置值1:00000(最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)The upper and lower position value of phoneme 3 is 1: 00000 (the highest BIT is 0, which means it is above the middle line of the five lines of the vocal staff, and 1, which means it is below)

音素3的音素编号:0100001100The phoneme number of phoneme 3: 0100001100

转到声道2:0010,001Go to channel 2: 0010, 001

音素段:0001,0000(有1个音素)Phoneme segment: 0001, 0000 (with 1 phoneme)

音素4的上下位置值1:10100(最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)The upper and lower position value of phoneme 4 is 1: 10100 (the highest BIT is 0, which means it is above the middle line of the five lines of the staff in this channel, and 1, which means it is below)

音素4的音素编号:0110001010The phoneme number of phoneme 4: 0110001010

复制控制段:0011,0000(复制当前声道前面1个被定位的音素),0010(复制3次)Copy control section: 0011, 0000 (copy the first positioned phoneme in front of the current channel), 0010 (copy 3 times)

结束标志段:0111End flag segment: 0111

将以上包括编码头在内的数据都顺序合起来(即顺序排列起来),这乐段的编码值为:Combining the above data including the encoding header in order (that is, arranging them in order), the encoding value of this section is:

001010000000001000000100000000100000001001000000001001011000010001010000000001000000100000000100000001001000000001001011000010

1001101001000000100001100001000100010000101000110001010001100000010011110011010010000001000011000010001000100001010001100010100011000000100111

此编码值即是乐段拼合信息的值,可以放到一个码字里等。This coded value is the value of the segment combination information, which can be put into a code word, etc.

(这里用于举例的编码数据BIT位数较多达到130位以上,但可放在二维条形码里)(The BIT digits of the encoded data used for example here are more than 130 digits, but it can be placed in a two-dimensional barcode)

解码举例1:Decoding example 1:

乐段拼合信息的值为(此乐段拼合信息的值是编码举例1的编码结果):The value of the segment combination information (the value of the segment combination information is the encoding result of encoding example 1):

001010000000001000000100000000100000001001000000001001011000010001010000000001000000100000000100000001001000000001001011000010

1001101001000000100001100001000100010000101000110001010001100000010011110011010010000001000011000010001000100001010001100010100011000000100111

解码时,顺序先读编码头得里面的各信息:When decoding, read the information in the encoding header first in order:

0010(即每小节占4拍,4分音符为1拍,这里的拍是指乐段的实际拍)0010 (that is, each measure occupies 4 beats, and a quarter note is 1 beat. The beat here refers to the actual beat of the passage)

10000000(128拍/分钟,这里的拍是指乐段的实际拍)10000000 (128 beats/minute, the beat here refers to the actual beat of the music section)

0010(2个声道)0010 (2 channels)

0000(声道1调式为C大调)0000 (channel 1 mode is C major)

01(声道1谱号为高音谱号)01 (channel 1 clef is treble clef)

0000(声道2调式为C大调)0000 (channel 2 mode is C major)

00(声道2谱号为低音谱号)00 (channel 2 clef is bass clef)

再顺序看下面信息:Then look at the following information in order:

0010(转声道),0000010 (return channel), 000

0001(音素段),0010(有3个音素)0001 (phoneme segment), 0010 (3 phonemes)

00000(这是声道1的音素1的上下位置值1,最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)00000 (this is the upper and lower position value of phoneme 1 of channel 1, the highest bit BIT is 0 means it is above the middle line of the five lines of the stave of this channel, and 1 means it is below)

0010010110音素1的音素编号0010010110 phoneme number of phoneme 1

00010(音素2的上下位置值1,最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)00010 (the upper and lower position value of phoneme 2 is 1, the highest BIT is 0, which means it is above the middle line of the five lines of the staff in this channel, and 1 means it is below)

1001101001音素2的音素编号1001101001 phoneme number of phoneme 2

00000(音素3的上下位置值1,最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)00000 (the upper and lower position value of phoneme 3 is 1, the highest bit BIT is 0, which means it is above the middle line of the five lines of the staff in this channel, and 1 means it is below)

0100001100音素3的音素编号0100001100 phoneme number of phoneme 3

0010(转声道),0010010 (return channel), 001

0001(音素段),0000(有1个音素)0001 (phoneme segment), 0000 (1 phoneme)

10100(这是音素4的上下位置值1,最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)10100 (this is the upper and lower position value of phoneme 4, the highest BIT is 0, which means it is above the middle line of the five lines of the vocal staff, and 1 means it is below)

0110001010音素4的音素编号0110001010 phoneme number of phoneme 4

0011(复制控制段),0000(复制当前声道前面1个被定位的音素),0010(复制3次)0011 (copy the control section), 0000 (copy the first positioned phoneme in front of the current channel), 0010 (copy 3 times)

0111(结束标志段)0111 (end flag segment)

各拼合步骤和解码步骤定义如下:The stitching steps and decoding steps are defined as follows:

把前面述的读编码头的信息定义为解码步骤1,Define the information of the aforementioned read encoding header as decoding step 1,

拼合步骤1也即解码步骤2:从声道1的数据里的音素段的数据,得到声道1的音素1的上下位置值1,再从这个音素段得到音素1的音素编号,按此音素编号在音素数据库里找到此音素的记录项,从这记录项里得到音素1里的音的构成情况,从以上得到的这些数据(音素1的上下位置值1,音素1里的音的构成情况),就可确定被定位的音素1里的各音的实际的音高值及乐段发音时刻值及音长值,Stitching step 1 is also decoding step 2: From the data of the phoneme segment in the data of channel 1, get the upper and lower position value 1 of phoneme 1 of channel 1, and then get the phoneme number of phoneme 1 from this phoneme segment, press this phoneme The number finds the record item of this phoneme in the phoneme database, and the composition of the sound in phoneme 1 is obtained from this record item, and the data obtained above (the upper and lower position values of phoneme 1 are 1, the composition of the sound in phoneme 1 ), just can determine the actual pitch value of each tone in the phoneme 1 li of being positioned and music section articulation moment value and sound length value,

拼合步骤2也即解码步骤3:接着从声道1的数据里的音素段的数据,得到声道1的音素2的上下位置值1,再从这个音素段得到音素2的音素编号,按此音素编号在音素数据库里找到此音素的记录项,从这记录项里得到音素2里的音的构成情况,从以上得到的这些数据(音素2的上下位置值1,音素2里的音的构成情况),就可确定被定位的音素2里的各音的实际的音高值及乐段发音时刻值及音长值,Combining step 2 is also decoding step 3: Then, from the data of the phoneme segment in the data of channel 1, the upper and lower position value 1 of phoneme 2 of channel 1 is obtained, and then the phoneme number of phoneme 2 is obtained from this phoneme segment, click here The phoneme number finds the record item of this phoneme in the phoneme database, and obtains the composition of the sound in the phoneme 2 from this record item, and the data obtained above (the upper and lower position values of the phoneme 2 are 1, and the composition of the sound in the phoneme 2 Situation), just can determine the actual pitch value of each tone in the phoneme 2 li of being positioned and music section articulation moment value and sound length value,

拼合步骤3也即解码步骤4:接着从声道1的数据里的音素段的数据,得到声道1的音素3的上下位置值1,再从这个音素段得到音素3的音素编号,按此音素编号在音素数据库里找到此音素的记录项,从这记录项里得到音素3里的音的构成情况,从以上得到的这些数据(音素3的上下位置值1,音素3里的音的构成情况),就可确定被定位的音素3里的各音的实际的音高值及乐段发音时刻值及音长值,Stitching step 3 is also decoding step 4: Then, from the data of the phoneme segment in the data of channel 1, the upper and lower position value 1 of phoneme 3 of channel 1 is obtained, and then the phoneme number of phoneme 3 is obtained from this phoneme segment, click here The phoneme number finds the record item of this phoneme in the phoneme database, and obtains the composition of the sound in the phoneme 3 from this record item, and the data obtained from the above (the upper and lower position values of the phoneme 3 are 1, and the composition of the sound in the phoneme 3 Situation), just can determine the actual pitch value of each tone in the phoneme 3 li of being positioned and music section articulation moment value and sound length value,

拼合步骤4也即解码步骤5:从声道2的数据里的音素段的数据,得到声道2的音素4的上下位置值1,再从这个音素段得到音素4的音素编号,按此音素编号在音素数据库里找到此音素的记录项,从这记录项里得到音素4里的音的构成情况,从以上得到的这些数据(音素4的上下位置值1,音素4里的音的构成情况),就可确定被定位的音素4里的各音的实际的音高值及乐段发音时刻值及音长值,Stitching step 4 is decoding step 5: From the data of the phoneme segment in the data of channel 2, the upper and lower position value 1 of phoneme 4 of channel 2 is obtained, and then the phoneme number of phoneme 4 is obtained from this phoneme segment, press this phoneme The number finds the record item of this phoneme in the phoneme database, and the composition of the sound in phoneme 4 is obtained from this record item, and the data obtained above (the upper and lower position values of phoneme 4 are 1, the composition of the sound in phoneme 4 ), just can determine the actual pitch value of each sound in the phoneme 4 li that is positioned and the music section articulation time value and the sound length value,

拼合步骤5也即解码步骤6:接着得到声道2的数据里的复制段的数据,将上一拼合步骤得到的被定位的音素4里的各音复制一遍,再复制一遍,再复制一遍,每次是把被定位的音素4里的各音复制一遍,每次复制得到的是一个自定位的类音素(每个自定位的类音素的上下位置值3就是音素4的上下位置值1)里的4个音各自的实际的音高值和乐段发音时刻值和音长值(每次复制得到的这些信息可看成是一次复制得到的一个自定位的类音素),Combining step 5 is also decoding step 6: then obtain the data of the copied section in the data of channel 2, copy each sound in the positioned phoneme 4 obtained in the previous combining step, and then copy it again, and then copy it again, Every time, each sound in the positioned phoneme 4 is copied again, and what is copied each time is a self-positioned phoneme (the upper and lower position value 3 of each self-located phoneme-like element is the upper and lower position value 1 of the phoneme 4) The respective actual pitch values of the 4 tones and the value of the sounding time of the music segment and the value of the sound length (the information obtained by each copy can be regarded as a self-positioned phoneme obtained by one copy),

拼合步骤6也即解码步骤7:将以上5个拼合步骤得到的各音的数据(各音的实际音高值和乐段发音时刻值和音长值)按得到的先后顺序合起来即成一结果(这结果里要有BIT区分哪些数据是声道1的,哪些数据是声道2的),再在此结果的前面按某约定格式加上从乐段拼合信息的值的编码头里得到的节奏信息及两个声道各自的调式及谱号信息等,即得解码结果。以上拼合步骤1,拼合步骤2,拼合步骤3,拼合步骤4都是得到一个被定位的音素,拼合步骤5是得到3个自定位的类音素,所以本段前面说的将以上5个拼合步骤得到的各音的数据按得到的先后顺序合起来即成一结果,相当于是按顺序拼合的组拼方式,将以上拼合步骤1得到的被定位的音素1和拼合步骤2得到的被定位的音素2和拼合步骤3得到的被定位的音素3和拼合步骤4得到的被定位的音素4和拼合步骤5复制得到的3个自定位的类音素拼合起来,成一结果(这结果里要有BIT注明被定位的音素1的数据,被定位的音素2的数据,被定位的音素3的数据是声道1的,被定位的音素4的数据,复制得到的3个自定位的类音素的数据是声道2的)。Combining step 6 is also decoding step 7: the data (the actual pitch value of each sound and the music section pronunciation time value and the sound length value) of the data (the actual pitch value of each sound and the music section pronunciation time value and the sound length value) obtained by the above 5 assembling steps are combined to form a result ( In this result, there must be BIT to distinguish which data is for channel 1 and which data is for channel 2), and then add the rhythm obtained from the encoding header of the value of the segment combination information in front of the result according to a certain agreed format Information and the respective mode and clef information of the two channels, etc., to obtain the decoding result. The above merging step 1, merging step 2, merging step 3, and merging step 4 are all to obtain a positioned phoneme, and the merging step 5 is to get 3 self-positioned phonemes, so the above 5 merging steps mentioned earlier in this paragraph The obtained data of each sound is combined according to the obtained order to form a result, which is equivalent to the combination method of combining in order, the positioned phoneme 1 obtained in the above step 1 and the positioned phoneme 2 obtained in the step 2 Combine with the positioned phoneme 3 obtained in step 3 and the positioned phoneme 4 obtained in step 4 and the 3 self-positioned phonemes copied in step 5 to form a result (this result must be marked with BIT The data of the positioned phoneme 1, the data of the positioned phoneme 2, the data of the positioned phoneme 3 is the channel 1, the data of the positioned phoneme 4, and the copied data of the three self-located phonemes are channel 2).

以上声道1和声道2是同时开始发音的(按解码结果发音时有约定是用解码结果中声道1的数据和声道2的数据来同时发音,一般情况,按解码结果发音前可由MCU或用作发音的MCU外接的音乐芯片先找到解码结果中声道1的数据和声道2的数据以便用两个声道的数据同时发两个声道的音)。The above channel 1 and channel 2 start to pronounce at the same time (according to the decoding result, there is an agreement that the data of channel 1 and channel 2 in the decoding result are used to pronounce at the same time. In general, before pronunciation according to the decoding result, it can be pronounced by The MCU or the music chip external to the MCU used for pronunciation first finds the data of channel 1 and the data of channel 2 in the decoding result so that the data of the two channels can be used to simultaneously emit the sound of the two channels).

然后再根据解码结果用拾音器演奏出来即可且演奏的音乐就如图13所示(这里采用的解码发音方式是整步发音方式);可由拾音器里的MCU(或MCU外接的音乐芯片等)直接按解码结果用音乐合成方式使喇叭播放此乐段的发音(如是用MCU外接的音乐芯片来播音乐,那一般应是由MCU通过连接线将解码结果传给MCU外接的音乐芯片,由MCU外接的音乐芯片将解码结果用音乐合成方式使喇叭播放此乐段的发音),拾音器里的MCU(或MCU外接的音乐芯片等)有音乐合成功能;本解码举例是采用整步发音方式,并且解码时播的音是被编码乐段的全部发音(也可由MCU或MCU外接的音乐芯片根据解码结果的部分来播被编码乐段的部分发音),并且发音时也可把解码得到的解码结果转成别的数据形式(比如乐段的谱数据)并用别的数据形式来播放乐段发音。(本例所说的解码结果是指解码得到的乐段的全部解码结果)Then according to the decoding result, it can be played with a pickup and the music played is as shown in Figure 13 (the decoding pronunciation method adopted here is a full-step pronunciation method); According to the decoding result, use the music synthesis method to make the speaker play the pronunciation of this section. The music chip of the music chip will decode the decoding result and make the speaker play the pronunciation of this section in the way of music synthesis), and the MCU in the pickup (or the music chip external to the MCU, etc.) The sound broadcasted at the time is the entire pronunciation of the coded section (the MCU or the music chip external to the MCU can also play part of the pronunciation of the coded section according to the part of the decoding result), and the decoding result obtained by decoding can also be converted to Divide into other data forms (such as spectrum data of the music section) and use other data forms to play the music section pronunciation. (The decoding result mentioned in this example refers to all the decoding results of the decoded music section)

以上各拼合步骤得到的音的乐段发音时刻值,是按拍来计量的,即某一音的乐段发音时刻值,比如可以是1拍,3拍加4分之3拍等,这些拍数都是从乐段最开始发音算起(乐段最开始发音的音的发音时刻值为0),每个声道里的每个类音素或被定位的音素里各音的乐段发音时刻值的计算方式可参看本说明书第28页段落开头是‘至于前面说到的解码时’的那一段所述。(本段所说的拍都是指被编码乐段的实际拍)The pronunciation time value of the music section of the sound obtained by the above steps of merging is measured by beat, that is, the pronunciation time value of the music section of a certain sound, for example, it can be 1 beat, 3 beats plus 3/4 beats, etc., these beats The number is calculated from the beginning of the section (the pronunciation time value of the first tone of the section is 0), and the pronunciation time of each tone in each class phoneme or positioned phoneme in each channel For the calculation method of the value, please refer to the paragraph beginning with "As for the above-mentioned decoding time" on page 28 of this manual. (The beat mentioned in this paragraph refers to the actual beat of the coded section)

以上解码时是可从拼合信息的值里直接得到音素1,音素2,音素3,音素4的。During the above decoding, phoneme 1, phoneme 2, phoneme 3, and phoneme 4 can be directly obtained from the value of the combined information.

编码举例2:Coding example 2:

如图14,假设是一个钢琴曲的乐段,两个声道,调式是C大调。各音不带冠音。As shown in Figure 14, it is assumed that it is a passage of a piano piece, two channels, and the mode is C major. Each sound has no crown sound.

音素的上下位置值1,以音素所处声道的五线谱的五条线的最中间一线为基准,共5BIT,最高位BIT表示上下。The upper and lower position value of the phoneme is 1, based on the middle line of the five lines of the staff of the vocal channel where the phoneme is located, a total of 5 BITs, and the highest bit BIT indicates up and down.

主声道(声道1)涉及2个不同的音素56(1),56(2),(这2个音素分别含1,1个音),被定位的音素56(2)被重复了一次,主声道最右边还有一音组合(此音组合含2个音),此音组合可对图16所示的音素先作定位再作修改而成,副声道(声道2)涉及1个音素56(3),(此音素含4个音),副声道的音素56(3)的后面是一音组合,此音组合包含8个音,此音组合可由图15中示的两个音素交织拼合而成(作交织拼合前是有确定用于交织拼合的这两个音素在五线谱上所处的上下位置的即所处上下位置是象图15那样),副声道最右边还涉及一音素56(4),(此音素含2个音)。The main channel (channel 1) involves 2 different phonemes 56(1), 56(2), (these 2 phonemes contain 1 and 1 sound respectively), and the located phoneme 56(2) is repeated once , there is also a sound combination on the far right of the main channel (this sound combination contains 2 sounds), this sound combination can be made by first positioning the phonemes shown in Figure 16 and then modifying it, and the sub-channel (channel 2) involves 1 A phoneme 56 (3), (this phoneme contains 4 sounds), the back of the phoneme 56 (3) of the sub-channel is a sound combination, and this sound combination includes 8 sounds, and this sound combination can be shown in Figure 15 by two It is formed by interweaving two phonemes (before the interweaving and mosaic, the upper and lower positions of the two phonemes used for interweaving and mosaic are determined, that is, the upper and lower positions are as shown in Figure 15), and the rightmost side of the sub-channel is still Involving a phoneme 56 (4), (this phoneme contains 2 tones).

假设图14主声道涉及的音素56(1),音素56(2)的音素编号已分别定义为0010010110,1001101001Assuming that the phoneme 56(1) and phoneme 56(2) involved in the main channel in Figure 14 have been defined as 0010010110 and 1001101001 respectively

假设图14副声道涉及的音素56(3),音素56(4)的音素编号已定义为0110001010,0110001010Assuming that the phoneme 56(3) involved in the secondary channel in Figure 14, the phoneme number of the phoneme 56(4) has been defined as 0110001010, 0110001010

图16中的音素56的音素编号为0100001101The phoneme number of phoneme 56 in Figure 16 is 0100001101

图15中的从左到右两个音素56的音素编号为0111001100,0111001101The phoneme numbers of the two phonemes 56 from left to right in Fig. 15 are 0111001100, 0111001101

编码的各段数据:Encoded segments of data:

先做出编码头信息:First make the encoding header information:

节拍信息:0010(即每小节占4拍,4分音符为1拍,这里的拍是指乐段的实际拍)Beat information: 0010 (that is, each measure occupies 4 beats, and a quarter note is 1 beat. The beat here refers to the actual beat of the passage)

速度信息:10000000(128拍/分钟,这里的拍是指乐段的实际拍)Tempo information: 10000000 (128 beats/minute, the beat here refers to the actual beat of the music section)

声道个数:0010(2个声道)Number of channels: 0010 (2 channels)

声道1调式信息:0000(C大调)Channel 1 mode information: 0000 (C major)

声道1谱号信息:01(高音谱号,00,01,10,11分别表示低,高,中,次中音谱号)Channel 1 clef information: 01 (treble clef, 00, 01, 10, 11 represent low, high, middle, tenor clef respectively)

声道2调式信息:0000(C大调)Channel 2 mode information: 0000 (C major)

声道2谱号信息:00(低音谱号)Channel 2 Clef Information: 00 (bass clef)

接着编码:Then code:

转到声道1:0010,000Go to channel 1: 0010,000

音素段:0001,0001(有2个音素)Phoneme segment: 0001, 0001 (with 2 phonemes)

音素56(1)的上下位置值1:00000(最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)The upper and lower position value of phoneme 56(1) is 1: 00000 (the highest BIT is 0, which means it is above the middle line of the five lines of the staff in this channel, and 1, which means it is below)

音素56(1)的音素编号:0010010110The phoneme number of phoneme 56(1): 0010010110

音素56(2)的上下位置值1:00010(最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)The upper and lower position value of phoneme 56(2) is 1: 00010 (the highest BIT is 0, which means it is above the middle line of the five lines of the vocal staff, and 1, which means it is below)

音素56(2)的音素编号:1001101001The phoneme number of phoneme 56(2): 1001101001

复制控制段:0011,0000(复制当前声道前面1个被定位的音素),0000(复制1次)Copy control segment: 0011, 0000 (copy the first positioned phoneme in front of the current channel), 0000 (copy once)

修改段:1000Modified segment: 1000

00000(用于修改的音素的上下位置值1,最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)00000 (the upper and lower position value of the phoneme used for modification is 1, the highest BIT is 0, which means it is above the middle line of the five lines of the staff of this channel, and 1 means it is below)

0100001101用于修改的音素的音素编号0100001101 The phoneme number for the modified phoneme

10表示对音进行上下移动10 means to move up and down the tone

001这3个BIT表示在从左数第2个发音位置作音的上下移动001 These 3 BITs represent the up and down movement of the sound at the second pronunciation position from the left

00这2个BIT表示此发音位置上从最下面一个音算起第1个音要被移动00 These 2 BITs indicate that the first sound from the bottom sound at this pronunciation position will be moved

00001这5个BIT表示在此发音位置上这个要被移动的音往上移动1个音步(这5个BIT的最高位BIT为0表示往上移,为1表示往下移)00001 These 5 BITs indicate that the sound to be moved moves up by 1 step at the pronunciation position (the highest BIT of these 5 BITs is 0, which means moving up, and 1, which means moving down)

转到声道2:0010,001Go to channel 2: 0010, 001

音素段(声道2的第1个音素段):0001,0000(有1个音素)Phoneme segment (the first phoneme segment of channel 2): 0001, 0000 (with 1 phoneme)

音素56(3)的上下位置值1:10100(最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)The upper and lower position values of phoneme 56(3) 1: 10100 (the highest bit BIT is 0, which means it is above the middle line of the five lines of the vocal staff, and 1, which means it is below)

音素56(3)的音素编号:0110001010The phoneme number of phoneme 56(3): 0110001010

交织拼合段:1001Interweave segment: 1001

10100(用于交织拼合的第1个音素的上下位置值1,最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)10100 (the upper and lower position value of the first phoneme used for interweaving and splicing is 1, the highest bit BIT is 0, which means it is above the middle line of the five lines of the staff of this channel, and 1 means it is below)

0111001100用于交织拼合的第1个音素的音素编号0111001100 The phoneme number of the first phoneme used for interweaving and stitching

10001(用于交织拼合的第2个音素的上下位置值1,最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)10001 (the upper and lower position value of the second phoneme used for interweaving and splicing is 1, the highest bit BIT is 0, which means it is above the middle line of the five lines of the vocal staff, and 1 means it is below)

0111001101用于交织拼合的第2个音素的音素编号0111001101 The phoneme number of the second phoneme used for interweaving and stitching

音素段(声道2的第2个音素段):0001,0000(有1个音素)Phoneme segment (the second phoneme segment of channel 2): 0001, 0000 (with 1 phoneme)

音素56(4)的上下位置值1:10100(最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)The upper and lower position values of phoneme 56(4) 1: 10100 (the highest bit BIT is 0, which means it is above the middle line of the five lines of the vocal staff, and 1, which means it is below)

音素56(4)的音素编号:1001101001The phoneme number of phoneme 56(4): 1001101001

结束标志段:0111End flag segment: 0111

将以上包括编码头在内的数据都顺序合起来(即顺序排列起来),这乐段的编码值为:Combining the above data including the encoding header in order (that is, arranging them in order), the encoding value of this section is:

00101000000000100000010000000010000000100010000000100101100001010011010010010100000000010000001000000001000000010001000000010010110000101001101001

00110000000010000000001000011011000100000010010001000100001010001100010011000000001000000000100001101100010000001001000100010000101000110001

01010011010001110011001000101110011010001000010100100110100101110101001101000111001100100010111001101000100001010010011010010111

此编码值即是乐段拼合信息的值,可以放到一个码字里等。This coded value is the value of the segment combination information, which can be put into a code word, etc.

(这里用于举例的编码数据BIT位数较多达到130位以上,但可放在二维条形码里)(The BIT digits of the encoded data used for example here are more than 130 digits, but it can be placed in a two-dimensional barcode)

解码举例2:Decoding example 2:

乐段拼合信息的值为(此乐段拼合信息的值是编码举例2的编码结果):The value of the segment combination information (the value of the segment combination information is the encoding result of encoding example 2):

00101000000000100000010000000010000000100010000000100101100001010011010010010100000000010000001000000001000000010001000000010010110000101001101001

00110000000010000000001000011011000100000010010001000100001010001100010011000000001000000000100001101100010000001001000100010000101000110001

01010011010001110011001000101110011010001000010100100110100101110101001101000111001100100010111001101000100001010010011010010111

解码时,顺序先读编码头得里面的各信息:When decoding, read the information in the encoding header first in order:

0010(即每小节占4拍,4分音符为1拍,这里的拍是指乐段的实际拍)0010 (that is, each measure occupies 4 beats, and a quarter note is 1 beat. The beat here refers to the actual beat of the passage)

10000000(128拍/分钟,这里的拍是指乐段的实际拍)10000000 (128 beats/minute, the beat here refers to the actual beat of the music section)

0010(2个声道)0010 (2 channels)

0000(声道1调式为C大调)0000 (channel 1 mode is C major)

01(声道1谱号为高音谱号)01 (channel 1 clef is treble clef)

0000(声道2调式为C大调)0000 (channel 2 mode is C major)

00(声道2谱号为低音谱号)00 (channel 2 clef is bass clef)

再顺序看下面信息:Then look at the following information in order:

0010(转声道),0000010 (return channel), 000

0001(音素段),0001(有2个音素)0001 (phoneme segment), 0001 (with 2 phonemes)

00000这是声道2的音素56(1)的上下位置值1,(最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)00000 This is the upper and lower position value of the phoneme 56(1) of channel 2. (The highest bit BIT is 0, which means it is above the middle line of the five lines of the staff in this channel, and 1, which means it is below)

0010010110音素56(1)的音素编号0010010110 phoneme number of phoneme 56(1)

00010这是声道2的音素56(2)的上下位置值1,(最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)00010 This is the upper and lower position value of the phoneme 56(2) of channel 2. (The highest bit BIT is 0, which means it is above the middle line of the five lines of the staff in this channel, and 1, which means it is below)

1001101001音素56(2)的音素编号1001101001 phoneme number of phoneme 56(2)

0011(复制控制段),0000(复制当前声道前面1个被定位的音素),0000(复制1次)0011 (copy the control section), 0000 (copy the first positioned phoneme in front of the current channel), 0000 (copy once)

1000修改段1000 modified segments

00000(用于修改的音素的上下位置值1,最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)00000 (the upper and lower position value of the phoneme used for modification is 1, the highest BIT is 0, which means it is above the middle line of the five lines of the staff of this channel, and 1 means it is below)

0100001101用于修改的音素的音素编号0100001101 The phoneme number for the modified phoneme

10表示对音进行上下移动10 means to move up and down the tone

001这3个BIT表示在从左数第2个发音位置作音的上下移动001 These 3 BITs represent the up and down movement of the sound at the second pronunciation position from the left

00这2个BIT表示此发音位置上从最下面一个音算起第1个音要被移动00 These 2 BITs indicate that the first sound from the bottom sound at this pronunciation position will be moved

00001这5个BIT表示在此发音位置上这个要被移动的音往上移动1个音步(这5个BIT的最高位BIT为0表示往上移,为1表示往下移)00001 These 5 BITs indicate that the sound to be moved moves up by 1 step at the pronunciation position (the highest BIT of these 5 BITs is 0, which means moving up, and 1, which means moving down)

0010(转声道),0010010 (return channel), 001

0001(音素段,这是声道2的第1个音素段),0000(有1个音素)0001 (phoneme segment, this is the first phoneme segment of channel 2), 0000 (with 1 phoneme)

10100这是声道2的音素56(3)的上下位置值1,(最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)10100 This is the upper and lower position value of the phoneme 56(3) of channel 2. (The highest BIT is 0, which means it is above the middle line of the five lines of the staff of this channel, and 1, which means it is below)

0110001010音素56(3)的音素编号The phoneme number of 0110001010 phoneme 56(3)

1001交织拼合段1001 interweaving segment

10100(用于交织拼合的第1个音素的上下位置值1,最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)10100 (the upper and lower position value of the first phoneme used for interweaving and splicing is 1, the highest bit BIT is 0, which means it is above the middle line of the five lines of the staff of this channel, and 1 means it is below)

0111001100用于交织拼合的第1个音素的音素编号0111001100 The phoneme number of the first phoneme used for interweaving and stitching

10001(用于交织拼合的第2个音素的上下位置值1,最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)10001 (the upper and lower position value of the second phoneme used for interweaving and splicing is 1, the highest bit BIT is 0, which means it is above the middle line of the five lines of the vocal staff, and 1 means it is below)

0111001101用于交织拼合的第2个音素的音素编号0111001101 The phoneme number of the second phoneme used for interweaving and stitching

0001(音素段,这是声道2的第2个音素段),0000(有1个音素)0001 (phoneme segment, this is the second phoneme segment of channel 2), 0000 (with 1 phoneme)

10100这是声道2的音素56(4)的上下位置值1,(最高位BIT为0表示在此声道五线谱的五条线最中间线以上,为1表示以下)10100 This is the upper and lower position value of the phoneme 56 (4) of channel 2. (The highest bit bit is 0, which means it is above the middle line of the five lines of the staff of this channel, and 1 means it is below)

1001101001音素56(4)的音素编号1001101001 phoneme number of phoneme 56(4)

0111(结束标志段)0111 (end flag segment)

各拼合步骤和解码步骤定义如下:The stitching steps and decoding steps are defined as follows:

把前面述的读编码头的信息定义为解码步骤1,Define the information of the aforementioned read encoding header as decoding step 1,

拼合步骤1也即解码步骤2:从声道1的数据里的音素段的数据,得到音素56(1)的上下位置值1,再从这个音素段得到音素56(1)的音素编号,按此音素编号在音素数据库里找到此音素的记录项,从这记录项里得到音素56(1)里的音的构成情况,从以上得到的这些数据(音素56(1)的上下位置值1,音素56(1)里的音的构成情况),就可确定被定位的音素56(1)里的各音的实际的音高值及乐段发音时刻值及音长值,Combining step 1 is also decoding step 2: from the data of the phoneme segment in the data of channel 1, the upper and lower position value 1 of phoneme 56 (1) is obtained, and then the phoneme number of phoneme 56 (1) is obtained from this phoneme segment, press This phoneme numbering finds the record item of this phoneme in the phoneme database, obtains the composition situation of the tone in the phoneme 56 (1) from this record item, obtains these data (the upper and lower position value 1 of phoneme 56 (1) from above, Phoneme 56 (1) Li's sound composition situation), just can determine the actual pitch value of each tone in the phoneme 56 (1) li of being positioned and music section articulation time value and sound length value,

拼合步骤2也即解码步骤3:接着从声道1的数据里的音素段的数据,得到音素56(2)的上下位置值1,再从这个音素段得到音素56(2)的音素编号,按此音素编号在音素数据库里找到此音素的记录项,从这记录项里得到音素56(2)里的音的构成情况,从以上得到的这些数据(音素56(2)的上下位置值1,音素56(2)里的音的构成情况),就可确定被定位的音素56(2)里的各音的实际的音高值及乐段发音时刻值及音长值,Combining step 2 is also decoding step 3: then from the data of the phoneme segment in the data of channel 1, the upper and lower position value 1 of the phoneme 56 (2) is obtained, and then the phoneme number of the phoneme 56 (2) is obtained from this phoneme segment, Find the record item of this phoneme in the phoneme database by this phoneme number, obtain the composition situation of the sound in the phoneme 56 (2) from this record item, these data obtained from above (the upper and lower position values of phoneme 56 (2) 1 , phoneme 56 (2) in the composition of the sound), just can determine the actual pitch value of each sound in the positioned phoneme 56 (2) and the value of the pronunciation time of the music section and the sound length value,

拼合步骤3也即解码步骤4:接着得到声道1的数据里的复制段的数据,将上一拼合步骤得到的被定位的音素56(2)里的各音复制一遍(复制时得到的音的音高及音长与被定位的音素56(2)里的音相比都不变),复制得到的一个自定位的类音素(这个自定位的类音素的上下位置值3就是音素56(2)的上下位置值1)里的1个音的实际的音高值和乐段发音时刻值和音长值(复制得到的这些信息可看成是复制得到的一个自定位的类音素),Combining step 3 is also decoding step 4: then obtain the data of the copy section in the data of channel 1, and copy each tone in the positioned phoneme 56 (2) obtained in the previous step of combining (the sound obtained during duplication) Compared with the sound in the positioned phoneme 56 (2), the pitch and sound length are all unchanged), a self-positioned phoneme that is copied (the upper and lower position value 3 of this self-located phoneme-like phoneme is phoneme 56 ( 2) the actual pitch value of a sound in the upper and lower position value 1) and the pronunciation time value and sound length value of the passage (the information obtained by copying can be regarded as a self-positioned phoneme obtained by copying),

拼合步骤4也即解码步骤5:接着在声道1的数据里的修改段里,取出用于修改的音素的音素编号,再根据修改段里的用于修改的音素的上下位置值1及这音素里各音互相之间的音程关系数据以及从编码头得到的声道1的调式(C大调)和谱号信息,来得到这个被定位的用于修改的音素中各音的实际的音高值(也即音调值)及乐段发音时刻值及音长值,再根据修改段里的修改信息,对此被定位的音素里的音进行修改,即将其第2个发音位置的音抬高一个音步,这个音的音长值仍不变,对此被定位的音素进行此修改后得到一个自定位的类音素(这个类音素表示的是一个音组合的信息),这样最后得出的是这个自定位的类音素中各音的实际的音高值及乐段发音时刻值及音长值,Combining step 4 is also decoding step 5: Then, in the modified section in the data of channel 1, take out the phoneme number of the phoneme used for modification, and then according to the upper and lower position values of the phoneme used for modification in the modified section 1 and this Interval relationship data between each tone in the phoneme and the mode (C major) and clef information of channel 1 obtained from the encoding head to obtain the actual pitch of each tone in the positioned phoneme for modification High value (that is, the pitch value) and the pronunciation time value and the sound length value of the section, and then according to the modification information in the modification section, the sound in the positioned phoneme is modified, that is, the sound of its second pronunciation position is lifted One step higher, the sound length value of this sound is still unchanged, and after this modified phoneme is modified, a self-localized phoneme (this phoneme represents the information of a sound combination) is obtained, so that at last It is the actual pitch value of each tone in this self-positioned phoneme, the value of the pronunciation time of the music segment and the value of the sound length.

拼合步骤5也即解码步骤6:从声道2的数据里的第1个音素段的数据,得到音素56(3)的上下位置值1,再从这个音素段得到音素56(3)的音素编号,按此音素编号在音素数据库里找到此音素的记录项,从这记录项里得到音素56(3)里的音的构成情况,从以上得到的这些数据(音素56(3)的上下位置值1,音素56(3)里的音的构成情况),就可确定被定位的音素56(3)里的各音的实际的音高值及乐段发音时刻值及音长值,Stitching step 5 is decoding step 6: get the upper and lower position value 1 of phoneme 56(3) from the data of the first phoneme segment in the data of channel 2, and then get the phoneme of phoneme 56(3) from this phoneme segment Numbering, find the record item of this phoneme in the phoneme database by this phoneme numbering, obtain the composition situation of the sound in the phoneme 56 (3) from this record item, obtain these data from above (the upper and lower positions of the phoneme 56 (3) Value 1, the composition situation of the sound in phoneme 56 (3) ), just can determine the actual pitch value and the music section pronunciation time value and the sound length value of each sound in the phoneme 56 (3) li that is positioned,

拼合步骤6也即解码步骤7:接着在声道2的数据里的交织拼合段里,取出两个用于交织拼合的音素各自的音素编号,并从这交织拼合段里得到用于交织拼合的第1个音素的上下位置值1以对用于交织拼合的第1个音素进行定位,并从这交织拼合段里得到用于交织拼合的第2个音素的上下位置值1以对用于交织拼合的第2个音素进行定位,并按交织拼合方式把这两个被定位的音素拼合起来得一个自定位的类音素(这个类音素表示的是一个音组合的信息;假设这里是根据使用的这两个音素来相应按交织拼合的约定用这两个被定位的音素交织拼合得到这个类音素),这个自定位的类音素的上下位置值3(也即这个自定位的类音素里的最先发音的所有音中最低音的上下位置值,也即这个自定位的类音素里的最先发音的那个音的上下位置值)就是前述用于交织拼合的第1个音素的上下位置值1(这个自定位的类音素里的最先发音的所有音中最低音的上下位置值就等于前述用于交织拼合的第1个音素里的最先发音的所有音中最低音的上下位置值),这样最后得出的是这个自定位的类音素里各音的实际的音高值及乐段发音时刻值及音长值,Stitching step 6 is also decoding step 7: then in the interleaving and stitching section in the data of channel 2, take out the respective phoneme numbers of the two phonemes used for interleaving and stitching, and obtain the phoneme numbers for interleaving and stitching from this interleaving and stitching section. The upper and lower position value of the first phoneme is 1 to locate the first phoneme used for interleaving and mosaicing, and the upper and lower position value of the second phoneme for interleaving and mosaicing is obtained from this interleaving and mosaic segment to be used for interleaving The combined second phoneme is positioned, and the two positioned phonemes are put together to obtain a self-positioned phoneme (this phoneme represents the information of a sound combination; it is assumed that it is based on the used These two phonemes come to obtain this phoneme by interweaving and combining these two positioned phonemes correspondingly according to the agreement of interlacing and mosaicing), and the upper and lower position values of this self-positioning-like phoneme are 3 (that is, the most in this self-positioning-like phoneme) The upper and lower position values of the lowest tone among all the sounds pronounced first, that is, the upper and lower position values of the first sound pronounced in this self-positioned phoneme) are the upper and lower position values of the first phoneme used for interweaving and stitching. (The upper and lower position values of the lowest tone in all the sounds that are pronounced first in this self-positioned phoneme are equal to the upper and lower position values of the lowest tone in all the sounds that are first pronounced in the first phoneme that is used for interweaving and stitching) , so that the final result is the actual pitch value of each sound in this self-positioned phoneme, the pronunciation time value and the sound length value of the segment,

拼合步骤7也即解码步骤8:接着从声道2的数据里的第2个音素段的数据,得到音素56(4)的上下位置值1,再从这个音素段得到音素56(4)的音素编号,按此音素编号在音素数据库里找到此音素的记录项,从这记录项里得到音素56(4)里的音的构成情况,从以上得到的这些数据(音素56(4)的上下位置值1,音素56(4)里的音的构成情况),就可确定被定位的音素56(4)里的各音的实际的音高值及乐段发音时刻值及音长值,Stitching step 7 is also decoding step 8: then from the data of the second phoneme segment in the data of channel 2, the upper and lower position value 1 of phoneme 56(4) is obtained, and then the phoneme 56(4) is obtained from this phoneme segment Phoneme numbering, find the record item of this phoneme in the phoneme database by this phoneme numbering, obtain the composition situation of the sound in the phoneme 56 (4) from this record item, obtain these data (up and down of phoneme 56 (4) from above Position value 1, the composition situation of the sound in phoneme 56 (4) li), just can determine the actual pitch value of each tone in the phoneme 56 (4) li that is positioned and the music section articulation time value and the sound length value,

拼合步骤8也即解码步骤9:将以上7个拼合步骤得到的各音的数据(各音的实际音高值和乐段发音时刻值和音长值)按得到的先后顺序合起来即成一结果(这结果里要有BIT区分哪些数据是声道1的,哪些数据是声道2的),再在此结果的前面按某约定格式加上从乐段拼合信息的值的编码头里得到的节奏信息及两个声道各自的调式及谱号信息等,即得解码结果。以上拼合步骤1,拼合步骤2,拼合步骤5,拼合步骤7都是得到一个被定位的音素,所以本段前面说的将以上7个拼合步骤得到的各音的数据按得到的先后顺序合起来即成一结果,相当于是按顺序拼合的组拼方式,将以上拼合步骤1得到的被定位的音素56(1)和拼合步骤2得到的被定位的音素56(2)和拼合步骤3复制得到的1个自定位的类音素和拼合步骤4修改得到的自定位的类音素和拼合步骤5得到的被定位的音素56(3)和拼合步骤6交织拼合得到的自定位的类音素和拼合步骤7得到的被定位的音素56(4)拼合起来,成一结果,这结果里要有BIT注明被定位的音素56(1)的数据,被定位的音素56(2)的数据,复制得到的1个自定位的类音素的数据,修改得到的自定位的类音素的数据是声道1的,注明被定位的音素56(3)的数据,交织拼合得到的自定位的类音素的数据,被定位的音素56(4)的数据是声道2的。Combining step 8 is also decoding step 9: the data (the actual pitch value of each sound and the music section pronunciation time value and the sound length value) of the data (the actual pitch value of each tone and the music section pronunciation time value and the sound length value) obtained by the above 7 assembling steps are combined to form a result ( In this result, there must be BIT to distinguish which data is for channel 1 and which data is for channel 2), and then add the rhythm obtained from the encoding header of the value of the segment combination information in front of the result according to a certain agreed format Information and the respective mode and clef information of the two channels, etc., to obtain the decoding result. The above merging step 1, merging step 2, merging step 5, and merging step 7 all get a positioned phoneme, so the data of each sound obtained in the above 7 merging steps mentioned earlier in this paragraph are combined according to the order obtained. Serve as a result, which is equivalent to the method of merging in sequence, copying the positioned phoneme 56(1) obtained in the above merging step 1 and the positioned phoneme 56(2) obtained in the merging step 2 and merging step 3 1 self-localized phone-like and merging step 4 modify the self-localized phone-like obtained in merging step 5 and the localized phoneme 56(3) and merging step 6 interweave and merging the self-localized phone-like obtained in merging step 7 The obtained located phoneme 56(4) is put together to form a result, and the data of the located phoneme 56(1) and the located phoneme 56(2) must be indicated by BIT in this result, and the 1 obtained by copying The phoneme-like data of self-positioning, the phoneme-like data of self-positioning obtained by modifying is channel 1, and the data of the phoneme 56 (3) being indicated is marked, and the phoneme-like data of self-positioning obtained by interweaving and splicing, The data for the localized phoneme 56(4) is for channel 2.

以上声道1和声道2是同时开始发音的(按解码结果发音时有约定是用声道1的数据和声道2的数据来同时发音,一般情况,按解码结果发音前可由MCU或用作发音的MCU外接的音乐芯片先找到解码结果中声道1的数据和声道2的数据以便用两个声道的数据同时发两个声道的音)。The above channel 1 and channel 2 start to pronounce at the same time (according to the decoding result, there is an agreement that the data of channel 1 and channel 2 are used to pronounce at the same time. In general, the MCU or user can use the decoding result before pronunciation) The music chip externally connected to the MCU for pronunciation first finds the data of the channel 1 and the data of the channel 2 in the decoding result so that the data of the two channels can be used to simultaneously emit the sound of the two channels).

然后再根据解码结果用拾音器演奏出来即可且演奏的乐段就如图14所示(这里采用的解码发音方式是整步发音方式);可由拾音器里的MCU(或MCU外接的音乐芯片等)直接按解码结果用音乐合成方式使喇叭播放此乐段的发音(如是用MCU外接的音乐芯片来播音乐,那一般应是由MCU通过连接线将解码结果传给MCU外接的音乐芯片,由MCU外接的音乐芯片将解码结果用音乐合成方式使喇叭播放此乐段的发音),拾音器里的MCU(或MCU外接的音乐芯片等)有音乐合成功能;本解码举例是采用整步发音方式,并且解码时播的音是被编码乐段的全部发音(也可由MCU或MCU外接的音乐芯片根据解码结果的部分来播被编码乐段的部分发音),并且发音时也可把解码得到的解码结果转成别的数据形式(比如乐段的谱数据)并用别的数据形式来播放乐段发音。(本例所说的解码结果是指解码得到的乐段的全部解码结果)Then according to the decoding result, it can be played with the pickup and the music section played is just as shown in Figure 14 (the decoding pronunciation method adopted here is the full-step pronunciation method); Directly follow the decoding result and use the music synthesis method to make the speaker play the pronunciation of this section (if the music chip external to the MCU is used to play music, then the MCU should generally transmit the decoding result to the external music chip of the MCU through the connection line, and the MCU The external music chip uses the decoding result to make the speaker play the pronunciation of this section in the way of music synthesis), and the MCU in the pickup (or the music chip external to the MCU, etc.) has the music synthesis function; this decoding example adopts the full-step pronunciation method, and The sound played during decoding is all the pronunciation of the coded section (the MCU or the music chip external to the MCU can also play part of the pronunciation of the coded section according to the part of the decoding result), and the decoding result obtained by decoding can also be used during pronunciation. Convert to other data forms (such as the spectral data of the music section) and use other data forms to play the music section pronunciation. (The decoding result mentioned in this example refers to all the decoding results of the decoded music section)

以上用整步发音方式解码时是可从拼合信息的值里直接得到音素56(1),音素56(2),音素56(3),音素56(4),用于修改的音素,用于交织拼合的第1个音素,用于交织拼合的第2个音素的。When the above is decoded with the full-step pronunciation method, the phoneme 56 (1), phoneme 56 (2), phoneme 56 (3), phoneme 56 (4) can be directly obtained from the value of the combined information, and the phoneme used for modification is used for The first phoneme of interweaving and stitching is used for the second phoneme of interweaving and stitching.

如是采用分步发音方式,各拼合步骤和解码步骤定义如下(采用分步发音方式时,一般要事先定位拼合信息的值里各声道的数据在哪里,本例可在解码开始读编码头的数据后在拼合信息的值里找到两个转声道段,即以0010开头的段,其中一个转到声道1的转声道段的后面是声道1的数据,其中一个转到声道2的转声道段的后面是声道2的数据):If the step-by-step pronunciation method is adopted, the stitching steps and decoding steps are defined as follows (when the step-by-step pronunciation method is adopted, it is generally necessary to locate the data of each channel in the value of the stitching information in advance. In this example, you can read the encoding header at the beginning of decoding. After the data, find two channel-transfer segments in the value of the combined information, that is, the segment starting with 0010, one of which is transferred to channel 1, followed by the data of channel 1, and one of them is transferred to channel 1 2 is followed by channel 2 data):

解码步骤1:读编码头里的信息,找到拼合信息的值里声道1的数据的位置和声道2的数据的位置,再查看声道1的数据和声道2的数据,按其发音会产生哪些要拿去发音的被定位的音素及定位的类音素,以及它们发音占的时间,从而确定以下按分步发音方式发音时每步要作些什么,Decoding step 1: Read the information in the encoding header, find the position of the data of channel 1 and the position of the data of channel 2 in the value of the combined information, then check the data of channel 1 and the data of channel 2, and pronounce it according to its pronunciation Which localized phonemes and localized phonemes to be pronounced will be generated, as well as the time taken for their pronunciation, so as to determine what to do in each step of the following step-by-step pronunciation method,

拼合步骤1也即解码步骤2:从声道1的数据里的音素段的数据,得到音素56(1)的上下位置值1,再从这个音素段得到音素56(1)的音素编号,按此音素编号在音素数据库里找到此音素的记录项,从这记录项里得到音素56(1)里的音的构成情况,从以上得到的这些数据(音素56(1)的上下位置值1,音素56(1)里的音的构成情况),就可确定被定位的音素56(1)里的各音的实际的音高值及乐段发音时刻值及音长值,Combining step 1 is also decoding step 2: from the data of the phoneme segment in the data of channel 1, the upper and lower position value 1 of phoneme 56 (1) is obtained, and then the phoneme number of phoneme 56 (1) is obtained from this phoneme segment, press This phoneme numbering finds the record item of this phoneme in the phoneme database, obtains the composition situation of the tone in the phoneme 56 (1) from this record item, obtains these data (the upper and lower position value 1 of phoneme 56 (1) from above, Phoneme 56 (1) Li's sound composition situation), just can determine the actual pitch value of each tone in the phoneme 56 (1) li of being positioned and music section articulation time value and sound length value,

从声道2的数据里的第1个音素段的数据,得到音素56(3)的上下位置值1,再从这个音素段得到音素56(3)的音素编号,按此音素编号在音素数据库里找到此音素的记录项,从这记录项里得到音素56(3)里的音的构成情况,从以上得到的这些数据(音素56(3)的上下位置值1,音素56(3)里的音的构成情况),就可确定被定位的音素56(3)里的各音的实际的音高值及乐段发音时刻值及音长值,From the data of the first phoneme segment in the data of channel 2, get the upper and lower position value 1 of phoneme 56(3), and then get the phoneme number of phoneme 56(3) from this phoneme segment, press this phoneme number in the phoneme database Find the record item of this phoneme in this record item, and get the composition of the sound in phoneme 56(3) from this record item, from the data obtained above (the upper and lower position values of phoneme 56(3) are 1, in phoneme 56(3) The composition situation of the tone), just can determine the actual pitch value of each tone in the positioned phoneme 56 (3) and the music section pronunciation time value and the sound length value,

将从乐段拼合信息的值的编码头里得到的节奏信息及两个声道各自的调式及谱号信息等和前面得到的被定位的音素56(1)里的各音的实际音高值及乐段发音时刻值及音长值和前面得到的被定位的音素56(3)里的各音的实际音高值及乐段发音时刻值及音长值这些信息作为解码结果的部分,拿去发音,The actual pitch value of each sound in the positioned phoneme 56 (1) obtained earlier from the rhythm information obtained in the encoding header of the value of the segment combination information and the respective modes and clef information of the two sound channels And the actual pitch value of each tone in the positioned phoneme 56 (3) and the music section pronunciation time value and the sound length value obtained in front and the music section pronunciation time value and the sound length value as the part of the decoding result, take to pronounce,

拼合步骤2也即解码步骤3:接着从声道1的数据里的音素段的数据,得到音素56(2)的上下位置值1,再从这个音素段得到音素56(2)的音素编号,按此音素编号在音素数据库里找到此音素的记录项,从这记录项里得到音素56(2)里的音的构成情况,从以上得到的这些数据(音素56(2)的上下位置值1,音素56(2)里的音的构成情况),就可确定被定位的音素56(2)里的各音的实际的音高值及乐段发音时刻值及音长值,Combining step 2 is also decoding step 3: then from the data of the phoneme segment in the data of channel 1, the upper and lower position value 1 of the phoneme 56 (2) is obtained, and then the phoneme number of the phoneme 56 (2) is obtained from this phoneme segment, Find the record item of this phoneme in the phoneme database by this phoneme number, obtain the composition situation of the sound in the phoneme 56 (2) from this record item, these data obtained from above (the upper and lower position values of phoneme 56 (2) 1 , phoneme 56 (2) in the composition of the sound), just can determine the actual pitch value of each sound in the positioned phoneme 56 (2) and the value of the pronunciation time of the music section and the sound length value,

接着得到声道1的数据里的复制段的数据,将前面得到的被定位的音素56(2)里的各音复制一遍(复制时得到的音的音高及音长与被定位的音素56(2)里的音相比都不变),复制得到的一个自定位的类音素(这个自定位的类音素的上下位置值3就是音素56(2)的上下位置值1)里的1个音的实际的音高值和乐段发音时刻值和音长值(复制得到的这些信息可看成是复制得到的一个自定位的类音素),Then obtain the data of the copy section in the data of channel 1, and copy each tone in the positioned phoneme 56 (2) obtained in front (the pitch and the sound length of the sound obtained during duplication are the same as the positioned phoneme 56 (2) The sound in (2) is relatively unchanged), a self-positioned phoneme that is copied (the upper and lower position value 3 of this self-positioned phoneme is exactly the upper and lower position value 1 of the phoneme 56 (2)) in 1 The actual pitch value of the sound, the pronunciation time value and the sound length value of the music segment (the information obtained by copying can be regarded as a self-positioned phoneme obtained by copying),

接着在声道2的数据里的交织拼合段里,取出两个用于交织拼合的音素各自的音素编号,并从这交织拼合段里得到用于交织拼合的第1个音素的上下位置值1以对用于交织拼合的第1个音素进行定位,并从这交织拼合段里得到用于交织拼合的第2个音素的上下位置值1以对用于交织拼合的第2个音素进行定位,并按交织拼合方式把这两个被定位的音素拼合起来得一个自定位的类音素(这个类音素表示的是一个音组合的信息;假设这里是根据使用的这两个音素来相应按交织拼合的约定用这两个被定位的音素交织拼合得到这个类音素),这个自定位的类音素的上下位置值3(也即这个自定位的类音素里的最先发音的所有音中最低音的上下位置值,也即这个自定位的类音素里的最先发音的那个音的上下位置值)就是前述用于交织拼合的第1个音素的上下位置值1(这个自定位的类音素里的最先发音的所有音中最低音的上下位置值就等于前述用于交织拼合的第1个音素里的最先发音的所有音中最低音的上下位置值),这样最后得出的是这个自定位的类音素里各音的实际的音高值及乐段发音时刻值及音长值,Then, in the interleaving and stitching section in the data of channel 2, take out the phoneme numbers of the two phonemes used for interleaving and stitching, and get the upper and lower position values of the first phoneme for interleaving and stitching from this interleaving and stitching section. To locate the first phoneme used for interweaving and mosaicing, and obtain the upper and lower position value 1 of the second phoneme for interweaving and mosaicing from this interweaving and mosaic segment to position the second phoneme for interweaving and mosaicing, And put together these two positioned phonemes by interweaving and stitching together to get a self-positioned phoneme (this class phoneme represents the information of a sound combination; assuming that this is based on these two phonemes used to correspondingly stitch together by interweaving The convention is to use these two positioned phonemes to interweave and stitch together to get this phoneme), the upper and lower position value of this self-positioned phoneme is 3 (that is, the lowest of all the first sounds in this self-located phoneme) The upper and lower position values, that is, the upper and lower position values of the first pronounced sound in this self-positioned phoneme) are the upper and lower position values 1 of the first phoneme used for interweaving and stitching (the upper and lower position values in this self-positioned phoneme) The upper and lower position values of the lowest tone in all sounds that are pronounced at first are just equal to the upper and lower position values of the lowest tone in all the sounds that are first pronounced in the first phoneme that is used for interweaving and mosaicing), what finally draws like this is this self The actual pitch value of each sound in the positioned phoneme, the pronunciation time value and the sound length value of the segment,

将从乐段拼合信息的值的编码头里得到的节奏信息及两个声道各自的调式及谱号信息等和前面得到的被定位的音素56(2)里的各音的实际音高值及乐段发音时刻值及音长值和前面复制得到的各音的信息和前面得到的类音素里各音的实际音高值及乐段发音时刻值及音长值这些信息作为解码结果的部分,拿去发音,The actual pitch value of each sound in the positioned phoneme 56 (2) obtained in front with the rhythm information obtained from the code header of the value of the segment combination information and the respective mode and clef information of the two sound channels And the pronunciation time value and sound length value of the segment, the information of each sound copied earlier, the actual pitch value of each sound in the phoneme obtained earlier, the pronunciation time value of the segment and the sound length value, these information are used as part of the decoding result , take it to pronounce,

拼合步骤3也即解码步骤4:接着在声道1的数据里的修改段里,取出用于修改的音素的音素编号,再根据修改段里的用于修改的音素的上下位置值1及这音素里各音互相之间的音程关系数据以及从编码头得到的声道1的调式(C大调)和谱号信息,来得到这个被定位的用于修改的音素中各音的实际的音高值(也即音调值)及乐段发音时刻值及音长值,再根据修改段里的修改信息,对此被定位的音素里的音进行修改,即将其第2个发音位置的音抬高一个音步,这个音的音长值仍不变,对此被定位的音素进行此修改后得到一个自定位的类音素(这个类音素表示是一个音组合的信息),这样最后得出的是这个自定位的类音素中各音的实际的音高值及乐段发音时刻值及音长值,Stitching step 3 is also decoding step 4: Then, in the modification section in the data of channel 1, take out the phoneme number of the phoneme used for modification, and then according to the upper and lower position value 1 of the phoneme used for modification in the modification section and this Interval relationship data between each tone in the phoneme and the mode (C major) and clef information of channel 1 obtained from the encoding head to obtain the actual pitch of each tone in the positioned phoneme for modification High value (that is, the pitch value) and the pronunciation time value and the sound length value of the section, and then according to the modification information in the modification section, the sound in the positioned phoneme is modified, that is, the sound of its second pronunciation position is lifted One step higher, the sound length value of this sound is still unchanged, after this modification is carried out to this positioned phoneme, a self-positioned phoneme (this phoneme represents the information of a sound combination) is obtained, so that the final is the actual pitch value of each sound in this self-positioned phoneme, the pronunciation time value of the segment and the sound length value,

接着从声道2的数据里的第2个音素段的数据,得到音素56(4)的上下位置值1,再从这个音素段得到音素56(4)的音素编号,按此音素编号在音素数据库里找到此音素的记录项,从这记录项里得到音素56(4)里的音的构成情况,从以上得到的这些数据(音素56(4)的上下位置值1,音素56(4)里的音的构成情况),就可确定被定位的音素56(4)里的各音的实际的音高值及乐段发音时刻值及音长值,Then from the data of the second phoneme segment in the data of sound channel 2, the upper and lower position value 1 of the phoneme 56 (4) is obtained, and then the phoneme number of the phoneme 56 (4) is obtained from this phoneme segment, and the phoneme number is set according to this phoneme number Find the record item of this phoneme in the database, and get the composition of the sound in phoneme 56 (4) from this record item, these data obtained from above (the upper and lower position values of phoneme 56 (4), phoneme 56 (4) The composition situation of the tone in the ), just can determine the actual pitch value of each sound in the positioned phoneme 56 (4) and the music section pronunciation time value and the sound length value,

将从乐段拼合信息的值的编码头里得到的节奏信息及两个声道各自的调式及谱号信息等和前面得到的这个类音素中各音的实际音高值及乐段发音时刻值及音长值和前面得到的被定位的音素56(4)里的各音的实际音高值及乐段发音时刻值及音长值这些信息作为解码结果的部分,拿去发音,Combine the rhythm information obtained from the encoding header of the value of the segment combination information and the respective mode and clef information of the two channels with the actual pitch value of each tone in this type of phoneme obtained earlier and the value of the pronunciation time of the segment And sound length value and the actual pitch value of each tone in the positioned phoneme 56 (4) that obtains before and the music section pronunciation time value and the sound length value are used as the part of the decoding result, and are used for pronunciation.

以上3个拼合步骤发的音就形成了被编码乐段的全部发音(乐段就如图14所示),这里的分步发音方式是分3步。以上每个拼合步骤里,声道1和声道2都是同时发音,可在以上各拼合步骤里拿去发音的解码结果的部分里用BIT表示哪些是声道1的数据,哪些是声道2的数据。以上3个拼合步骤里涉及的发音是以由拾音器里的MCU按解码结果的部分用音乐合成方式使喇叭发音为例的,每个拼合步骤发音都是按在解码步骤1里从编码头里得到的节奏信息和两个声道各自的调式及谱号信息来发音的,拾音器里的MCU有音乐合成功能;以上3个拼合步骤里涉及的发音也可以是由MCU外接的音乐芯片按解码结果的部分用音乐合成方式使喇叭发音,并可在解码步骤1里把从编码头里得到的节奏信息和两个声道各自的调式及谱号信息由MCU传给MCU外接的音乐芯片以用于发音(这样一来,后面的解码步骤里由MCU传给MCU外接的音乐芯片的解码结果的部分里就可没有调式及谱号信息和节奏信息),MCU外接的音乐芯片有音乐合成功能。The sound that above 3 stitching steps sends has just formed the whole pronunciation of encoded music section (music section is just as shown in Figure 14), and the step-by-step pronunciation mode here is to divide 3 steps. In each of the above stitching steps, channel 1 and channel 2 are both pronounced at the same time, and BIT can be used in the part of the decoding result that is used for pronunciation in the above stitching steps to indicate which is the data of channel 1 and which is the channel 2 data. The pronunciation involved in the above 3 stitching steps is based on the example of the MCU in the pickup using the music synthesis method to make the speaker sound according to the decoding result. The pronunciation of each stitching step is obtained from the encoding head in the decoding step 1. The rhythm information of the two channels and the respective mode and clef information of the two channels are used to pronounce. The MCU in the pickup has a music synthesis function; the pronunciation involved in the above three stitching steps can also be decoded by an external music chip connected to the MCU. Part of the speaker is made to sound by music synthesis, and the rhythm information obtained from the encoding head and the respective mode and clef information of the two channels can be sent from the MCU to the external music chip of the MCU in the decoding step 1 for pronunciation (In this way, in the following decoding steps, the part of the decoding result passed from the MCU to the music chip connected to the MCU may not have mode and clef information and rhythm information), and the music chip connected to the MCU has a music synthesis function.

因为播放音乐时是要精确确定各个音在什么时刻发音的,所以解码采用分步发音方式时,一般在MCU里需要作记时,以便在各合适时刻作发各步音的动作。Because when playing music, it is necessary to accurately determine when each sound is pronounced, so when the decoding adopts a step-by-step pronunciation method, it is generally necessary to record the time in the MCU, so that the action of each step sound can be performed at each appropriate moment.

以上用分步发音方式解码时是可从拼合信息的值里直接得到音素56(1),音素56(2),音素56(3),音素56(4),用于修改的音素,用于交织拼合的第1个音素,用于交织拼合的第2个音素的。When the above is decoded with the step-by-step pronunciation method, the phoneme 56 (1), phoneme 56 (2), phoneme 56 (3), and phoneme 56 (4) can be directly obtained from the value of the combined information, and the phoneme used for modification is used for The first phoneme of interweaving and stitching is used for the second phoneme of interweaving and stitching.

本例所说的解码结果是指解码得到的乐段的全部解码结果。The decoding result mentioned in this example refers to all the decoding results of the music section obtained by decoding.

本解码举例中以上整步发音方式和分步发音方式里各拼合步骤得到的音的乐段发音时刻值,是按拍来计量的(且是以4分音符为一拍的),即某一音的乐段发音时刻值,比如可以是2拍,3拍加4分之3拍等,这些拍数都是从乐段最开始发音算起(乐段最开始发音的音的发音时刻值为0),以上解码举例中按整步发音方式和分步发音方式解码时每个声道里的每个类音素或被定位的音素里各音的乐段发音时刻值的计算方式可参看本说明书第28页段落开头是‘至于前面说到的解码时’的那一段所述。(本段所说的拍都是指被编码乐段的实际拍)In this decoding example, the music segment pronunciation time value of the tone obtained in the above whole-step pronunciation mode and the step-by-step pronunciation mode is measured by beat (and is a beat with a quarter note), that is, a certain The pronunciation time value of the music section, for example, can be 2 beats, 3 beats plus 3/4 beats, etc. These beats are all calculated from the beginning of the music section (the pronunciation time value of the music section's first pronunciation is 0), in the above decoding example, when decoding according to the full-step pronunciation mode and the step-by-step pronunciation mode, please refer to this manual for the calculation method of the pronunciation time value of each tone in each class phoneme in each channel or in the positioned phoneme The paragraph on page 28 begins with the paragraph 'As for the decoding time' mentioned above. (The beat mentioned in this paragraph refers to the actual beat of the coded section)

本说明书以上各解码举例(解码举例1和解码举例2)里提到的音长值,是按乐段的实际拍来计量的,且是以4分音符为一实际拍的。The sound length values mentioned in the above decoding examples (decoding example 1 and decoding example 2) in this manual are measured according to the actual beat of the music section, and the quarter note is taken as an actual beat.

本说明书以上各解码举例里(解码举例1和解码举例2)提到的‘各音的实际的音高值及乐段发音时刻值及音长值’这样的说法,其中‘各音’是修饰‘实际的音高值及乐段发音时刻值及音长值’的;对于‘各音的实际的音高值和乐段发音时刻值和音长值’也是与此类似的。In the above decoding examples (decoding example 1 and decoding example 2) of this manual, the phrases "the actual pitch value of each sound, the pronunciation time value of the music section and the sound length value" are mentioned, in which "each sound" is a modified The 'actual pitch value, music section pronunciation time value and sound length value'; for the 'actual pitch value of each tone, music section pronunciation time value and sound length value' is also similar to this.

本说明书以上各解码举例(解码举例1和解码举例2)里提到的被定位的音素(是根据上下位置值1进行定位的)里的音的实际音高值,是由前述上下位置值1和这被定位的音素里各音互相之间的音程关系数据和从编码头得到的此被定位的音素所处声道的调式(C大调)和谱号信息来确定的音调值,本说明书以上各解码举例里提到的解码时得到的自定位的类音素里的音的实际音高值,是由解码时得到的自定位的类音素的上下位置值3和这自定位的类音素里各音互相之间的音程关系数据和从编码头得到的此自定位的类音素所处声道的调式(C大调)和谱号信息来确定的音调值。The actual pitch value of the sound in the positioned phoneme (positioned according to the upper and lower position value 1) mentioned in the above decoding examples (decoding example 1 and decoding example 2) in this manual is determined by the aforementioned upper and lower position value 1 and the pitch value determined by the interval relationship data between the tones in the located phoneme and the mode (C major) and clef information of the channel where the located phoneme is located obtained from the coding head. The actual pitch value of the sound in the self-positioned phoneme-like obtained during decoding mentioned in the above decoding examples is the upper and lower position value 3 of the self-located phoneme-like obtained during decoding and the self-located phoneme-like The interval relationship data between each tone and the pitch value determined from the mode (C major) and clef information of the channel where the self-localized phoneme is located obtained from the coding head.

以上解码举例2整步发音方式和分步发音方式里,交织拼合时得到自定位的类音素里各音的实际的音高值是这样得到的:由用于交织拼合的第1个音素的上下位置值1和用于交织拼合的第2个音素的上下位置值1,可得到被定位的这第1个音素中最先发音的所有音中最低音与被定位后的这第2个音素中最先发音的所有音中最低音之间的音程关系值,再结合被定位的这第1个音素里各音互相之间的音程关系值和被定位的这第2个音素里各音互相之间的音程关系值,而得到交织拼合得到的自定位的类音素里各音互相之间的音程关系值,再按交织拼合得到的自定位的类音素的上下位置值3,即得到这个自定位的类音素里各音的实际的音高值。In the above decoding example 2 full-step pronunciation mode and step-by-step pronunciation mode, the actual pitch value of each sound in the self-positioned phoneme obtained during interweaving and stitching is obtained in this way: by the upper and lower of the first phoneme used for interweaving and stitching The position value 1 and the upper and lower position value 1 of the second phoneme used for interweaving and stitching can get the lowest tone of all the first sounds in the first phoneme being positioned and the second phoneme after being positioned The interval relationship value between the lowest notes in all the sounds that are pronounced first, combined with the interval relationship value between the sounds in the first phoneme that is positioned and the relationship between the sounds in the second phoneme that is positioned Interval relation value between, and obtain the interval relation value between each tone in the self-positioned phoneme obtained by interweaving and mosaicing, and then press the upper and lower position value 3 of the self-positioning phoneme that is obtained by interweaving and mosaicing, obtain this self-positioning The actual pitch value of each tone in the class phoneme.

本申请文件关于声道的定义,与平常的定义方式相同的,即一系列音高比较接近,互相构成和弦关系的音属于一个声道,比如图13,处于上面五条线线上或上面五条线之间的音就属于同一声道(主声道),上面五条线上方的音也属于这个声道,处于下面五条线线上或下面五条线之间的音就属于另外一声道(副声道),下面五条线下方的音也属于这个声道,而对于处于上面五条线和下面五条线之间的音,如其是与上面声道的音构成和弦关系的那它就属于上面声道,如其是与下面声道的音构成和弦关系的那它就属于下面声道,也可根据编解码处理的方便将其归入上面声道或下面声道。当然对于大型音乐作品的总谱来说,如在相同的音域,由两把提琴分别奏不同的旋律,那这两把提琴各自奏出的音一般应归入各自不同的声道。本段所述是音乐常识。The definition of the vocal tract in this application document is the same as the usual definition, that is, a series of pitches are relatively close, and the tones that form a chord relationship with each other belong to a vocal tract, such as Figure 13, which is on the upper five lines or the upper five lines The sound between them belongs to the same channel (main channel), the sound above the five lines above also belongs to this channel, and the sound between the five lines below or between the five lines below belongs to another channel (secondary sound). Road), the sound below the five lines below also belongs to this channel, and for the sound between the five lines above and the five lines below, if it forms a chord relationship with the sound of the above channel, it belongs to the above channel, If it forms a chord relationship with the sound of the lower channel, it belongs to the lower channel, and it can also be classified into the upper channel or the lower channel according to the convenience of codec processing. Of course, for the score of a large-scale musical work, if two violins play different melodies in the same range, the sounds played by the two violins should generally be classified into different channels. This paragraph is about music common sense.

对于码字这边,一般不需用BIT位来表示码字用的是什么码种(条形码,导电码等),因为从码字的物理特性就可判断其是什么码种(条形码,导电码等),比如条形码,其条空宽度比有现成规定,一般的条形码码制还有自校验功能,所以读码头取得条空宽度比数值后,结合校验信息即可判断是条形码,再如2006102010785号申请文件中描述的导电码,其除了可被机电式读码头读取外,也可被光电式读码头读取,被光电式读码头读取的导电码的码线区的条空宽度一般相同,而且一般遵循通断代名码数的条空格式,所以也可依此判断出是导电码;所以一般在码字上不用开BIT表示是什么码种,当然如在码字前面开几个BIT以表示是什么码种(条形码,导电码等)也是可以的。然后是码字表示的值的问题,值里主要是有关乐段拼合信息的值的信息,至于码字里还含有其它信息(如显示信息)的,一般也应统一规划码字里码值的形式。统一规划码字里的码值的形式,是为了码字可被各种拾音器,多种读码头读取。在码值里,还可以根据具体情况增删一些信息并作统一或兼容格式定义,本段所述仅作参考。For the code word, it is generally not necessary to use the BIT bit to indicate what type of code (bar code, conductive code, etc.) the code word uses, because it can be judged from the physical characteristics of the code word (bar code, conductive code, etc. etc.), such as barcodes, the bar-space width ratio has ready-made regulations, and the general barcode code system also has a self-verification function, so after reading the terminal to obtain the bar-space width ratio value, it can be judged as a barcode by combining the verification information, and another example The conductive code described in the application document No. 2006102010785 can not only be read by the electromechanical reading dock, but also can be read by the photoelectric reading dock. The width of the bar space of the code line area of the conductive code read by the photoelectric reading dock It is generally the same, and generally follows the bar-space format of the number of on-off code names, so it can also be judged that it is a conductive code; therefore, it is generally not necessary to open BIT on the code word to indicate what kind of code it is, of course, if you open a few digits in front of the code word A BIT to indicate what type of code (barcode, conductive code, etc.) is also possible. Then there is the question of the value represented by the codeword. The value mainly contains the information about the value of the music segment combination information. As for the codeword that also contains other information (such as display information), the code value in the codeword should also be planned uniformly. form. The form of the code value in the unified planning codeword is to allow the codeword to be read by various pickups and multiple reading docks. In the code value, some information can also be added or deleted according to the specific situation and defined in a unified or compatible format. The description in this paragraph is for reference only.

可将解码程序和解码数据存放在拾音器内的存储器里,可让各有声音乐读物整页面上诸个码字中的全部或部分码字中各个码字的值表示的信息是与拼合信息的值有关的信息,用此拾音器即可读这些码字并解码播放出音乐。一个拾音器可通读各种其整页面上有这种码字的有声音乐读物。这里有声音乐读物整页面上各单个码字可以是或含有或可转换得到数个乐段拼合信息的值(或一个乐段拼合信息的值的部分,即把一个乐段拼合信息的值的BIT放在多个码字里)等,各单个码字的值表示的信息也是与拼合信息的值有关的信息。The decoding program and decoding data can be stored in the memory of the pickup, so that the information represented by the value of all or part of the codewords in the codewords on the entire page of each audio music book is the value of the combined information For relevant information, these code words can be read and decoded to play music with this pickup. A pickup can read through various audible music books that have such codewords on their entire pages. Here each single code word on the whole page of the audio music book can be or contain or can be converted to obtain the value of several pieces of music piece together information (or the part of the value of one piece of piece together information, that is, the BIT of the value of one piece of piece together information placed in multiple codewords), etc., the information represented by the value of each single codeword is also information related to the value of the spliced information.

对于使用乐谱拼合方式的有声音乐读物及拾音器,只要各有声音乐读物的码字里乐段拼合信息的值涉及的编码规则是相同或兼容的,各拾音器的解码程序是按照相同或兼容的解码规则来解码且解码规则与各有声音乐读物的码字里乐段拼合信息的值涉及的编码规则是专门对应的,即可做到各有声音乐读物与各拾音器的通读。上一段提到的‘一个拾音器可通读各种其整页面上有这种码字的有声音乐读物’一般是要在满足本段以上所说的条件的前提下才成立的。For audio music books and pick-ups that use the music score splicing method, as long as the coding rules involved in the value of the segment splicing information in the codeword of each audio music book are the same or compatible, the decoding procedures of each pickup are based on the same or compatible decoding rules To decode and the decoding rules are specially corresponding to the encoding rules involved in the value of the segment combination information in the code word of each audio music book, so that the full reading of each audio music book and each pickup can be achieved. The above mentioned "a pickup can read all kinds of audio music books with this code word on its entire page" is generally just established under the premise of satisfying the conditions mentioned above in this paragraph.

(本申请文件各处提到的某乐段拼合信息的值涉及的编码规则,是指用以编码生成此乐段拼合信息的值的编码规则)(The coding rules involved in the value of a piece of combination information mentioned in this application document refer to the coding rules used to encode and generate the value of this piece of combination information)

本申请文件涉及的有声音乐读物的整页面上的码字的形式,可以是常规的,即是直接在作为有声音乐读物主体的读物的页面上印刷条形码,或直接在读物页面上印刷或涂敷导电物质形成导电码,等,可称这样形成的码字(可以是条形码,导电码,图形码或磁码等)的载体是读物的页,码字的载体平面是读物的页面;另外也可在一小块平面材料的表面上形成数个码字(码字可以是条形码,导电码,图形码或磁码等)并把此小块平面材料粘贴到作为有声音乐读物主体的读物的页面上,即以这一小块平面材料作为码字的载体,此平面材料的表面作为码字的载体平面(如码字是导电码,那一般要求平面材料的表面不导电);可以以一小块纸作为一小块平面材料的例子,这一小块纸的形状可为长方形或其它形,面积能容下一个或多个码字(导电码或条形码等)即可,再采用本说明所述及的导电码形成方式,采用印刷导电油墨或涂敷导电物质等方式,在这一小块纸表面上形成导电码码字,或在这一小块纸表面上印刷条形码,或是形成其它种类的码字,在这一小块纸上可形成一个或多个码字(多个码字中各码字可以是或不是同一码种的),再把这一小块纸粘贴到读物页面上,对要布置到读物整页面上的其它码字都可按这样形成到小块纸上并粘贴到读物相应页面上,这也是一种有关读物页面上的码字的形式;一种等同替换形式,是把读物各页面用的许多个码字,一齐形成(比如印刷)到一张或数张大的纸张上,再剪成一些小块纸,每小块纸表面上含有一个或多个码字(码字可以任一码种的码字;多个码字中各码字可以是或不是同一码种的),再把各小块纸贴到读物各页面其需要发音乐的地方;还可采用其它形式,比如可把各小块表面没有码字的纸先粘贴到读物各页面上,再在各小块纸的表面上采取印刷或涂敷等方式形成一个或多个码字。采用本段述的粘贴小块纸方式的码字形式后,如印在小块纸上的是导电码,其码字仍可以是2006102010785号发明申请里述的甲方式导电码或乙方式导电码,用于读码的甲方式或乙方式读码头的结构及其读码的原理过程显然仍可按2006102010785号发明申请里所述;采用粘贴小块纸方式,如印在小块纸上的是其它码种的码字比如条形码等,其读码方式显然与将其印刷在读物页面上的方式是一样的。此方式类似于百货商场里,把条形码打印到一小块纸质标签上,再把标签粘贴到商品表面。(本申请文件各处所说的码字的载体平面,显然是指码字的载体的一个平面,而且此码字处于这个平面上)The form of the code word on the entire page of the audio music book involved in this application document can be conventional, that is, the barcode is directly printed on the page of the book as the main body of the audio book, or printed or coated directly on the page of the book Conductive substance forms conductive code, etc., the carrier of the code word (can be bar code, conductive code, graphic code or magnetic code etc.) that can claim to form like this is the page of reading material, and the carrier plane of code word is the page of reading material; Form several codewords on the surface of a small piece of planar material (codewords can be barcodes, conductive codes, graphic codes or magnetic codes, etc.) and paste this small piece of planar material on the page of the reading as the main body of the audio music book , that is, use this small piece of planar material as the carrier of the codeword, and the surface of this planar material is used as the carrier plane of the codeword (as the codeword is a conductive code, that generally requires the surface of the planar material to be non-conductive); Paper is used as an example of a small piece of planar material. The shape of this small piece of paper can be rectangular or other, and the area can accommodate one or more code words (conductive code or barcode, etc.). And the conductive code formation method is to use printing conductive ink or coating conductive substances to form conductive code characters on the surface of this small piece of paper, or to print barcodes on the surface of this small piece of paper, or to form other types. One or more codewords can be formed on this small piece of paper (each codeword can be or not of the same code in multiple codewords), and this small piece of paper is pasted on the reading material page , other codewords to be arranged on the entire page of the reading material can be formed on a small piece of paper and pasted on the corresponding page of the reading material in this way, which is also a form of the codeword on the page of the relevant reading material; a kind of equivalent replacement form , is to form (for example, print) many codewords used on each page of the reading material on one or several large sheets of paper, and then cut them into small pieces of paper, each small piece of paper contains one or more codewords on the surface (the code word can be the code word of any code word; Each code word can be or not the same code type in a plurality of code words), then each small piece of paper is pasted to the place where it needs to send music on each page of the reading material; Adopt other forms, such as the paper that does not have codewords on the surface of each small piece can be pasted on each page of the reading material earlier, and then take printing or coating on the surface of each small piece of paper to form one or more codewords. After adopting the code word form of pasting the small piece of paper mode described in this paragraph, if it is a conductive code printed on the small piece of paper, its code word can still be the first mode conductive code or the second mode conductive code in No. 2006102010785 invention application The structure and the principle process of reading the wharf and the reading code of the method A or method B used for code reading can obviously still be described in the invention application No. 2006102010785; adopt the method of pasting small pieces of paper, such as printing on the small piece of paper is The code word of other code kind is such as bar code etc., and its mode of code reading is obviously the same with the mode that it is printed on the reading material page. This method is similar to that in a department store, where the barcode is printed on a small piece of paper label, and then the label is pasted on the surface of the product. (the carrier plane of the codeword mentioned throughout the application documents obviously refers to a plane of the carrier of the codeword, and this codeword is on this plane)

本申请文件各处所指的有声音乐读物的整页面上的任一码字的形式,可以是直接把有声音乐读物的页面(也即作为有声音乐读物主体的读物的页面)当成载体平面的码字的形式(比如在读物页面上直接印刷条形码),也可以是如上段述的一小块平面材料的表面上有码字(即以这一小块平面材料作为码字的载体,这一小块平面材料的表面上只有此码字或有包括此码字在内的多个码字,这里说的多个码字中各码字可以是同一码种或不是同一码种的)且这一小块平面材料是粘贴在读物的页面上的形式;本段和上一段所说的码种是指条形码,导电码,图形码或磁码等;而且对于有声音乐读物来说,这里说的两种码字的形式(直接把读物页面当成载体平面的码字的形式与小块平面材料的表面上有码字且这一小块平面材料是粘贴在读物页面上的形式)互相可看成是等同替换。且一本有声音乐读物的整页面上可只有这两种形式的码字中的一种码字,也可以同时存在这两种形式的码字。The form of any code word on the entire page of the audio music reading referred to in this application document can be a code that directly regards the page of the audio music reading (that is, the page of the reading as the main body of the audio music reading) as the carrier plane. The form of words (such as directly printing barcodes on the pages of reading materials), can also be that there are code words on the surface of a small piece of plane material as mentioned in the above paragraph (that is, use this small piece of plane material as the carrier of code words, and this small piece of plane material can be used as the carrier of code words. There is only this code word or a plurality of code words including this code word on the surface of the block planar material, and each code word can be the same code type or not the same code type in the multiple code words mentioned here) and this Small pieces of plane material are in the form of pasting on the pages of reading materials; the code types mentioned in this paragraph and the previous paragraph refer to bar codes, conductive codes, graphic codes or magnetic codes, etc.; The form of a code word (directly regard the page of the reading material as the form of the code word of the carrier plane and the surface of the small piece of plane material with code words and this small piece of plane material is the form pasted on the page of the reading material) can be regarded as each other equivalent replacement. And the whole page of an audio music book can only have a kind of code word in the code word of these two forms, also can exist the code word of these two forms simultaneously.

有声音乐读物的一些特殊地方:Some special places for audiobooks:

对于有声音乐读物的音乐发音,涉及到的与语音发音相同的,有发音速度问题,不同于语音发音的,音乐可以调整调式,节奏,还可考虑使用不同的音色,比如钢琴,扬琴音色;对于这几种信息,一方面可以记录在码字的值里的乐段拼合信息的值里,演奏时根据这些信息来设定演奏的速度,调式,音色等(在一般具音乐功能的MCU里都有设置这些演奏参数的功能),再是在拾音器这边(在拾音器上),可加一些键,比如速度增减键,音程增减键(将演奏音调加8度或减8度),音色改换键(改换音色),在演奏音乐过程中按这些键来调整演奏参数。For the music pronunciation of audio music books, it involves the same as the voice pronunciation, and there is a problem of pronunciation speed. It is different from the voice pronunciation. The music can adjust the mode and rhythm, and you can also consider using different timbres, such as piano and dulcimer timbres; for These types of information, on the one hand, can be recorded in the value of the piece combination information in the value of the code word, and the speed, mode, tone, etc. of the performance are set according to these information during performance (in the general MCU with music function, all There are functions to set these performance parameters), and then on the pickup side (on the pickup), some keys can be added, such as speed increase and decrease keys, interval increase and decrease keys (to increase or decrease the playing tone by 8 degrees), timbre Change keys (change timbre), press these keys to adjust performance parameters during playing music.

以下实施例中涉及到的单片机芯片和音乐芯片具备多声道音乐功能的,是指其按音乐合成方式播音乐时(比如按拼合信息的值解码得到的乐段谱数据播音乐时)是播多声道音乐的,此也是技术常识。The single-chip microcomputer chip and the music chip involved in the following embodiments have multi-channel music functions, which means that when they play music in a music synthesis mode (such as when playing music by decoding the music piece spectrum data obtained by decoding the value of the combined information) For multi-channel music, this is also technical common sense.

拾音器实施例1:(使用光电式读码头的拾音器)Pickup embodiment 1: (use the pickup of the photoelectric reading dock)

使用附图2使用光电式读码头的拾音器的电路,其读码头使用光笔,主MCU芯片3是SNC710,其配有24个IO脚,具备按音乐合成方式播音乐的功能(四声道),32K*16BIT的内部数据及程序存储器;此芯片还可外接存储器。此实施例SNC710带两个SR008(每个含8MBIT存储空间),其中一个是程序存储器55(可把SNC710的系统主控及读码程序存放在此存储器里,此存储器固定布置在拾音器内部),另外一个是外接数据存储器6。发音器件为喇叭。光笔是成熟共知技术,不属于本发明内容,实施可参考公知电路,利用其光电原理读出码字的黑白条空值(比如读条形码,光笔读条形码的原理也属于公知技术),这里给出的是一个参考电路,以读条形码为例(这里假定印刷条形码的油墨,是黑色的,条形码的载体平面也即有声音乐读物的各页面是白色的,条形码的颜色与载体平面的颜色不同以便于光笔读取),所读的码字除了可以是条形码外,也可以是导电码等。Use accompanying drawing 2 to use the circuit of the pick-up of photoelectric type reading pier, its reading pier uses light pen, main MCU chip 3 is SNC710, and it is equipped with 24 IO pins, possesses the function (four channels) of playing music by music synthesis mode, 32K*16BIT internal data and program memory; this chip can also be connected with external memory. This embodiment SNC710 has two SR008s (each containing 8MBIT storage space), one of which is a program memory 55 (the system master control and code reading program of SNC710 can be stored in this memory, and this memory is fixedly arranged inside the pickup), The other is an external data memory 6 . The pronunciation device is a horn. The light pen is a mature and well-known technology, which does not belong to the content of the present invention. The implementation can refer to the known circuit, and use its photoelectric principle to read the black and white bar null values of the codeword (such as reading barcodes, and the principle of light pen reading barcodes also belongs to the known technology). What is shown is a reference circuit, taking barcode reading as an example (here it is assumed that the printing ink of the barcode is black, and the carrier plane of the barcode, that is, each page of the audio music book is white, and the color of the barcode is different from the color of the carrier plane so that For reading with a light pen), the read codeword can also be a conductive code, etc. in addition to a barcode.

读码时,使光笔与条形码接触(或接近)并拖动光笔沿水平方向匀速扫过条形码,D1是发光二极管用于照射码字,D2是光敏二极管,用于感受从码字的码线回来的光(条形码上有油墨的地方是黑的不反射光,无油墨的地方是白的反射光,这是常识),O1运放是信号的第1级放大,O2运放组成滞回比较器,用于信号整形成方波输出,其中Rx的阻值可以调节,D4,D5是稳压管,Qb组成开关回路,提供适合MCU读取的方波输出。这个实施例,将解码的工作放在MCU里面进行,光笔的输出在Out那里是方波输出,方波波形的波峰波谷宽度比与条形码的条空宽度比是一样的,这样就可以在MCU里记录从P12口进来的方波波形的每个波峰的高电平持续时间及每个波谷的低电平持续时间,即可得到条形码的条空布置情况,然后根据条形码的条空布置情况,用软件解出其最后码值来,当然软件里一般需有对从P12脚进来的方波的波形误差等的处理;对高低电平持续时间的计时可使用MCU的中断来做。When reading the code, make the light pen touch (or close to) the bar code and drag the light pen to scan the bar code at a constant speed in the horizontal direction. D1 is a light-emitting diode for illuminating the code word, and D2 is a photodiode for feeling the return from the code line of the code word. (the place with ink on the barcode is black and non-reflected light, and the place without ink is white reflected light, which is common sense), the O1 op-amp is the first stage amplification of the signal, and the O2 op-amp forms a hysteresis comparator , used for signal shaping to form a square wave output, where the resistance of Rx can be adjusted, D4 and D5 are regulator tubes, and Qb forms a switch circuit to provide a square wave output suitable for MCU reading. In this embodiment, the decoding work is carried out in the MCU, and the output of the light pen is a square wave output at Out. Record the high-level duration of each peak and the low-level duration of each trough of the square wave waveform coming in from the P12 port, and you can get the bar-space layout of the barcode, and then according to the bar-space layout of the barcode, use The software solves the final code value. Of course, the software generally needs to deal with the waveform error of the square wave coming in from the P12 pin; the timing of the high and low level duration can be done using the interrupt of the MCU.

本实施例的外接数据存储器6,是图中的右边的SR008,The external data memory 6 of this embodiment is SR008 on the right side of the figure,

(这里的MCU型号选择仅供参考,还可选用其它型号)。(The MCU model selection here is for reference only, and other models can also be used).

可以把定义好的用于解码的音素数据库及其它解码数据,以及解码程序存放在外接数据存储器6里,外接数据存储器6与MCU有通讯线连接,程序存储器55里存放SNC710的运行程序,程序存储器55与MCU有通讯线连接,有声音乐读物(是如图1所示的有声音乐读物)整页面上只有一些条形码,有声音乐读物整页面上的各单个条形码的值是一个乐段拼合信息的值的部分(每个乐段拼合信息的值是分开放在多个码字里的),此有声音乐读物整页面上的每个条形码都可被此拾音器读取以用于此拾音器解码播音乐,此有声音乐读物整页面上的所有条形码的集合是此有声音乐读物整页面上的诸个码字,也是诸个条形码,可把此有声音乐读物的整页面上的诸个码字中的所有条形码定义成此拾音器能读的部诸码字,使用时,用此拾音器读此有声音乐读物整页面上的诸个码字中的部诸码字中的多个条形码以得到乐段拼合信息的值并按外接数据存储器6里的解码程序及解码数据对乐段拼合信息的值进行解码播音乐即可。The phoneme database and other decoding data defined for decoding can be stored in the external data memory 6, and the external data memory 6 is connected with the MCU by a communication line, and the program memory 55 stores the operating program of the SNC710, and the program memory 55 is connected with the MCU by a communication line. There are only some barcodes on the entire page of the audio music book (the audio music book shown in Figure 1), and the value of each single barcode on the entire page of the audio music book is the value of a piece of combined information (The value of each segment's stitching information is separately placed in multiple codewords), each barcode on the entire page of this audio music book can be read by this pickup for this pickup to decode the playback music, The collection of all the barcodes on the whole page of the audio music book is the code words on the whole page of the audio music book, and also the bar codes, all the bar codes in the code words on the whole page of the audio music book can be combined It is defined as all the codewords that can be read by the pickup. When in use, use the pickup to read multiple barcodes in some of the codewords on the entire page of the audio music book to obtain the value of the music segment splicing information And according to the decoding program and the decoding data in the external data memory 6, the value of the music piece combination information is decoded and the music is played.

本实施例的外接数据存储器6和程序存储器55也是属于本实施例所说的拾音器的一部分。本实施例所提到的拾音器,是包括图2中所有部分的。The external data memory 6 and the program memory 55 of the present embodiment also belong to the said pickup of the present embodiment. The pickup mentioned in the present embodiment includes all parts in Fig. 2 .

本实施例的光笔还可由其它种类的光笔代替,也可由其它类型的光电式读码头代替,比如ccd读码头(ccd读码头在读码时不用拖动扫过码线区)。The light pen of the present embodiment can also be replaced by other kinds of light pens, and can also be replaced by other types of photoelectric reading docks, such as ccd reading docks (ccd reading docks do not need to be dragged to scan the code line area when reading codes).

拾音器实施例2:(使用MCU外接的音乐芯片)Pickup embodiment 2: (using an external music chip connected to the MCU)

如附图3,使用MCU外接的音乐芯片的拾音器的电路,MCU3型号仍为SNC710,其配有24个IO脚,32K*16BIT的数据及程序存储器;此芯片还可外接存储器;可把其系统主控,读码及通讯程序放在SNC710内部;发音器件为带喇叭的SNC82030,45是光笔读码头。光笔是成熟共知技术,不属于本发明内容,实施可参考公知电路,利用其光电原理读出码字的黑白条空值。本实施例以读条形码为例,所读的码字除了可以是条形码外,也可以是导电码。本实施例涉及的有声音乐读物以图7下半部分所示为例,其整页面上的只有一些条形码,其整页面上的所有条形码中每个条形码的码值是一个乐段拼合信息的值,此有声音乐读物整页面上的每个条形码都可被此拾音器读取以用于此拾音器解码播音乐,此有声音乐读物整页面上的所有条形码是此有声音乐读物整页面上的诸个码字。本实施例的发音过程是由SNC710把解码得到的乐段谱数据一个一个字节顺序传给SNC82030,由SNC82030一边接收数据一边播放音乐(因没有足够的RAM空间,所以不能一次把一个乐段数千字节的谱数据全传给SNC82030);SNC82030是专用于根据乐段谱数据播放音乐的芯片(更具体的可称其为播放MIDI音乐的芯片),12个音乐声道,16个IO脚,96K*12BIT存储空间,芯片内可存储用户程序;本实施例可做成这样:在播放音乐时,由SNC710将解码得到的乐段谱数据通过3根通讯线(从P08,P09,P010接出)不断传给SNC82030,SNC82030得到数据后放入一个先进先出RAM暂存区,并不断在暂存区里取乐段谱数据的BIT用于播放多声道音乐;两个芯片的通讯协议自定,并且在两个芯片里应有相应的通讯程序,SNC82030的播放音乐的程序要做成可以取暂存区数据以播放音乐。As shown in Figure 3, the pickup circuit using the music chip external to the MCU, the MCU3 model is still SNC710, which is equipped with 24 IO pins, 32K*16BIT data and program memory; this chip can also be connected to an external memory; The main control, code reading and communication programs are placed inside the SNC710; the pronunciation device is the SNC82030 with a speaker, and 45 is the light pen reading terminal. The light pen is a mature and well-known technology, which does not belong to the content of the present invention. The implementation can refer to the known circuit, and utilize its photoelectric principle to read the black and white bar null values of the codeword. In this embodiment, barcode reading is taken as an example, and the read codewords may not only be barcodes, but also conductive codes. The audible music book that present embodiment relates is shown in the lower part of Fig. 7 as an example, and there are only some barcodes on its whole page, and the code value of each barcode in all barcodes on its whole page is the value of a music section splicing information , each barcode on the entire page of the audio music book can be read by the pickup for the pickup to decode playing music, and all the barcodes on the entire page of the audio music book are the codes on the entire page of the audio music book Character. The pronunciation process of the present embodiment is that SNC710 sends the music piece spectrum data that decoding obtains byte order one by one to SNC82030, plays music while receiving data by SNC82030 (because there is not enough RAM space, so can't once a piece number Kilobytes of spectrum data are all transmitted to SNC82030); SNC82030 is a chip dedicated to playing music according to the spectrum data of the music section (more specifically, it can be called a chip for playing MIDI music), 12 music channels, and 16 IO pins , 96K*12BIT storage space, user program can be stored in the chip; Present embodiment can be made like this: when playing music, by SNC710, the music section spectrum data obtained by decoding is passed through 3 communication wires (connecting from P08, P09, P010 output) to the SNC82030 continuously, and the SNC82030 puts the data into a FIFO RAM temporary storage area after getting the data, and continuously fetches the BIT of the music spectrum data in the temporary storage area to play multi-channel music; the communication protocol of the two chips is automatic And there should be a corresponding communication program in the two chips, the music playing program of SNC82030 should be made to be able to fetch data from the temporary storage area to play music.

这里的MCU3型号选择仅供参考,还可选用其它型号。The MCU3 model selection here is for reference only, and other models are also available.

本实施例的外接数据存储器6,是图中的SR008(8M BIT存储空间)。The external data memory 6 of the present embodiment is SR008 (8M BIT storage space) among the figure.

本实施例是把用于拼合信息的值解码的解码程序及解码数据存放在外接数据存储器6上(因SNC710内部存储器较少),外接数据存储器6与SNC710有通讯线连接;有声音乐读物(如图7下半部分所示)整页面上的所有条形码中各单个条形码的值是一个乐段拼合信息的值。拾音器的SNC710从P07口输入的波形得到读码值后(光笔45的硬件结构及读条形码原理可参考公知技术,光笔45的硬件结构也可如图2所示光笔的硬件结构,读条形码原理及过程也可参考前面的实施例1对用光笔读条形码的叙述),再从此条形码的值中取得乐段拼合信息的值。本来此拾音器能读此有声音乐读物的整页面上的诸个码字中的每个条形码且可按码值里的拼合信息的值解码播放乐段发音,可把这诸个码字中的所有条形码定义成此拾音器能读的部诸码字,使用时,用此拾音器读此有声音乐读物整页面上诸个码字中的这部诸码字中的某个码字(条形码)并按解码程序及解码数据对码字中的乐段拼合信息的值进行解码得到乐段谱数据,并将乐段谱数据发送给SNC82030播放此乐段的发音。(关于SNC710读取SR008的方式及协议,可参考它们的芯片资料)Present embodiment is to store the decoding program and the decoded data for the value decoding of assembling information on the external data memory 6 (less because of the SNC710 internal memory), and the external data memory 6 is connected with the SNC710 by a communication line; The value of each single barcode among all the barcodes on the whole page (shown in the lower part of Fig. 7) is the value of a piece of combined information. After the SNC710 of the pickup obtains the code reading value from the waveform of the P07 port input (the hardware structure of the light pen 45 and the principle of reading the bar code can refer to known technologies, the hardware structure of the light pen 45 can also be the hardware structure of the light pen as shown in Figure 2, the principle of reading the bar code and The process can also refer to the description of reading the barcode with a light pen in the previous embodiment 1), and then obtain the value of the music segment splicing information from the value of the barcode. Originally, this pickup can read each barcode in the codewords on the entire page of this audio music book and can decode and play the music section pronunciation according to the value of the combined information in the code value. The barcode is defined as all codewords that the pickup can read. When in use, read a certain codeword (barcode) in these codewords among the codewords on the entire page of the audio music book with this pickup and press the decoding button The program and the decoded data decode the value of the piece combination information in the code word to obtain the music spectrum data, and send the music spectrum data to SNC82030 to play the pronunciation of this music section. (For the method and protocol of SNC710 reading SR008, please refer to their chip information)

本实施例的所提及的乐段谱数据的格式,应是SNC82030能识别的并可由SNC82030用乐段谱数据来播音乐。The format of the music spectrum data mentioned in this embodiment should be recognizable by the SNC82030 and the SNC82030 can use the music spectrum data to play music.

本实施例的外接数据存储器6也是属于本实施例所说的拾音器的一部分。本实施例所说的拾音器,是包括图3中所有部分的。The external data memory 6 of the present embodiment also belongs to a part of the said pickup of the present embodiment. The said pickup of present embodiment comprises all parts in Fig. 3.

本实施例的光笔种类可自选(不一定使用实施例1用的光笔),也可由其它类型的光电式读码头代替,比如ccd读码头。The light pen type of the present embodiment can be optional (not necessarily using the light pen used in embodiment 1), and can also be replaced by other types of photoelectric reading terminals, such as ccd reading terminals.

本申请文件各处提到的乐谱拼合信息码是指从其码值可得到数个乐段拼合信息值的码字,比如其码值是或含有数个乐段拼合信息值的条形码码字。The musical score combination information code mentioned in this application document refers to a code word whose code value can obtain several music section combination information values, such as a barcode code word whose code value is or contains several music section combination information values.

以上2个拾音器实施例中采用的MCU都是以SNC710为例的,实际上这些拾音器实施例所采用的MCU也可用其它具有按音乐合成方式播放音乐的功能的别的芯片厂生产的MCU来代替。The MCUs adopted in the above two pickup embodiments all take SNC710 as an example. In fact, the MCUs used in these pickup embodiments can also be replaced by MCUs produced by other chip factories that have the function of playing music in a music synthesis mode. .

本申请文件所涉及的有读码头(机电式导电码读码头或光电式读码头等)的拾音器,一般情况用一个MCU完成读码,读键,解码及发音等任务即可;但如分开使用多个mcu也可,比如使用两个mcu,互相通讯,一个mcu(假设为主控mcu)完成读码及其它(如读键等)主要控制任务,另一个mcu(假设为从mcu)从通讯线接受主控mcu命令(比如主控mcu通过通讯线向从mcu传输其读码得到的乐段拼合信息的值)来完成播放音乐发音动作,也是可以的,从mcu具有播放音乐的能力比如:可根据主控mcu传给从mcu的拼合信息的值按解码程序及解码数据来解码播放音乐发音且解码时是按音乐合成方式播音乐的而且存储有解码程序及解码数据的存储器需能被从mcu读取以解码(此存储器可位于从mcu片内,或者此存储器可位于从mcu片外的存储器芯片并与从mcu连接使用)。此分开使用多个(两个或两个以上)mcu方案,因这多个mcu可分别看成是一个大的MCU的各个部分,可把这多个mcu一齐当成一个大的MCU,所以此分开使用多个mcu方案的本质与一个MCU方案同,是一个MCU方案的等同替换;比如可把本段前面说到的主控mcu和从mcu当成一个大的MCU,从mcu一般是外接喇叭来发音的,可把喇叭当成发音器件;权利要求2,5,6中提及的单片机芯片都可以是按这样由多个mcu组合而成的形式。甚至还有另一种方案:拾音器里有一个主控mcu和一个播放音乐发音的从mcu(这两个mcu的性能使用及搭配及解码程序及解码数据在存储器里如何布置存储等情况如本段前面述的两个mcu的情况),读码头里也有一个小mcu,读码头的小mcu完成读码字并将读码结果传给主控mcu,主控mcu通过通讯线向从mcu传输从码字的值里得到的乐段拼合信息的值,由从mcu解码播放音乐发音且解码时是按音乐合成方式播音乐的,同样可把此主控mcu和从mcu当成一个mcu,并可把此mcu和读码头里的小mcu一起又当成一个大的MCU(这时可把原读码头里除去小mcu外的部分当成读码头),即把主控mcu,从mcu和读码头里的小mcu一齐当成一个大的MCU,所以此方案的本质与一个MCU方案同,是一个MCU方案的等同替换,而且从mcu一般是外接喇叭来发音的,可把喇叭当成发音器件,权利要求2,5,6中提及的单片机芯片都可以是按这样由多个mcu组合而成的形式。(本段所提及的码字是与拼合信息的值有关的码字;因本段前面提到的多个mcu可被当成一个MCU,所以本段提及的存储器仍是可被作为主处理芯片的MCU读取使用的存储器)The pickups with reading docks (electromechanical conductive code reading docks or photoelectric reading docks, etc.) involved in this application document generally use one MCU to complete tasks such as code reading, key reading, decoding and pronunciation; but if used separately Multiple MCUs are also available, such as using two MCUs to communicate with each other. One MCU (assumed to be the master MCU) completes code reading and other (such as key reading, etc.) main control tasks, and the other MCU (assumed to be the slave mcu) communicates from the It is also possible to complete the pronunciation action of playing music by accepting the command of the main control mcu (for example, the main control mcu transmits the value of the piece combination information obtained by reading the code to the slave mcu through the communication line), and the slave mcu has the ability to play music, such as: According to the value of the mosaic information passed from the main control mcu to the slave mcu, it can decode and play the music pronunciation according to the decoding program and decoding data, and the music is played in the way of music synthesis during decoding, and the memory storing the decoding program and decoding data needs to be able to be read from Mcu reads to decode (this memory can be located in the slave mcu chip, or this memory can be located in the memory chip outside the slave mcu chip and connected with the slave mcu for use). This separates the use of multiple (two or more) MCU solutions, because these multiple MCUs can be regarded as various parts of a large MCU, and these multiple MCUs can be regarded as a large MCU, so this separate The essence of using multiple MCU solutions is the same as that of one MCU solution, and it is an equivalent replacement of an MCU solution; for example, the master MCU and slave MCU mentioned earlier in this paragraph can be regarded as a large MCU, and the slave MCU is usually an external speaker to pronounce Yes, the loudspeaker can be used as a sounding device; the single-chip microcomputer chips mentioned in claims 2, 5, and 6 can all be formed by combining multiple mcu in this way. There is even another solution: the pickup has a master control mcu and a slave mcu that plays music and pronunciation (the performance usage and collocation of these two mcu and the decoding program and how to arrange and store the decoded data in the memory are as shown in this paragraph In the case of the aforementioned two MCUs), there is also a small MCU in the reading terminal. The small MCU in the reading terminal completes the code reading and transmits the code reading result to the master control mcu. The master control mcu transmits the slave code to the slave mcu through the communication line The value of the piece combination information obtained in the word value is decoded by the slave mcu to play the music pronunciation, and the music is played in the way of music synthesis when decoding. The master control mcu and the slave mcu can also be regarded as one mcu, and this The mcu and the small mcu in the reading terminal are used together as a large MCU (at this time, the part of the original reading terminal except the small mcu can be regarded as the reading terminal), that is, the main control mcu, the slave mcu and the small mcu in the reading terminal They are all regarded as a large MCU, so the essence of this scheme is the same as that of an MCU scheme, and it is an equivalent replacement of an MCU scheme. Moreover, the MCU is generally used for sounding by an external speaker, and the speaker can be used as a sounding device. Claims 2, 5, The single-chip microcomputer chips mentioned in 6 can all be formed by combining multiple MCUs in this way. (The code word mentioned in this paragraph is the code word related to the value of the information being assembled; because the multiple mcu mentioned earlier in this paragraph can be regarded as an MCU, so the memory mentioned in this paragraph can still be used as the main processing The chip's MCU reads the memory used)

本申请文件中各处所涉及的码字中的乐段拼合信息的值(或其变换值)等主要是以连续二进制格式为例的,实际上,还可有别的格式,比如可以把乐段拼合信息的值(或其变换值等)的二进制BIT分拆开,分散放在一个码字(比如条形码)里的各部分(比如码字里还有一些别的信息的BIT),解码时按约定把各分散开的乐段拼合信息的值(或其变换值等)组合起来,甚至不用组合,直接按约定读各分散开的乐段拼合信息的值(或其变换值等)的二进制BIT来解码即可,此种格式是连续二进制格式的等同替换;使用别的格式是常规手法且也是使用连续二进制格式的等同替换。The value (or its transformation value) etc. of the music piece combination information involved in the code word everywhere in this application document is mainly based on the continuous binary format as an example. In fact, there are other formats, such as the music section The binary BIT of the value of the combined information (or its transformation value, etc.) is separated and scattered in each part of a codeword (such as a barcode) (for example, there are some BITs of other information in the codeword), and when decoding, press It is agreed to combine the value (or its transformation value, etc.) of each scattered music segment combination information, even without combining, directly read the binary BIT of the value (or its conversion value, etc.) of each scattered music segment combination information according to the agreement To decode, this format is an equivalent replacement of the continuous binary format; using other formats is a conventional method and is also an equivalent replacement of the continuous binary format.

本申请文件各处所称的一个码字的值,是或含有数个乐段拼合信息的值,等情况,如果编码者将码字里的这些值(数个乐段拼合信息的值等,可把这些值称为原来的值),再进行变换,比如加密后得到新的值(此值一般仍是二进制值,可放入导电码或条形码等码字里;可把新的值称为原来的值的变换值),可把新的值放入码字里,在MCU读码处理时,新的值经相应转换(逆变换)后仍可得到原来的值(这里定义的码变换,逆变换是这样的:要使用新的值的所有BIT才能作逆变换,不能对新的值的部分BIT来作逆变换以得到部分原来的值)。码字的值是新的值后,一个码字的值表示的信息仍是与拼合信息的值有关的信息的这种说法的一种具体表现形式(每个这样的码字仍属于有声音乐读物整页面上的诸个码字的范围),仍可把一些这样的码字定义成有声音乐读物整页面上诸个码字中的某拾音器能读的部诸码字,此部诸码字中的各码字是能被此拾音器读取以用于此拾音器播放音乐发音的,此拾音器可读此部诸码字中的码字并根据读得的码字的码值里的新的值来作转换并用于播放音乐发音,新的值可看成是原来的值的等同替换。(本申请文件各处所说的数个乐段拼合信息的值的变换值中的‘变换’也是指本段所说的变换)The value of a code word referred to in this application document everywhere is or contains the value of several music section combination information, etc., if the coder puts these values (the value of several music section combination information etc.) in the code word, can These values are referred to as the original value), and then transformed, such as obtaining a new value after encryption (this value is generally still a binary value, which can be placed in code words such as conductive codes or barcodes; the new value can be called the original value). The transformation value of the value), the new value can be put into the code word, and when the MCU reads the code, the new value can still get the original value after the corresponding transformation (inverse transformation) (the code transformation defined here, the inverse The transformation is like this: all BITs of the new value must be used for inverse transformation, and part of the BIT of the new value cannot be inverse transformed to obtain part of the original value). After the value of the code word is a new value, the information represented by the value of a code word is still the information related to the value of the combined information (each such code word still belongs to the sound music book codewords on the entire page), some such codewords can still be defined as some codewords that can be read by a certain pickup in the codewords on the entire page of audio music books, and all codewords in this part Each code word of each code word can be read by this pickup to be used for this pickup to play music pronunciation, and this pickup can read the code word in this part all code words and according to the new value in the code value of the code word read Converted and used to play music pronunciation, the new value can be regarded as an equivalent replacement of the original value. ('transformation' in the conversion value of the value of the several music section splicing information mentioned in this application document everywhere also refers to the conversion mentioned in this paragraph)

本说明书以上所述码字的值是或含有一个乐段拼合信息的值(或其变换值等)主要是针对一个码字来说的;一般情况,一个码字的值是或含有一个乐段拼合信息的值(或一个乐段拼合信息的值的变换值),拾音器读取一个码字就可处理播音乐,如果表示一个乐段拼合信息的值涉及的BIT(一个乐段拼合信息的值涉及的BIT是指这个乐段拼合信息的值本身或其变换值)长度大于单个码字的BIT码长,那就需要把表示一个乐段拼合信息的值涉及的BIT放在多个码字里,这多个码字所处有声音乐读物的页面位置一般也是需播放音乐的位置,这多个码字在有声音乐读物的页面上一般是印刷排得比较近的并且这多个码字一般是处于同一页面上的,这多个码字也可能分别处于不同页面上,拾音器需按约定(可以在这多个码字中的各码字里开一个或多个BIT标志表示需读多个码字)读取这多个码字(拾音器一般是依次读取这多个码字),然后把读得的这多个码字的值组合起来,得到乐段拼合信息的值,再解码播放音乐(这多个码字一般是同一码种的)。这种情况单个码字的值表示的信息仍然是与拼合信息的值有关的信息,这种情况的各码字仍是属于有声音乐读物整页面上的诸个码字的范围,仍可由拾音器根据读得的码字的码值来播放音乐发音(只不过是拾音器需读多个码字后才播放音乐发音),这种情况的任一码字仍可看成属于有声音乐读物整页面上的诸个码字中的部诸码字中的码字,此部诸码字中的各码字是能被某拾音器读取并用于播音乐的,此拾音器每次读此部诸码字中的一个或多个码字来播音乐。甚至可把一个乐段拼合信息的值(或一个乐段拼合信息的值的变换值)放在多个不同码种的码字里,这多个码字又能被同一拾音器读取以用于播音乐,这多个码字可以属于此拾音器能读的部诸码字,比如这多个码字中既有导电码又有条形码,可由有光电式读码头的拾音器读取这两种码字以播音乐。本说明书中各处述及的有关各种有声音乐读物形式,有声音乐读物与拾音器的配合使用形式等内容中各处涉及读一个码字就可播音乐的地方,都可用每次读多个码字来播音乐的情况来做替换,比如原来有声音乐读物的一页面上有一个码字,码字的值是一个乐段拼合信息的值,拾音器读这个码字就可播音乐,现把这一个码字换成印刷两个码字,把那个乐段拼合信息的值的BIT分开放在这两个码字里,原来的拾音器需读这两个码字后才可得到此乐段拼合信息的值的BIT以播音乐。The value of the above-mentioned code word in this specification is or contains a music segment. The value (or its transformation value, etc.) is mainly for a code word; The value of combining information (or the conversion value of the value of a piece combining information), the pickup reads a codeword and just can process the playing music, if it represents the BIT (the value of a piece combining information) involved in the value of a piece combining information The BIT involved refers to the value itself or its transformed value of the music piece combination information) length is greater than the BIT code length of a single codeword, then it is necessary to put the BIT involved in the value representing a piece of music piece together information in multiple codewords , the page positions of the audio music books where these multiple code words are generally also the position where music needs to be played, these multiple code words are generally printed and arranged relatively close to each other on the page of the audio music books, and these code words are generally Being on the same page, these multiple codewords may also be on different pages respectively, and the pickup needs to be agreed (one or more BIT signs can be opened in each codeword in these multiple codewords to indicate that multiple codewords need to be read) word) to read these multiple codewords (the pickup generally reads these multiple codewords in sequence), and then combine the values of the multiple codewords read to obtain the value of the piece information, and then decode and play the music (these multiple codewords are generally of the same code type). The information represented by the value of the single codeword in this situation is still the information relevant to the value of the combined information, and each codeword in this situation still belongs to the scope of all codewords on the whole page of the audiobook, and can still be used by the pickup according to the The code value of the read code word is used to play the music pronunciation (just that the pickup needs to read a plurality of code words before playing the music pronunciation), any code word in this situation can still be regarded as belonging to the audio music book on the entire page The code words in the code words of the department in the code words, each code word in the all code words of this part can be read and used for broadcasting music by a certain pickup, and this pickup reads each code word in the all code words of this part One or more codewords to play music. Even the value (or the transformation value of the value of a music section combination information) of a music section can be placed in the code words of a plurality of different code types, and these multiple code words can be read by the same pickup again for use in Play music, these multiple code words can belong to all code words that this pick-up can read, such as there are both conductive codes and barcodes in these multiple code words, can read these two kinds of code words by the pick-up that has photoelectric type reading terminal to play music. In this instruction manual, various forms of audio music books, the form of cooperating use of audio music books and pickups, etc. are mentioned everywhere in the place where music can be played by reading a code word, which can be read by reading multiple codes at a time. It is used to replace the situation in which words are used to play music. For example, there is a code word on a page of the original audio music book, and the value of the code word is the value of a piece of combined information. The pickup can play music by reading this code word. One codeword is replaced by two codewords, and the BIT of the value of the music segment information is separated into the two code words. The original pickup needs to read the two code words to obtain the music segment information. The BIT of the value to play music.

码字的值甚至还可以是更复杂的形式,比如把数个乐段拼合信息的值的变换值分散放在多个码字里,这些码字中每个码字的值是或含有此数个乐段拼合信息的值的变换值的部分,这些码字中每个码字的值表示的信息仍是与拼合信息的值有关的信息,可把这些码字印刷在一有声音乐读物的整页面上,这些码字中每个码字仍是此有声音乐读物整页面上的诸个码字中的码字,拾音器可读这些码字并从码字中得到数个乐段拼合信息的值以用于播放音乐。The value of the code word can even be in a more complex form, such as the transformation value of the value of several pieces of piece together information scattered in multiple code words, the value of each code word in these code words is or contains this number The part of the conversion value of the value of the combination information of a music section, the information represented by the value of each code word in these code words is still the information relevant to the value of the combination information, and these code words can be printed on the whole of an audio music book. On the page, each code word in these code words is still a code word among the code words on the entire page of this audio music book, and the pickup can read these code words and obtain the value of several music segment splicing information from the code words for playing music.

码字的值,除了可以是本说明书其它段所述的一个码字的值是或含有一个乐段拼合信息的值,或者一个码字的值是或含有一个乐段拼合信息的值的变换值等外,另外实际上还可以是一个码字的值是或含有多个乐段拼合信息的值,还可以一个码字的值是或含有一个乐段拼合信息的值的部分,还可以一个码字的值是或含有多个乐段拼合信息的值的变换值,还可以一个码字的值是或含有多个乐段拼合信息的值的变换值的部分;多个乐段拼合信息的值涉及的BIT在码值里如何放置排列由编码者自己定义,拾音器读码并取得各乐段拼合信息的值后一般可顺序播放各乐段发音;例:在一个页面上印刷有3个乐段的乐谱,每个乐段的乐谱旁印一个其码值是这个乐段对应的乐段的拼合信息的值的码字,拾音器可一次读一个码字并播放相应的乐段发音,如在页面上部另再印一个码字,码字的值是3个乐段拼合信息的值(这3个乐段拼合信息的值各与本段前面述的3个乐段一一对应),如拾音器读这个码字,就可一次把3个乐段的发音都播放出来。更复杂的,码字的值还可能是数个乐段拼合信息的值的变换值加数个乐段拼合信息的值,等,这样的码字的值表示的信息仍是与拼合信息的值有关的信息,拾音器可读这些码字并从码字中得到数个乐段拼合信息的值以用于播放音乐。The value of a codeword, except that the value of a codeword is or contains the value of a piece of music piece information described in other paragraphs of this specification, or the value of a codeword is or contains a transformation value of the value of a piece of piece piece together information In addition, in fact, the value of a codeword can also be or contain the value of a plurality of music segment combination information, the value of a code word can also be or contain a part of the value of a music segment combination information, or a code word The value of the word is or contains the conversion value of the value of a plurality of musical sections combining information, and the value of a code word can also be or contain the part of the conversion value of the value of a plurality of musical sections combining information; The value of a plurality of musical sections combining information How to place and arrange the BIT involved in the code value is defined by the coder himself. After the pickup reads the code and obtains the value of the combination information of each section, the pronunciation of each section can be played in sequence; for example: there are 3 sections printed on one page The music score of each music section is printed with a code word whose code value is the value of the combined information of the music section corresponding to this music section. The pickup can read a code word at a time and play the pronunciation of the corresponding music section. The upper part prints a code word again, and the value of code word is the value of 3 music paragraphs' combined information (the value of these 3 music paragraphs' combined information is respectively corresponding to the aforementioned 3 music paragraphs of this paragraph), as the pickup reads This code word, just can once all play out the pronunciation of 3 music sections. To be more complicated, the value of the codeword may also be the transformation value of the value of several pieces of pieced together information plus the value of several pieces of pieced together information, etc., the information represented by the value of such a codeword is still the value of the pieced together information For relevant information, the pickup can read these codewords and obtain the values of several pieces of piecewise information from the codewords for playing music.

以上4段所说的码字的那些形式(一个码字的值是或含有一个乐段拼合信息的值,一个码字的值是或含有数个乐段拼合信息的值的变换值,一个码字的值是或含有一个乐段拼合信息的值的部分,一个码字的值是或含有数个乐段拼合信息的值的变换值的部分,一个码字的值是或含有多个乐段拼合信息的值,码字的值是数个乐段拼合信息的值的变换值加数个乐段拼合信息的值,等),其码字的码值表示的信息都是与拼合信息的值有关的信息;拾音器可读数个这些形式的码字以得到数个乐段拼合信息的值以播放音乐。Those forms of said code word of above 4 paragraphs (the value of a code word is or contains the value of a music section splicing information, and the value of a code word is or contains the transformation value of the value of several music section splicing information, and a code word The value of a word is or contains a part of the value of a piece of musical information, the value of a code word is or contains a part of the transformed value of the value of several music pieces, and the value of a code word is or contains a plurality of music sections The value of the combined information, the value of the code word is the transformation value of the value of the combined information of several music sections plus the value of the combined information of several music sections, etc.), the information represented by the code value of the code word is the value of the combined information Relevant information; the pickup can read several codewords of these forms to obtain the value of several pieces of information to play music.

码字的值,除了可以是本说明书其它段所述的一个码字的值是或含有一个乐段拼合信息的值,或者一个码字的值是或含有一个乐段拼合信息的值的变换值等外,另外实际上还可以是一个码字的值是或含有多个乐段拼合信息的值,还可以一个码字的值是或含有一个乐段拼合信息的值的部分,还可以一个码字的值是或含有多个乐段拼合信息的值的变换值,还可以一个码字的值是或含有多个乐段拼合信息的值的变换值的部分,还可以一个码字的值是或含有数个乐段拼合信息的值的变换值加数个乐段拼合信息的值,等;以上这些码字的形式,其码字的码值表示的信息都是与拼合信息的值有关的信息;拾音器可读数个这些形式的码字以得到数个乐段拼合信息的值以播放音乐;多个乐段拼合信息的值涉及的BIT在码值里如何放置排列由编码者自己定义,拾音器读码并从码字中取得各乐段拼合信息的值后一般可顺序播放各乐段发音;例:在一个页面上印刷有3个乐段的乐谱,每个乐段的乐谱旁印一个其码值是这个乐段的拼合信息的值的码字,拾音器可一次读一个码字并解码播放相应的乐段发音,如在页面上部另再印一个码字,码字的值中有这3个乐段各自的拼合信息的值,如拾音器读这个码字,就可一次把3个乐段的发音都解码播放出来。The value of a codeword, except that the value of a codeword is or contains the value of a piece of music piece information described in other paragraphs of this specification, or the value of a codeword is or contains a transformation value of the value of a piece of piece piece together information In addition, in fact, the value of a codeword can also be or contain the value of a plurality of music segment combination information, the value of a code word can also be or contain a part of the value of a music segment combination information, or a code word The value of the word is or contains the transformation value of the value of a plurality of musical sections combining information, and the value of a codeword can also be or contain the part of the transformation value of the value of a plurality of musical sections combining information, and the value of a codeword can also be Or the transformation value of the value containing several musical sections combining information plus the value of several musical sections combining information, etc.; the above forms of these code words, the information represented by the code value of the code word is all related to the value of combining information Information; the pick-up can read several code words of these forms to obtain the value of several pieces of combined information to play music; how to place and arrange the BITs involved in the value of multiple pieces of combined information in the code value is defined by the coder himself, and the pick-up After reading the code and obtaining the value of the combination information of each music section from the code word, the pronunciation of each music section can generally be played sequentially; for example: a sheet of music with 3 music sections is printed on a page, and a score next to each section is printed. The code value is the code word of the combined information value of this music section. The pickup can read one code word at a time and decode and play the corresponding music section pronunciation. If another code word is printed on the upper part of the page, the value of the code word has these 3 The value of the combined information of each music section, if the pickup reads this code word, the pronunciation of the 3 music sections can be decoded and played at one time.

本申请文件所说的码字的值可转换得到数个拼合信息的值,等价于说码字的值是或含有数个乐段拼合信息的值的变换值。The value of the code word mentioned in this application document can be converted to obtain the value of several combination information, which is equivalent to saying that the value of the code word is or contains the transformation value of the value of the combination information of several music segments.

本申请文件的说明书和权利要求书中各处所说的拾音器读部诸码字中的码字后播放音乐发音,一般有两种情况:1是拾音器读一个码字后,拾音器根据从这一个码字的值里得到(直接得到或转换得到)的数个乐段拼合信息的值来播放音乐发音(一般是这数个乐段的发音),2是指拾音器读多个码字后,拾音器根据把这多个码字的值组合起来得到(直接得到或转换得到)的数个乐段拼合信息的值来播放音乐发音(一般是这数个乐段的发音),可能是一个乐段拼合信息的值是分散放在这多个码字里的,也可能是这数个乐段拼合信息的值中全部或部分乐段拼合信息的值是分散放在这多个码字里的,也可能是这数个乐段拼合信息的值的变换值是分散放在这多个码字里的,而且一般在这多个码字中每个码字里有BIT标志表示需读这多个码字来得到数个拼合信息的值以解码播音乐。显然,如以上两种情况如读一个码字得到(或读多个码字得到)多个乐段拼合信息的值后,那拾音器播的音乐一般应是顺序播放这多个乐段的发音(除非有其它约定,比如码字中有BIT标志表示只播某些乐段)。本段以上各处提到的‘多个码字’中的各码字以及本段以上各处提到的‘一个码字’都是属于本段第1句提到的‘部诸码字’中的码字,本段以上除本段第1句外各处提到的‘拾音器’都是指本段第1句提到的‘拾音器’;权利要求6中提到的‘并由前述拾音器的前述单片机芯片根据读得的前述部诸码字中的码字的码值使前述拾音器的前述发音器件播放音乐发音’,权利要求2中提到的‘前述拾音器的前述单片机芯片根据读得的前述部诸码字中的码字的码值使前述拾音器的前述发音器件播放音乐发音’都可以是本段前面所述情况,即这里说的‘根据读得的前述部诸码字中的码字的码值’是指根据读得的前述部诸码字中的一个码字的码值或多个码字的码值。本申请文件各处提到的‘拾音器读码字以播放音乐’等类似表达也可参照本段前面所述。Play the music pronunciation after the code words in the code words in the code words of the pickup reading part said everywhere in the description of the application document and the claims, generally have two kinds of situations: 1 is after the pickup reads a code word, the pickup is based on this In the value of the word, obtain (directly obtain or convert) the value of several musical sections to combine the information to play the music pronunciation (generally the pronunciation of these several musical sections). Combining the values of these multiple codewords to obtain (directly obtained or converted) the value of several pieces of piecewise information to play the music pronunciation (usually the pronunciation of these pieces), which may be a piece of piecewise information The value of the value is scattered in these multiple codewords, and it may also be that all or part of the value of the music piece combination information is scattered in these multiple codewords, or it may be It means that the transformation values of the values of these several pieces of music segment information are scattered in these multiple codewords, and generally there is a BIT mark in each codeword of these multiple codewords to indicate that these multiple codewords need to be read To get the value of several combined information to decode the playing music. Obviously, after reading a codeword and obtaining (or reading a plurality of codewords to obtain) the value of a plurality of musical sections' combined information as in the above two cases, the music played by the pickup generally should be the pronunciation of these multiple musical sections in order ( Unless there are other agreements, for example, there is a BIT mark in the codeword to indicate that only certain sections are played). Each of the codewords in the "multiple codewords" mentioned above in this paragraph and the "one codeword" mentioned in this paragraph above all belong to the "partial codewords" mentioned in the first sentence of this paragraph In the code word, the 'pickup' mentioned everywhere in this paragraph except the first sentence of this paragraph all refers to the 'pickup' mentioned in the first sentence of this paragraph; 'mentioned in claim 6 and by the aforementioned pickup The aforementioned single-chip microcomputer chip makes the aforementioned pronunciation device of the aforementioned pickup play the music pronunciation according to the code value of the code words in the aforementioned codewords read, the aforementioned single-chip microcomputer chip of the aforementioned pickup mentioned in claim 2 according to the read The code value of the code word in all the code words of the aforementioned part makes the aforementioned pronunciation device of the aforementioned sound pick-up play the music pronunciation' all can be the situation described in front of this paragraph, promptly said here' according to the code in all the code words of the aforementioned part read. The code value ' of word refers to the code value of a code word or the code value of a plurality of code words in all code words according to the aforementioned parts that read. Similar expressions such as 'the pickup reads code words to play music' mentioned throughout this application document can also refer to the foregoing description of this paragraph.

上段第2行至第14行所述也是对权利要求书中各处提到的‘所述拾音器在被使用时,所述拾音器的所述单片机芯片读得所述部诸码字中的任一码字后,都不需再读所述部诸码字中的其它码字即可播音乐,或者,所述拾音器在被使用时,所述拾音器的所述单片机芯片读得所述部诸码字中的任一码字后,都需再读所述部诸码字中的其它数个码字才可播音乐,或者,所述拾音器在被使用时,所述拾音器的所述单片机芯片读得所述部诸码字中的部分码字中的任一码字后都不需再读所述部诸码字中的其它码字即可播音乐,所述拾音器的所述单片机芯片读得所述部诸码字中另外部分码字中的任一码字后都需再读所述部诸码字中的其它数个码字才可播音乐;’中涉及到的单片机芯片读一个码字即可播音乐和单片机芯片需读多个码字才可播音乐等情况的解释。(本段说的‘上段第2行至第14行’是指从上段第2行‘一般’至上段第14行‘等同替换’止)What is said in the 2nd line to the 14th line of the previous paragraph is also to mention everywhere in the claims 'when the pickup is used, the single-chip microcomputer chip of the pickup reads any one of the codewords of the said section. After the code word, it is not necessary to read other code words in the code words to play music, or, when the pickup is used, the single-chip microcomputer chip of the pickup reads the codes of the parts. After any code word in the word, it is necessary to read other several code words in the code words to play music again, or, when the pickup is used, the single-chip microcomputer chip of the pickup reads Obtain any code word in the part code word in all code words of described part all need not read other code word in all code words of described part again and can broadcast music, and the described single-chip microcomputer chip of described sound pick-up reads After any code word in the other part of the code words in the all code words of the said part, all need to read the other several code words in the code words of the said part to play music; An explanation of the situation that music can be played just by one word and that a single-chip microcomputer chip needs to read multiple code words before it can play music. ("The 2nd line to the 14th line of the above paragraph" mentioned in this paragraph refers to the "general" from the 2nd line of the previous paragraph to the "equivalent replacement" in the 14th line of the previous paragraph)

权利要求2中提到的‘前述部诸码字中的每个码字能被前述拾音器读取以用于前述拾音器播放音乐发音’(本说明书中与此类似的提法也按本段所述理解),对于部诸码字中的任一码字,一般有两种情况,1是指拾音器只需读这一个码字就可播放音乐,拾音器读这一个码字后,拾音器根据从这一个码字的值里得到(直接得到或转换得到)的数个乐段拼合信息的值来播放音乐发音(一般是这数个乐段的发音),2是需用拾音器读部诸码字中包括这一个码字在内的多个码字才能播放音乐,拾音器读这多个码字后,拾音器根据把这多个码字的值组合起来得到(直接得到或转换得到)的数个乐段拼合信息的值来播放音乐发音(一般是这数个乐段的发音),可能是一个乐段拼合信息的值是分散放在这多个码字里的,也可能是这数个乐段拼合信息的值中全部或部分乐段拼合信息的值是分散放在这多个码字里的,而且一般在这多个码字中每个码字里有BIT标志表示需读这多个码字来得到数个拼合信息的值以解码播音乐。显然,依以上两种情况如读一个码字得到(或读多个码字得到)多个乐段拼合信息的值后,那拾音器播放的音乐一般应是顺序播放这多个乐段的发音(除非有其它约定,比如码字中有BIT标志表示只播某些乐段)。本段以上各处提到的‘多个码字’中的各码字以及本段以上各处提到的‘一个码字’都是属于本段第1句提到的‘部诸码字’中的码字,本段以上除本段第1句外各处提到的‘拾音器’都是指本段第1句提到的‘拾音器’。Mentioned in claim 2 'each code word in the aforementioned code words can be read by the aforementioned pickup for the aforementioned pickup to play music pronunciation' (similar formulations in this description are also described in this paragraph Understand), for any code word in all code words of part, generally have two kinds of situations, 1 refers to that pickup only needs to read this code word and just can play music, after pickup reads this code word, pickup is according to from this one In the value of the code word, obtain (directly obtain or convert) the value of several music sections to combine information to play the music pronunciation (generally the pronunciation of these several music sections), and 2 need to include in all code words of the pickup reading section Only multiple codewords including this codeword can play music. After the pickup reads the multiple codewords, the pickup will combine the values of the multiple codewords to obtain (directly obtained or converted) several paragraphs. The value of the information is used to play the music pronunciation (usually the pronunciation of these several paragraphs), it may be that the value of a paragraph combination information is scattered in these multiple codewords, or it may be the combination information of these several paragraphs The value of all or part of the music piece combination information is scattered in these multiple codewords, and generally there is a BIT mark in each codeword of these multiple codewords to indicate that it is necessary to read these multiple codewords to Get the value of several spliced information to decode playing music. Obviously, after reading a code word and obtaining (or reading a plurality of code words to obtain) the value of a plurality of musical sections combining information according to the above two situations, the music played by the pickup should generally be the pronunciation of these multiple musical paragraphs in order ( Unless there are other agreements, for example, there is a BIT mark in the codeword to indicate that only certain sections are played). Each of the codewords in the "multiple codewords" mentioned above in this paragraph and the "one codeword" mentioned in this paragraph above all belong to the "partial codewords" mentioned in the first sentence of this paragraph In the code word, the 'pickup device' mentioned everywhere in this paragraph except the first sentence of this paragraph all refers to the 'pickup device' mentioned in the first sentence of this paragraph.

关于拾音器要读多个码字来播音乐的情况,一是可能一个乐段拼合信息的值的所有BIT分散放在多个码字里,这多个码字中每个码字的值里没其它乐段拼合信息的值,所以拾音器必须要读多个码字才能播音乐(而且一般在这多个码字中每个码字里有BIT标志表示需读这多个码字来得到拼合信息的值以解码播音乐);还可能是拾音器本来读一个码字就可播音乐,但有约定(比如在这个码字里有BIT标志指示)要多读几个码字再一起播音乐,这样拾音器就读几个码字再一起根据从这几个码字里得到的数个乐段拼合信息的值来播各自乐段的发音;关于拾音器要读多个码字来播音乐的情况,还可能有本段所述以外的其它情况。Regarding the situation that the pickup needs to read multiple codewords to play music, firstly, all the BITs of the value of a piece of combined information may be scattered in multiple codewords, and there is no The value of other segment information, so the pickup must read multiple codewords to play music (and generally there is a BIT mark in each codeword among these multiple codewords, indicating that it needs to read these multiple codewords to obtain the combination information It may also be that the pickup can play music by reading a codeword originally, but there is an agreement (for example, there is a BIT sign indication in this codeword) to read a few more codewords and then play music together, so that The pickup reads several codewords and then broadcasts the pronunciation of the respective sections according to the value of the combination information of several sections obtained from these codewords; There are circumstances other than those described in this paragraph.

本申请文件所说的拾音器读一个或多个码字后播音乐,一般有两种情况,1是从这一个或多个码字中得到多少乐段拼合信息的值,拾音器就播多少乐段音乐,2是拾音器可按约定(比如在这一个或多个码字里有BIT标志指示或读者通过拾音器的按键作了设定等)只把从这一个或多个码字中得到的所有的乐段拼合信息的值中的部分乐段拼合信息的值对应的乐段发音播放出来(比如一个码字的值是3个乐段拼合信息的值,拾音器读此码字后只按此码字的码值中的3个乐段拼合信息的值中的两个乐段拼合信息的值来分别解码播放这两个乐段各自的发音),这两种情况可看成互相是等同替换。The said pick-up of this application document reads one or more code words and plays music after reading, generally have two kinds of situations, 1 is the value that how many music sections are stitched together information obtained from this one or more code words, how many music sections are just played by the pick-up Music, 2 is that the pickup can only use all the information obtained from the one or more codewords by agreement (for example, there is a BIT sign indication in this one or more codewords or the reader has made a setting by the button of the pickup, etc.) Part of the value of the segment combination information in the value of the segment combination information corresponds to the pronunciation of the segment (for example, the value of a code word is the value of 3 segment combination information, and the pickup only presses this code word after reading the code word In the value of the code value of the 3 music sections, the values of the information of the two music sections are combined to decode and play the respective pronunciations of the two music sections), these two situations can be regarded as equivalent replacements.

拾音器读多个码字后播音乐,与拾音器读一个码字后播音乐实际上是等同替换,因为它们都是实现读码播放音乐的目的,分次读多个码字相当于就是有一个面积很大的读码头的拾音器一次读一个码字(这个码字的面积也很大,是那多个码字合起来的面积)。Playing music after the pickup reads multiple codewords is actually equivalent to playing music after the pickup reads one codeword, because they both achieve the purpose of reading codes and playing music. Reading multiple codewords in batches is equivalent to having an area The pickup of the very large reading dock reads one codeword at a time (the area of this codeword is also very large, which is the combined area of those multiple codewords).

关于在多个码字中每个码字里有BIT标志表示需读这多个码字来播放音乐的情况,一般这多个码字在有声音乐读物的页面上是印刷得很靠近的,而且可能在页面上有印刷图文来提示需读这多个码字一起播音乐,拾音器读其中一个码字后,拾音器根据码字里需读多个码字的BIT标志来暂不播音乐而是等继续读码,等读完这多个码字后一起播音乐,而且每次读一个码字后继续等待读码时,拾音器可播出提示语音提醒读者继续读码。本段所述是常识。With regard to the situation that there is a BIT sign in each code word in multiple code words and needs to read these multiple code words to play music, generally these multiple code words are printed very close to each other on the page of audio music reading material, and There may be printed pictures and texts on the page to remind you to read these multiple codewords and play music together. After the pickup reads one of the codewords, the pickup will temporarily stop playing music according to the BIT flags that need to read multiple codewords in the codeword. Continue to read codes, play music together after reading these multiple code words, and when you continue to wait for code reading after reading a code word each time, the pickup can play a prompt voice to remind readers to continue reading codes. This paragraph is common sense.

关于本申请文件各处所阐述的具有机电式或光电式读码头的拾音器与有声音乐读物的形状,这仅是机械结构等方面的问题,对形状没必要硬性规定,使用本申请文件阐述的有声音乐读物及拾音器的原理,可以把有声音乐读物与拾音器做成多种形式,比如:一种方式是把拾音器做成大机体连读码头形式,把MCU和发音器件和按键模块做在机体内,读码头可做成移动的,并通过导线与机体连接,把大的机体连读码头当成拾音器,大机体的外型可以是适于让有声音乐读物放入机体里的,使用时,将有声音乐读物放入机体里使用,移动读码头去读有声音乐读物的整页面上的码字,由机体内的MCU取得读码值并播音乐;另外,大机体的外型也可以是适于让有声音乐读物放在机体上的,使用时,将有声音乐读物(如图1所示为有声音乐读物示例)放在机体上使用,移动读码头去读有声音乐读物的整页面上的码字,由机体内的MCU取得读码值并播音乐。另一种方式不做大机体,而把MCU和读码头和发音器件和按键模块等集成在一个小壳体里成为适于手持的拾音器(当然读码头一般需露出在小壳体外以便读码),而有声音乐读物是与小壳体独立的,有声音乐读物可仍是以读物为主体(如图1所示或图6中的下半部分所示为有声音乐读物示例),在读物的整页面上有诸个码字,读物的整页面上各码字的位置是随意的,使用时有声音乐读物可放在桌面不动,手持小壳体,移动小壳体用小壳体上的读码头去接触或接近(如为机电式读码头时可以用接触的方式,如为光电式读码头时可用接触或接近的方式)在有声音乐读物的整页面上的码字以读码播音乐,当有声音乐读物的一个页面上有多个码字时,可实现想读哪个码字就读哪个码字;另一种方式还可把读码头做成移动的,将MCU和发音器件集成在一个小壳体里,此小壳体可放在桌面上,读码头与小壳体用导线连接,把小壳体连读码头当成拾音器;使用时,移动读码头去读有声音乐读物的整页面上的码字,由小壳体内的MCU取得读码值并播音乐;除以上几种拾音器的外型形式外,还可以有其它的拾音器的外型形式。Regarding the shapes of the pickups with electromechanical or photoelectric reading docks and audio music books described in various places in this application document, this is only a matter of mechanical structure and other aspects, and there is no need for rigid regulations on the shape. Use the audio music books described in this application document. The principle of reading materials and pickups can be made into various forms of audio music books and pickups. For example: one way is to make the pickup into a large-body continuous reading dock, and make the MCU, pronunciation devices and button modules in the body. The dock can be made movable, and is connected with the body by wires, and the large body continuous reading dock is used as a pickup, and the appearance of the large body can be suitable for allowing audio music books to be put into the body. Put it into the body for use, move the reading dock to read the code words on the entire page of audio music books, and the MCU in the body gets the code reading value and plays music; in addition, the appearance of the large body can also be suitable for audio music If the reading material is placed on the body, when in use, the audio music reading material (as shown in Figure 1 is an example of the audio music reading material) is placed on the body and used, and the mobile reading dock is used to read the codewords on the entire page of the audio music reading material, which is controlled by the body The MCU inside gets the code reading value and plays music. Another way is not to make a large body, but to integrate the MCU, the reading terminal, the pronunciation device and the button module into a small housing to become a pickup suitable for hand-held (of course, the reading terminal generally needs to be exposed outside the small housing for code reading) , and the audio music book is independent of the small shell, the audio music book can still be based on the reading material (as shown in Figure 1 or shown in the lower part of Figure 6 as an example of the audio music book), in the entire reading material All code words are arranged on the page, and the position of each code word on the whole page of reading material is random, and sound music reading material can be placed on the tabletop during use, and hand-held small casing, moves small casing with the reading on the small casing. The dock goes to touch or approach (such as the electromechanical reading dock can be used in a contact manner, such as the photoelectric reading dock can be contacted or approached) the code words on the entire page of the audio music reading material to read the code and play music, When there are multiple codewords on one page of audio music books, you can read which codeword you want to read; another way can also make the reading terminal mobile, integrating MCU and pronunciation device in a small Inside the housing, this small housing can be placed on the desktop, and the reading terminal is connected with the small housing by a wire, and the small housing is used as a pickup; when in use, move the reading terminal to read the entire page of the audio music book For the code word, the code reading value is obtained by the MCU in the small housing and the music is played; in addition to the above-mentioned external forms of the pickups, other external forms of the pickups are also possible.

如图4,是一个大机体连读码头形式的拾音器外型实例和有声音乐读物的实例,拾音器外型是大机体连读码头形式的(即把MCU和发音器件和按键模块等集成在一个机体里,读码头可做成移动的,并通过导线与机体连接,把大的机体连读码头当成拾音器),机体的外型是适于让有声音乐读物放入机体里的,1是导电码读码头,4是按键模块(即两个按键,此按键模块是由两个按键组成的,按键不是必需的可省略),图中有声音乐读物的主体是书,书的页面上有导电码10,使用时,将有声音乐读物放入机体里,手持移动读码头1,用读码头1去接触有声音乐读物的整页面上的导电码10以读码,机体内的MCU取得读码值并根据码值播放音乐发音。(本段说的读码头显然可以换成光电式读码头比如光笔或CCD读码头以读取有声音乐读物的整页面上的导电码10)。As shown in Figure 4, it is an example of the appearance of a pickup in the form of a large-body continuous reading dock and an example of audio music books. Here, the reading dock can be made mobile, and connected with the body through wires, and the large body connecting reading dock is used as a pickup), the shape of the body is suitable for putting audio music books into the body, 1 is the conductive code reading Wharf, 4 is a button module (i.e. two buttons, this button module is made up of two buttons, and the button is not necessary and can be omitted). When in use, put the audio music book into the body, hold the mobile reading dock 1, use the reading dock 1 to touch the conductive code 10 on the entire page of the audio music book to read the code, the MCU in the body obtains the code reading value and according to the code The value plays the music pronunciation. (the reading dock that this paragraph says can be changed into photoelectric type reading dock such as light pen or CCD reading dock to read the conduction code 10 on the whole page of audiobook reading material obviously).

如图5,是一个大机体连读码头形式的拾音器外型实例和有声音乐读物的实例,拾音器外型是大机体连读码头形式的(即把MCU和发音器件和按键模块等集成在一个机体里,读码头可做成移动的,并通过导线与机体连接,把大的机体连读码头当成拾音器),机体的外型是适于让有声音乐读物放在机体上的,1是机电式甲读码头,4是按键模块(即两个按键,此按键模块是由两个按键组成的,按键不是必需的可省略),图中有声音乐读物的主体是乐谱书,乐谱书的页面上有甲导电码10,使用时,将有声音乐读物放在机体上,手持移动甲读码头1,用甲读码头1去接触有声音乐读物的整页面上的甲导电码10以读码,机体内的MCU取得读码值并根据码值播放音乐发音。(本段说的读码头显然可以换成光电式读码头比如光笔或CCD读码头以读取有声音乐读物的整页面上的甲导电码)As shown in Figure 5, it is an example of the appearance of a pickup in the form of a large-body continuous reading dock and an example of audio music books. Here, the reading dock can be made movable, and connected with the body by wires, and the large body reading dock is used as a pickup), the shape of the body is suitable for allowing audio music books to be placed on the body, and 1 is an electromechanical armor Read the terminal, 4 is a button module (i.e. two buttons, this button module is made up of two buttons, the button is not necessary and can be omitted), the main body of the audio music book in the figure is a music score book, and there is a Conductive code 10, when in use, put the audio music reading material on the body, hold the mobile A reading dock 1, and use the A reading dock 1 to touch the A conductive code 10 on the entire page of the audio music reading material to read the code, the MCU in the body Get the code reading value and play music pronunciation according to the code value. (the reading terminal mentioned in this paragraph can obviously be replaced with a photoelectric reading terminal such as a light pen or a CCD reading terminal to read the first conduction code on the entire page of the audio music book)

如图6,是一个手持式拾音器外型实例和页面上印有导电码的有声音乐读物的实例,有声音乐读物与拾音器是独立的,图6上半部分示的是一个可手持的拾音器(即把MCU和读码头和发音器件和按键模块等集成在一个小壳体里成为适于手持的拾音器,读码头一般需露出在小壳体外以便读码),1是机电式的甲读码头,4是按键模块(即两个按键,此按键模块是由两个按键组成的,按键不是必需的可省略),图6中的下半部分示的是一个有声音乐读物实例,此有声音乐读物的主体是乐谱书,乐谱书的页面上有甲导电码10,使用时,手持移动拾音器,用拾音器上的甲读码头1去接触图6中的下半部分示的有声音乐读物的整页面上的甲导电码10,读码并根据码值播放音乐发音。(本段说的读码头显然可以换成光电式读码头比如光笔或CCD读码头以读取有声音乐读物的整页面上的甲导电码)。As shown in Figure 6, it is an example of a hand-held pickup appearance and an audio book that is printed with conductive codes on the page. Integrate the MCU, the reading terminal, the pronunciation device and the button module into a small housing to become a hand-held pickup, the reading terminal generally needs to be exposed outside the small housing for code reading), 1 is the electromechanical A reading terminal, 4 It is a button module (i.e. two buttons, this button module is made up of two buttons, and the button is not necessary and can be omitted), and what the lower part in Fig. 6 shows is an example of an audiobook, the main body of this audiobook It is a music score book, and there is a conductive code 10 on the page of the music score book. When in use, a mobile pickup is held in hand, and the first reading terminal 1 on the pickup is used to contact the first on the entire page of the audio music book shown in the lower part of Fig. 6. Conductive code 10, read the code and play music pronunciation according to the code value. (the reading dock that this paragraph says can be changed into photoelectric type reading dock obviously such as light pen or CCD reading dock to read the first conduction code on the whole page of audiobook).

如图7,是一个小壳体连读码头形式的拾音器外型实例和有声音乐读物的实例,拾音器外型是小壳体连读码头形式的(即把MCU和发音器件和按键模块等集成在一个小壳体里,此小壳体可放在桌面上,读码头可做成移动的,并通过导线与小壳体连接,把小壳体连读码头当成拾音器),45是光笔读码头,4是按键模块(即两个按键,此按键模块是由两个按键组成的,按键不是必需的可省略),图7下半部分示的有声音乐读物的主体是书,书的页面上有条形码57,使用时,手持移动光笔读码头45,用光笔读码头45去接触有声音乐读物的整页面上的条形码57以读码,小壳体内的MCU从读得的数个码字里取得拼合信息的值并根据拼合信息的值解码播放音乐发音。(本段说的读码头显然可以换成机电式导电码甲读码头以读取有声音乐读物的整页面上的甲导电码)As shown in Figure 7, it is an example of the appearance of the pickup in the form of a small housing continuous reading dock and an example of audio music books. In a small housing, this small housing can be placed on the desktop, the reading dock can be made mobile, and connected with the small housing by wires, the small housing connected reading dock is used as a pickup), 45 is the light pen reading dock, 4 is a button module (i.e. two buttons, this button module is made up of two buttons, the button is not necessary and can be omitted), the main body of the audio music book shown in the lower part of Figure 7 is a book, and there is a barcode on the page of the book 57. When in use, hold the mobile light pen to read the terminal 45, and use the light pen to read the terminal 45 to touch the barcode 57 on the entire page of the audio music book to read the code, and the MCU in the small housing obtains the stitching information from the read code words value and decode the playback music pronunciation according to the value of the spliced information. (the reading dock mentioned in this paragraph can obviously be replaced with electromechanical conductive code A reading dock to read the first conductive code on the entire page of the audio music book)

上面五段内容都属常识。以上五段各处提及的导电码的值表示的信息是与拼合信息的值有关的信息。以上1段各处提及的条形码的值表示的信息是与拼合信息的值有关的信息。The above five paragraphs are all common sense. The information represented by the value of the conductive code mentioned in the above five paragraphs is information related to the value of the combined information. The information indicated by the value of the barcode mentioned in the above paragraph 1 is information related to the value of the combined information.

一般机电式导电码读码头上有有弹性的导电胶条(比如2006102010785号发明申请里述的甲方式读码头或乙方式读码头),读码时用导电胶条去接触导电码,导电胶条要被按压,对于采用机电式导电码读码头的手持式拾音器,为了减小手的用力,可将手持式拾音器的结构做布置将重量大的元件集中到读码头附近,使其在使用时重量比较集中在读码头的导电胶条上,使用时,由于拾音器的重量起到了一些对导电胶条的按压作用,所以用户使用起来就比较省力。Generally, there is an elastic conductive rubber strip on the electromechanical conductive code reading dock (for example, the A reading dock or the B reading dock mentioned in the invention application No. 2006102010785). To be pressed, for the hand-held pickup that uses the electromechanical conductive code reading dock, in order to reduce the force of the hand, the structure of the handheld pickup can be arranged to concentrate the heavy components near the reading dock, so that it weighs less when in use. It is more concentrated on the conductive rubber strip of the reading dock. When using it, because the weight of the pickup plays a role in pressing the conductive rubber strip, it is less labor-intensive for the user to use.

本发明前面所阐述的拾音器,实际上是可以和其它电子类产品集成在一起的,比如与MP3播放机或与复读机(比如播放磁带的那种)或与手机集成在一起,可有多种方式集成,以与手机集成为例(其它与MP3播放机等的集成与与手机集成类似),其集成方式一般可有:一:手机其内部原来就有用作手机功能控制用的主MCU,仍可用此主MCU兼作为读码发音乐处理用的MCU(当然要求此MCU有足够多的IO脚等来作兼用),那就可把读码头(可以是机电式导电码读码头或光电式读码头等)装在手机上,读码头与主MCU意向连接(主MCU与原来手机的元器件的连接仍可不变),主MCU能读取的存储器里存储有对乐段拼合信息的值进行解码的解码程序及解码数据等,手机原来的按键可兼作读码发音控制(比如调节音量)的按键,如手机原来的主MCU或主MCU外接的发音芯片可播音乐(比如可用音乐合成方式按对拼合信息的值解码得到的解码结果播音乐),那就可仍旧用手机原来的主MCU来播音乐(这种情况主MCU外接的喇叭是发音器件)或用主MCU外接的发音芯片来播音乐(这种情况主MCU外接的带喇叭的发音芯片是发音器件),否则那一般应在主MCU外接音乐芯片来播音乐(这种情况主MCU外接的带喇叭的音乐芯片是发音器件;此音乐芯片比如可用音乐合成方式按对拼合信息的值解码得到的解码结果播音乐),也可在MCU外接其它芯片按音乐合成方式来播音乐声,手机原来其它的电路可不变,这样一来原来手机的主MCU既可完成原来通讯的功能,又可读码并发音乐;二:可在手机机体内,分别布置手机原来的电路和拾音模块的电路,两个电路独立(供电可公用),各有各的作主控用的单片机芯片,各有各的按键,发音的电路也分开,拾音模块这边有自己的发音器件,读码头也装在手机上并与读码发音乐主处理用的MCU(也即拾音模块的MCU)意向连接,这样集成后就同时具备原来通讯的功能和读码发音乐的功能,且这两功能互不干扰;另还可有其它集成的方式。可把这些种集成方式的电子产品(本段以下所说电子产品是指其内部集成有MP3播放机或复读机或手机等模块且可读码播放音乐发音的电子产品)都看成是拾音器,这样的拾音器包括权利要求6述的拾音器的所有技术特征,所以前述任一种集成方式的电子产品(电子产品可看成是拾音器)应属于权利要求6所述的拾音器的范围。(本段所说的拾音模块包括MCU,读码头,发音器件等部件,这些部件也是本段以外述及的拾音器包括的部件,拾音模块在必要时可能还包括存储器)。使用本段前面所述的电子产品拾音器,显然可用于读取有声音乐读物的整页面上的诸个码字中的码字并播放音乐发音,有声音乐读物的整页面上能被某电子产品拾音器读码以用于播音乐的所有码字中的全部或部分码字也可被定义为此有声音乐读物整页面上的诸个码字中的此电子产品拾音器能读的部诸码字,此电子产品拾音器可读取有声音乐读物整页面上的诸个码字中的部诸码字中的数个码字并播音乐。本段所说的各电子产品的形状显然没特别限制。本段各处提到的读码头可以是机电式导电码读码头或光电式读码头等,而且如是与手机集成为电子产品的话,还可以是把手机原来就有的可以读码的器件当成读码头,比如可用手机原来有的照相头来读码,那一般就可把照相头当成读码头(照相机图象处理电路在主MCU内时,一般是照相头连线到主MCU,这种情况照相涉及的读码软件一般放在主MCU内),或把照相头加照相机图象处理集成电路当成读码头(照相机图象处理电路是做成照相机图象处理集成电路时,照相头和此集成电路都在主MCU芯片外,一般是照相头连线到此集成电路且此集成电路连线到主MCU,这种情况照相涉及的读码软件一般可放在照相机图象处理集成电路内,由照相机图象处理集成电路得到读码结果后,再由此集成电路把读码结果传给主MCU),显然以上在述两种照相方式的读码头情况时提到的主MCU也是兼作为读码播音乐主控用的MCU,用照相头读的相应的有声音乐读物整页面上的码字一般是二维条形码(因二维条形码的BIT容量大,所以可在二维条形码上放较长乐段的拼合信息的值,由集成有手机的电子产品拾音器读码字并按码值里的乐段拼合信息的值解码播音乐,当然此电子产品拾音器需有按音乐合成方式播音乐的能力),也可修改照相涉及的读码软件使此种照相方式的读码头可以读导电码或一维条形码或图形码等(甚至可修改照相涉及的读码软件使这些不同码种的码字都可被照相方式的一个读码头分别读);而且还可把一些字符比如一些数据字符,字母字符印刷排列在一起成为字符码,可用此种照相方式的读码头去读这种字符码,当然照相涉及的读码软件比较复杂,因各个字符可分别与不同的二进制值建立对应关系,所以可把不同乐段拼合信息的值作一些转换后用不同的字符组合来表示,即可用这种字符码来表示数个乐段拼合信息的值(或其变换值)等,所以本申请文件所说的有声音乐读物整页面上的码字可以是这种字符码。本段所说的发音器件都是指读码播放音乐用的发音器件,属于本说明书其它段反复提到的拾音器的发音器件的概念。The pickup described above of the present invention can be integrated with other electronic products in fact, such as with MP3 player or with repeater (such as the kind of playing tape) or integrated with mobile phone, there can be multiple Integration with mobile phones, for example (other integration with MP3 players is similar to integration with mobile phones), the integration methods generally include: 1: The main MCU used for mobile phone function control is still inside the mobile phone. This main MCU can also be used as an MCU for code reading and music processing (of course, this MCU is required to have enough IO pins for dual purposes), then you can use the reading terminal (it can be an electromechanical conductive code reading terminal or a photoelectric reading terminal) Dock, etc.) installed on the mobile phone, read the intended connection between the dock and the main MCU (the connection between the main MCU and the original mobile phone components can still be unchanged), and the memory that the main MCU can read stores the value of the segment combination information for decoding The original key of the mobile phone can also be used as the key for code reading and pronunciation control (such as adjusting the volume). For example, the original main MCU of the mobile phone or the pronunciation chip external to the main MCU can play music (for example, the music synthesis method can be used to press the pair The decoded result obtained by combining the value of the information is decoded to play music), then you can still use the original main MCU of the mobile phone to play music (in this case, the external speaker of the main MCU is a sound device) or use the sound chip external to the main MCU to play music (In this case, the sounding chip with a speaker connected to the main MCU is a sounding device), otherwise, the music chip should be connected to the main MCU to play music (in this case, the music chip with a speaker connected to the main MCU is a sounding device; this music For example, the chip can use the music synthesis method to play music according to the decoding result obtained by decoding the value of the combined information), and can also connect other chips outside the MCU to play music sound according to the music synthesis method. The original other circuits of the mobile phone can remain unchanged, so that the original mobile phone The main MCU of the mobile phone can not only complete the original communication function, but also read the code and send music concurrently; two: the original circuit of the mobile phone and the circuit of the pickup module can be arranged separately in the mobile phone body, and the two circuits are independent (the power supply can be shared). There are individual single-chip microcomputer chips for main control, each with its own key, and the pronunciation circuit is also separated. The pickup module has its own pronunciation device, and the reading dock is also installed on the mobile phone and used for the main processing of reading code and sending music. The MCU (that is, the MCU of the pickup module) is intended to be connected, so that after the integration, it will have the original communication function and the function of reading code and sending music, and these two functions do not interfere with each other; there are also other integration methods. Electronic products in these integrated ways (the electronic products mentioned below in this paragraph refer to electronic products that integrate modules such as MP3 players, repeaters or mobile phones and can read codes to play music and pronounce) can be regarded as pickups. Such a sound pickup includes all the technical features of the sound pickup described in claim 6, so any electronic product in an integrated manner (the electronic product can be regarded as a sound pickup) should belong to the scope of the sound pickup described in claim 6. (the sound pick-up module mentioned in this paragraph includes MCU, reads docks, components such as pronunciation devices, and these parts are also the parts that the sound pick-up mentioned outside this paragraph includes, and the sound pickup module may also include memory when necessary). Using the electronic product pickup described earlier in this paragraph can obviously be used to read the codewords in the codewords on the entire page of the audio music book and play the music pronunciation, and the entire page of the audio music book can be read by an electronic product pickup. All or part of codewords in all codewords for playing music can also be defined as part of the codewords that can be read by the pickup of this electronic product among the codewords on the entire page of the audio music book. The electronic product pickup can read several codewords in all codewords in the codewords on the entire page of the audio music book and play music. The shape of each electronic product mentioned in this paragraph is obviously not particularly limited. The reading docks mentioned in this paragraph can be electromechanical conductive code reading docks or photoelectric reading docks, etc., and if they are integrated with mobile phones into electronic products, they can also use the original code-reading devices of mobile phones as reading devices. For example, the original camera head of the mobile phone can be used to read the code, then generally the camera head can be used as a reading port (when the camera image processing circuit is in the main MCU, the camera head is usually connected to the main MCU, in this case the camera The code reading software involved is generally placed in the main MCU), or the camera head plus camera image processing integrated circuit is used as a reading terminal (when the camera image processing circuit is made into a camera image processing integrated circuit, the camera head and the integrated circuit They are all outside the main MCU chip. Generally, the camera head is connected to this integrated circuit and this integrated circuit is connected to the main MCU. In this case, the code reading software involved in photography can generally be placed in the camera image processing integrated circuit, and the After the image processing integrated circuit obtains the code reading result, the code reading result is passed to the main MCU by the integrated circuit), obviously the main MCU mentioned above when reading the dock situation of the above-mentioned two camera modes also doubles as the code reading broadcast For the MCU used for music master control, the code word on the entire page of the corresponding audio music book read by the camera is generally a two-dimensional barcode (because the BIT capacity of the two-dimensional barcode is large, it is possible to put a longer piece of music on the two-dimensional barcode) The value of the combined information is read by the pickup of the electronic product integrated with the mobile phone and decoded to play the music according to the value of the combined information in the code value. The code-reading software involved in photography can be modified so that the photo-reading dock can read conductive codes or one-dimensional barcodes or graphic codes, etc. (even the code-reading software involved in photography can be modified so that code words of these different code types can be photographed A reading dock of the mode reads separately); and some characters such as some data characters, alphabetic characters can be printed and arranged together to form a character code, and the reading dock of this photographic mode can be used to read this character code, of course the reading involved in photography The code software is more complicated, because each character can establish a corresponding relationship with different binary values, so the values of different pieces of combined information can be converted and expressed with different character combinations, that is, this character code can be used to represent numbers. The value (or its conversion value) of a piece of piece information, etc., so the code word on the entire page of the audio music book mentioned in this application document can be this character code. The pronunciation device mentioned in this paragraph all refers to the pronunciation device used for reading codes and playing music, and belongs to the concept of the pronunciation device of the pickup repeatedly mentioned in other paragraphs of this manual.

本说明书所说的音素数据库的记录项,与计算机领域里数据库里的记录(RECORD)是相同概念,一个记录项里一般有多个字段(FIELD),当然放在拾音器里的音素数据库里的各字段的数据格式一般是拾音器的MCU方便使用的数据格式。The record item of the phoneme database mentioned in this manual is the same concept as the record (RECORD) in the database in the computer field. There are generally multiple fields (FIELD) in a record item. Of course, each field in the phoneme database in the pickup The data format of the field is generally a data format conveniently used by the MCU of the pickup.

本申请文件各处出现的‘音组合’是一个完整的词,比如说到‘一个音组合’,那其中‘一个’是修饰‘音组合’的,如说到‘一音组合’,那其中‘一’是修饰‘音组合’的,说到‘两个音组合’,那其中‘两个’是修饰‘音组合’的,说到‘这些音组合’,那其中‘这些’是修饰‘音组合’的。The 'sound combination' that appears everywhere in this application document is a complete word, for example, when it comes to 'one sound combination', then 'one' is a modified 'sound combination', such as when it comes to 'one sound combination', where 'One' modifies 'sound combination', when it comes to 'two sound combinations', then 'two' modifies 'sound combination', and when it comes to 'these sound combinations', then 'these' are modifiers' sound combination'.

本申请文件各处提到的‘数个乐段的拼合信息的值’,其中‘数个’是修饰‘乐段的拼合信息的值’的。本申请文件各处提到的‘数个乐段拼合信息的值’,其中‘数个’是修饰‘乐段拼合信息的值’的。The 'values of the combined information of several music sections' mentioned throughout this application document, wherein 'several' is used to modify the 'value of the combined information of the music sections'. The 'values of combining information of several segments' mentioned throughout this application document, wherein 'several' is used to modify 'the value of combining information of segments'.

在本说明书的叙述中,对任一段落,把此段落的括号外的任一个用于断句的标点符号(句号或逗号或分号或冒号)到此段落的括号外的此断句的标点符号的下一个用于断句的标点符号(句号或逗号或分号或冒号)之间的内容,都定义为一句(即使这两个用于断句的标点符号之间的内容是分句也定义为一句),这两个用于断句的标点符号之间的括号里的内容,也属于这一句的范围,且括号里可有各种标点符号,然后本说明书各处有关‘上一句’,‘本句’等这样的说法涉及的‘句’就是指本段前面定义的‘句’。本段所说的用于断句的标点符号不包括一个语句中几个互相是或关系的词之间的逗号,顿号;本段所说的用于断句的标点符号不包括一个语句中几个互相是与关系的词之间的逗号,顿号。In the narration of this specification, for any paragraph, any punctuation mark (full stop or comma or semicolon or colon) outside the brackets of this paragraph used for sentence punctuation is placed under the punctuation mark of this sentence outside the brackets of this paragraph The content between a punctuation mark used for sentence breaking (period or comma or semicolon or colon) is defined as a sentence (even if the content between these two punctuation marks used for sentence breaking is a sentence, it is also defined as a sentence), The content in the brackets between the two punctuation marks used to break sentences also belongs to the scope of this sentence, and there can be various punctuation marks in the brackets. The 'sentence' involved in this statement refers to the 'sentence' defined earlier in this paragraph. The punctuation marks used for sentence breaks mentioned in this paragraph do not include commas and commas between several words that are or are related to each other in a sentence; the punctuation marks used for sentence breaks mentioned in this paragraph do not include several Each is a comma and a comma between words related to each other.

如不能从提及处的上下文推出有他意,本说明书各处提及的其前面没用‘乐段’来修饰的‘谱数据’,默认是指乐段的谱数据。If it cannot be deduced from the context of the mention that there is other meaning, the 'spectral data' that is not modified by the 'music section' mentioned in this manual refers to the spectral data of the music section by default.

有关本申请文件所说的多个乐段拼合信息的值的意思,举个例子:比如说3个乐段拼合信息的值,就是指乐段1拼合信息的值和乐段2拼合信息的值和乐段3拼合信息的值;2个乐段拼合信息的值也按此类推理解。Regarding the meaning of the value of the combined information of multiple sections mentioned in this application document, for example: for example, the value of the combined information of 3 sections refers to the value of the combined information of Section 1 and the value of the combined information of Section 2 The value of the combined information with the section 3; the value of the combined information of the 2 sections is also understood by analogy.

如提及时无特别指明有他意,本申请文件各处所说的拾音器读码后播音乐,默认是拾音器读数个码字后,从这数个码字得到一个拼合信息的值并对这一个拼合信息的值解码播音乐,或是从这数个码字得到多个乐段拼合信息的值,并对这多个拼合信息的值各自解码播各自发音,其它类似说法也依此类推理解。If there is no special indication of other intentions when mentioned, the pickup reads the code and plays music after reading the code. The default is that after the pickup reads a number of code words, a value of a combined information is obtained from these code words and this combined information The value decodes and broadcasts music, or obtains the value of a plurality of pieces of segmental combination information from these several codewords, and decodes the value of these plurality of combination information and broadcasts respective pronunciation respectively, and other similar statements can also be understood by analogy.

如提及时无特别指明有他意,本申请文件各处所说的按乐段拼合信息的值播音乐,默认是根据乐段拼合信息的值解码播音乐,其它类似说法也依此类推理解。If there is no special indication of other intentions when mentioned, the value of the information in this application to play music according to the value of the segment combination information is by default to decode and play music according to the value of the segment combination information, and other similar statements can be understood in a similar manner.

本申请文件所说的拾音器按播谱数据的方式来播音乐,谱数据的具体格式并无特别限制,根据不同的用以播谱数据的MCU或音乐芯片的型号,谱数据的具体格式可能会不同;本申请文件所说的拾音器可按播谱数据的方式来播音乐,与说拾音器可按谱数据来播放音乐是相同的意思。本段各处所说的谱数据是对乐段拼合信息的值解码得到的乐段谱数据。The pickup mentioned in this application document plays music by broadcasting spectrum data, and the specific format of spectrum data is not particularly limited. According to different models of MCU or music chip used for broadcasting spectrum data, the specific format of spectrum data may vary. Different; the said pick-up in this application document can play music according to the mode of broadcast spectrum data, and it is the same meaning as saying that the pick-up can play music according to spectrum data. The spectrum data referred to in this paragraph is the music spectrum data obtained by decoding the value of the music piece combination information.

本申请文件各处提到的有声音乐读物整页面上印的乐谱,可以是简谱,五线谱,吉他谱,电子琴谱等。The music scores printed on the entire page of the audio music books mentioned in this application document can be numbered notation, staff notation, guitar notation, electronic piano notation, etc.

在权利要求书中的各权利要求项里,每两个分号之间的文字内容是一个完整技术特征的描述,一个权利要求项里各完整技术特征之间是‘且’的关系,在两个分号之间的文字可能被数个‘,或者,’分成多个部分,每个部分表示此两个分号之间的文字描述的完整技术特征里或的一个选择(但有例外情况,见下段述),每个部分里如再需用‘或’或‘或者’的话那‘或’或‘或者’的前后就无逗号;但为了阅读方便,在权利要求书中有分段,一个段落里可能没分号(段落最后也没分号),这种情况仍是以每两个分号之间的文字内容为一个完整技术特征的描述(即一个完整技术特征可能包括几个段落的内容),如一个段落的文字被几个分号隔成3部分,那每部分的文字内容就是一个完整技术特征的描述,那此段落就包括3个完整技术特征的描述,如一个段落的文字没被分号隔开,那此段落就是1个完整技术特征的描述。(但作为独立权利要求的权利要求项1,6中的前叙部分不适合本段所述)。显然,权利要求书里的每个冒号那里也相当于有一个本段所说的分号,权利要求书里的每个句号那里也相当于有一个本段所说的分号。In each claim item in the claims, the text between each two semicolons is a description of a complete technical feature, and the relationship between the complete technical features in a claim is an 'and' relationship. The text between the two semicolons may be divided into multiple parts by several ', or,', and each part represents one of the complete technical features described in the text between the two semicolons or a selection (but there are exceptions, See the next paragraph), if you need to use 'or' or 'or' in each part, there will be no commas before and after 'or' or 'or'; but for the convenience of reading, there are paragraphs in the claims, a There may be no semicolon in the paragraph (there is no semicolon at the end of the paragraph). In this case, the text content between every two semicolons is still the description of a complete technical feature (that is, a complete technical feature may include several paragraphs. Content), if the text of a paragraph is divided into 3 parts by several semicolons, then the text content of each part is a description of a complete technical feature, then this paragraph includes a description of 3 complete technical features, such as a paragraph of text If they are not separated by semicolons, then this paragraph is a description of a complete technical feature. (but the preceding part in claim 1 and 6 which are independent claims is not suitable for the description in this paragraph). Obviously, every colon in the claims is equivalent to a semicolon mentioned in this paragraph, and every period in the claims is also equivalent to a semicolon mentioned in this paragraph.

另外在权利要求书中有类似这样的说法:对于某事物(事物是名词),怎样1,或者,怎样2,这样的说法是指怎样1与怎样2是或的关系,即某事物可以是怎样1或某事物可以是怎样2;举个例比如权利要求书中提到的‘对于所述诸个码字中的某一码字,此码字的载体可以是所述读物的一个页,此码字的载体平面是所述读物的一个页面,或者,此码字的载体可以是一小块平面材料,此码字的载体平面是平面材料的表面,此平面材料的表面上有此码字,此平面材料是粘贴在所述读物的一个页面上的;’就是这样;如还有类似本段所说情况(即比如说:对于某事物,怎样1,或者,怎样2,怎样3,或者,怎样4),也按本段所说类推理解。In addition, there is a statement like this in the claims: For something (thing is a noun), how 1, or how 2, this statement refers to the relationship between how 1 and how 2, that is, how something can be 1 or how something can be 2; for example, as mentioned in the claims 'for a certain code word among the code words, the carrier of this code word can be a page of the reading material, here The carrier plane of the codeword is a page of the reading material, or the carrier of the codeword can be a small piece of planar material, the carrier plane of the codeword is the surface of the planar material, and the codeword is arranged on the surface of the planar material , this planar material is pasted on a page of the reading material;' that's it; , how 4), also understand by analogy said in this paragraph.

本申请文件各处所提的码字携带某值的含义就是指可从码字得到某值(一般就是把此值或此值的变换值放入此码字里),本段所说的码字是指单个码字。The meaning of the code word mentioned in this application document carrying a certain value refers to that a certain value can be obtained from the code word (generally, this value or the transformation value of this value is put into this code word), and the code word mentioned in this paragraph A word refers to a single codeword.

按本说明书的内容可知,本申请文件各处所说的在有声音乐读物整页面上的诸个码字中的各码字的码值表示的信息是与拼合信息的值有关的信息,所以有声音乐读物的整页面上如还有另外的码字,但如其码值表示的信息不是与拼合信息的值有关的信息,此码字就不属于本申请文件各处所说的有声音乐读物整页面上的诸个码字的范围;比如,在有些有声音乐读物的整页面上可能有另一些特殊的码字,比如码字携带的只是暂停的控制信息,如拾音器读取此码字后,就把拾音器正在播的音乐暂停下来,用拾音器再读一次此码字又从暂停处继续播,此种码字的码值表示的信息不是与拼合信息的值有关的信息,所以此种码字不属于本申请文件各处所说的有声音乐读物整页面上的诸个码字的范围。According to the content of this specification, it can be seen that the information represented by the code value of each code word in the code words on the entire page of the audio music book mentioned in this application document is the information related to the value of the combined information, so the audio music If there are other code words on the entire page of the reading material, but the information represented by the code value is not information related to the value of the combined information, this code word does not belong to the entire page of the audio music reading material mentioned in this application document. The range of various codewords; for example, there may be other special codewords on the entire page of some audio music books, such as codewords that only carry the control information of the pause, such as after the pickup reads the codeword, the pickup The music being played is paused, and the code word is read again with a pickup and continues to play from the pause point. The information represented by the code value of this code word is not information related to the value of the combined information, so this code word does not belong to this code word. The scope of code words on the entire page of the audio music book mentioned in the application documents.

本申请文件各处所提及的‘读物’概念(其前面没加‘有声音乐’4字来修饰的)与‘有声音乐读物’概念是有点区别的,读物是指常见的书籍,乐谱书,杂志等本身(读物自身范围内是与码字无关的),如在读物(如书籍,乐谱书,杂志等)整页面上形成(比如印刷)有一些与拼合信息的值有关的码字,那这读物整页面上的所有这些与拼合信息的值有关的码字与读物就一起形成有声音乐读物,此读物也就是作为此有声音乐读物主体的读物。本段说的码字可以是直接把作为有声音乐读物主体的读物的页面当成载体平面的码字,也可以是如本说明书前面述的小块平面材料的表面上有的码字,且这一小块平面材料是粘贴在作为有声音乐读物主体的读物的页面上的。The concept of "reading materials" mentioned in this application document (without adding the word "sound music" in front of it to modify) is somewhat different from the concept of "audio music reading materials". Reading materials refer to common books, music score books, Magazine etc. itself (the reading material itself has nothing to do with the codeword), if there are some codewords related to the value of the combined information formed (such as printing) on the entire page of the reading material (such as books, music books, magazines, etc.), then All these codewords related to the value of the spliced information on the whole page of the reading material and the reading material together form the audio music reading material, and this reading material is also the reading material as the main body of the audio music reading material. The code word mentioned in this paragraph can be the code word that directly regards the page of the reading material as the main body of the audio music book as the carrier plane, or it can be the code word on the surface of the small piece of plane material as mentioned above in this specification, and this The small piece of flat material is pasted on the page of the reading that is the main body of the audio music reading.

上段说的在读物整页面上形成有码字,只是有声音乐读物产品的最终形式,实际上生产时到底采用先在数个大开张的印刷纸上形成(比如印刷)码字再印图文(乐谱)再装订成有声音乐读物,先在数个大开张的印刷纸上印图文再形成码字再装订成有声音乐读物(本句和上句说的装订是指把印有图文和形成有码字的数个大开张的印刷纸各自裁减成多个小开张的,并装订起来),先装订成读物形式再印图文再形成码字以形成有声音乐读物等这些方式中的哪一种是没关系的(本句说的装订是指把数个还没印图文也没形成码字的大开张的印刷纸各自裁减成多个小开张的,并装订起来),这些方式互相是等同替换。本段说的码字是与拼合信息的值有关的码字。The code words mentioned in the previous paragraph are formed on the entire page of the reading material, which is only the final form of the audio music book product. In fact, the code word is first formed (such as printed) on several large sheets of printing paper and then the graphic text is printed ( music score) and then bound into audio music books, first print pictures and texts on several large sheets of printing paper and then form code words and then bind them into audio music books (binding in this sentence and the previous sentence refers to printing pictures and texts and forming Several large sheets of printed paper with codes are cut into multiple small sheets and bound together), first bound into the form of reading materials, then printed with pictures and texts, and then form codes to form audio music books, etc. Which of these methods It doesn't matter (the binding mentioned in this sentence refers to cutting several large sheets of printed paper that have not yet printed pictures and texts and formed code words into multiple small sheets, and then bound them together). These methods are equivalent to each other. replace. The code word mentioned in this paragraph is the code word related to the value of the spliced information.

如提及时无特别指明有他义,本申请文件各处提及的‘有声音乐读物的整页面’默认包括有声音乐读物的所有页面(对于只有一个页面的有声音乐读物,其所有页面就是指这一个页面),如提及时无特别指明有他义,本申请文件各处提及的‘读物的整页面’(‘读物’前面没加‘有声音乐’的),默认包括读物的所有页面(对于只有一个页面的读物,其所有页面就是指这一个页面)。如提及时无特别指明有他义,本申请文件各处提及的‘有声音乐读物的页面’,默认是指有声音乐读物的某个具体的页面;如提及时无特别指明有他义,本申请文件各处提及的‘读物的页面’(‘读物’前面没加‘有声音乐’的),默认是指读物的某个具体的页面;如提及时无特别指明有他义,本申请文件各处提及的‘读物的页’(‘读物’前面没加‘有声音乐’的),默认是指读物的某个具体的页;如提及时无特别指明有他义,本申请文件各处提及的‘有声音乐读物的页’,默认是指有声音乐读物的某个具体的页。If there is no special indication of other meanings when mentioned, the "full page of the audio music book" mentioned in this application document includes all pages of the audio music book by default (for an audio music book with only one page, all pages refer to this One page), if there is no other meaning when it is mentioned, the "whole page of the reading" mentioned in this application document (without "audio music" in front of the "reading"), includes all pages of the reading by default (for For a reading with only one page, all its pages refer to this one page). If no other meaning is specified when mentioned, the "page of the audio music book" mentioned in this application document refers to a specific page of the audio music book by default; if no other meaning is specified when mentioned, this The 'reading page' mentioned in the application documents (without 'sound music' in front of the 'reading') refers to a specific page of the reading by default; The 'pages of reading materials' mentioned in various places (without 'audio music' in front of 'reading materials') refer to a specific page of reading materials by default; The 'page of the audiobook' referred to refers to a specific page of the audiobook by default.

对于有声音乐读物的任一页,也可说是作为此有声音乐读物主体的读物的一页,即有声音乐读物的这一页与读物的这一页是指物质上完全相同的东西,比如一有声音乐读物的一个页,是在一张纸的正面印有‘135’(正面没再印其它图文),反面印有‘246’(反面没再印其它图文),正面还印有一个条形码,那此有声音乐读物的这一页就包括这张纸和印的‘135’和‘246’,作为此有声音乐读物主体的读物的页也包括这张纸和印的‘135’和‘246’。关于本段所说的物质上完全相同的东西,比如对一个放在桌上的苹果,那说‘这苹果’与说‘放在桌上的这苹果’就是指物质上完全相同的东西(都是指这个3两重的苹果)。For any page of the audio music book, it can also be said to be a page of the book as the main body of the audio music book, that is, this page of the audio music book and this page of the book refer to the same thing in substance, such as a A page of an audio music book is printed with '135' on the front of a piece of paper (no other pictures and texts printed on the front), '246' printed on the back (no other pictures and texts printed on the back), and a bar code, then this page of the audio music book includes this paper and the printed '135' and '246', and the page of the book that is the main body of the audio music book also includes this paper and the printed '135' and '135' and '246' printed 246'. Regarding the materially identical things mentioned in this paragraph, for example, for an apple on the table, saying 'this apple' and 'this apple on the table' refer to the materially identical things (both refers to the 3 double apples).

由上段可推出:对于有声音乐读物的任一页面,也可说是作为此有声音乐读物主体的读物的一页面(或者说对于作为有声音乐读物主体的读物的任一页面,也可说是此有声音乐读物的一页面),即有声音乐读物的这一页面与读物的这一页面是完全相同所指;比如一有声音乐读物的一个页,是在一张纸的正面印有‘135’(正面没再印其它图文),反面印有‘246’(反面没再印其它图文),正面还印有一个条形码,此有声音乐读物的这一页就包括这张纸和印的‘135’和‘246’,此有声音乐读物的这一页也是作为此有声音乐读物主体的读物的一页,所以这页的正面就是此有声音乐读物的一页面,这页的正面也是作为此有声音乐读物主体的读物的一页面。由此再可推出有声音乐读物的整页面与作为此有声音乐读物主体的读物的整页面是完全相同所指;比如一有声音乐读物只含两个页(页1和页2),页1是在一张纸的正面印有‘135’(正面没再印其它图文),反面印有‘246’(反面没再印其它图文),正面还印有一个条形码,那此有声音乐读物的页1就包括这张纸和印的‘135’和‘246’,页2是在另一张纸的正面印有‘356’(正面没再印其它图文),反面印有‘467’(反面没再印其它图文),正面和反面还各印有一个导电码,那此有声音乐读物的页2就包括这张纸和印的‘356’和‘467’,那么此有声音乐读物的整页面就包括此有声音乐读物的页1的正面和此有声音乐读物的页1的反面和此有声音乐读物的页2的正面和此有声音乐读物的页2的反面,作为此有声音乐读物主体的读物的整页面包括此读物的页1的正面和此读物的页1的反面和此读物的页2的正面和此读物的页2的反面,而有声音乐读物的页1与读物的页1等同,有声音乐读物的页2与读物的页2等同,所以此有声音乐读物的整页面与作为此有声音乐读物主体的读物的整页面是完全相同所指。It can be deduced by the last paragraph: for any page of the audio music reading material, it can also be said to be a page of the reading material as the main body of the audio music reading material (or for any page of the reading material as the audio music reading material main body, it can also be said to be this page). A page of an audio music book), that is, this page of an audio music book is exactly the same as this page of the book; for example, a page of an audio music book is printed with '135' ( There are no other pictures and texts printed on the front), '246' is printed on the back (no other pictures and texts are printed on the back), and a barcode is printed on the front. This page of this audio book includes this paper and the printed '135 ' and '246', this page of this audio music book is also a page of the book that is the main body of this audio music book, so the front of this page is a page of this audio music book, and the front of this page is also a page of this audio music book A page of the reading in the main body of the reading. Can deduce that the whole page of audiobook and the whole page of the reading as this audiomusic main body are exactly the same reference; For example, an audiobook only contains two pages (page 1 and page 2), page 1 is There is '135' printed on the front of a piece of paper (no other pictures and texts printed on the front), '246' printed on the back (no other pictures and texts printed on the back), and a barcode is printed on the front, so the audiobook Page 1 includes this paper and the printed '135' and '246', and page 2 is printed with '356' on the front of another sheet of paper (no other pictures and texts are printed on the front), and '467' printed on the back ( There are no other pictures and texts printed on the back), and a conductive code is printed on the front and back respectively, then page 2 of the audio music book includes this paper and printed '356' and '467', then the audio music book's The entire page includes the front of page 1 of the audio music book, the back of page 1 of the audio music book, the front of page 2 of the audio music book and the back of page 2 of the audio music book, as the main body of the audio music book The entire page of the reading material includes the front of page 1 of the reading material and the reverse side of page 1 of the reading material and the front side of page 2 of the reading material and the reverse side of page 2 of the reading material, while page 1 of the audio music book Equally, page 2 of the audio music book is equivalent to page 2 of the reading material, so the entire page of the audio music book is exactly the same as the entire page of the reading material as the main body of the audio music book.

本申请文件所说的读物自身范围内是与码字无关的,是指读物自身范围内无码字,比如上段说的含两个页的有声音乐读物,作为此有声音乐读物主体的读物,包括组成页1的那张纸和印的‘135’和‘246’,以及组成页2的那张纸和印的‘356’和‘467’,显然此读物包括的东西的范围内无码字,也即此读物自身范围内是与码字无关的。The reading material mentioned in this application document has nothing to do with codewords within the scope of the reading material itself. The sheet of paper on page 1 and the printed '135' and '246', and the sheet of paper forming page 2 and the printed '356' and '467', obviously there is no codeword in the range of things included in this reading material, that is, Within the scope of this reading itself, it has nothing to do with codewords.

对于任一个码字,如处于一本有声音乐读物的页面上,那此码字与作为此有声音乐读物主体的读物是互相独立的关系,而且此码字既不属于此读物的页面,也不属于此读物的页,也不属于此有声音乐读物的页面,也不属于此有声音乐读物的页,但此码字是属于此有声音乐读物的(此码字可以是直接把作为此有声音乐读物主体的读物的页面当载体平面的码字,也可以是本说明书前面述的在小块平面材料上的码字且这小块平面材料是粘贴在作为此有声音乐读物主体的读物的页面上的)。而作为有声音乐读物主体的读物的页面上印刷的图文(比如印刷的乐谱),显然既属于此读物,也属于此有声音乐读物(此读物是作为此有声音乐读物的主体,此读物属于此有声音乐读物),也属于此读物的页,也属于此有声音乐读物的页。作为某有声音乐读物主体的读物的整页面上的诸个码字与此有声音乐读物的整页面上的诸个码字是相同所指,且这里说的诸个码字属于此有声音乐读物但不属于此有声音乐读物的整页面也不属于作为此有声音乐读物主体的读物也不属于作为此有声音乐读物主体的读物的整页面。对于某有声音乐读物的整页面上的诸个码字中的部诸码字,也可说成作为此有声音乐读物主体的读物的整页面上的诸个码字中的部诸码字;对于作为某有声音乐读物主体的读物的整页面上的诸个码字中的部诸码字,也可说成此有声音乐读物的整页面上的诸个码字中的部诸码字。For any code word, if it is on the page of an audio music book, then this code word and the reading material as the main body of the audio music book are independent of each other, and this code word neither belongs to the page of this book nor The page that belongs to this reading material does not belong to the page of this audio music book, nor does it belong to the page of this audio music book, but this code word belongs to this audio music book (this code word can be directly used as this audio music book The page of the reading material of the main body is used as the code word of the carrier plane, or it can be the code word on the small piece of plane material mentioned above in this specification, and this small piece of plane material is pasted on the page of the reading material as the main body of the audio music book ). And the pictures and texts (such as printed music scores) printed on the pages of the reading material as the main body of the audio music reading material obviously belong to both this reading material and the audio music reading material (this reading material is the main body of this audio music reading material, and this reading material belongs to this Audiobook), also belongs to this book's page, also belongs to this audiobook's page. The codewords on the entire page of the reading material as the main body of a certain audio music reading are the same reference as the codewords on the entire page of the audio music reading, and the codewords mentioned here belong to the audio music reading but Whole pages that do not belong to the audiobook do not belong to the book that is the subject of the audiobook nor do they belong to the book that is the subject of the audiobook. For the part code words in the code words on the whole page of a certain audio music reading material, it can also be said to be the part code words in the code words on the whole page of the reading material as the audio music reading material main body; Part code words in the code words on the whole page of the reading material as a certain audio music reading material main body can also be said to be part code words in the code words on the whole page of this audio music reading material.

以上7段提到的码字都是与拼合信息的值有关的码字。以上7段所述是为了本申请文件的表达方便,都是显而易见的常识。以上7段引号里的数字是指简谱的音符。The codewords mentioned in the above 7 paragraphs are all codewords related to the value of the spliced information. The above 7 paragraphs are for the convenience of expressing this application document and are all obvious common sense. The numbers in the quotation marks of the above 7 paragraphs refer to the musical notes in the numbered notation.

本申请文件所说的书,书籍,一般是指音乐教材,比如乐理教材等。The books mentioned in this application document generally refer to music teaching materials, such as music theory teaching materials.

本申请文件所说的乐谱书,是指其各页上主要是印有乐谱的书,比如《车尔尼钢琴练习曲299》,《巴赫平均律钢琴曲集》,《贝多芬钢琴奏鸣曲集》,《民谣吉他曲集》等。The music score book mentioned in this application document refers to the book whose pages are mainly printed with music scores, such as "Czerny Piano Etudes 299", "Bach's Well-tempered Piano Collection", "Beethoven Piano Sonata Collection" , "Folk Guitar Collection" and so on.

本申请文件各处所说的与拼合信息的值有关的码字,是指其码值所表示的信息是与拼合信息的值有关的信息的的码字。The code word related to the value of the combined information referred to throughout this application document refers to the code word whose code value represents information related to the value of the combined information.

在权利要求书各处所说的读物(其前面没加‘有声音乐’来修饰的),是指作为有声音乐读物主体的读物,而不是指有声音乐读物。The reading materials mentioned in various parts of the claims (modified without adding "sound music" in front of it) refer to the reading materials that are the main body of the audio music reading materials, not the audio music reading materials.

本申请文件各处所说的码字被拾音器读取,默认是指每次一个码字被一个拾音器的一个读码头读取(每个拾音器只有一个读码头);如有每次多个码字被一个拾音器的多个读码头读取,那这种每次多个码字被一个拾音器的多个读码头读取的情况与每次一个码字被一个拾音器的一个读码头读取是等同替换,因这多个读码头合起来可看成是一个面积很大的读码头,这多个码字合起来可看成是一个面积很大的码字。(本段提到的‘多个码字’中的‘多’与‘多个读码头’中的‘多’数量可相同或不同,因有比如一个读码头可一次读几个码字或一个码字一次被几个读码头读等情况)。如提及时无特别指明有他义,本申请文件各处提及的读码头显然默认是指单个的读码头。如提及时无特别指明有他义,本申请文件各处提及的拾音器显然默认是指单个的拾音器。The codewords mentioned in this application document are read by the pickup by default, which means that each codeword is read by a reading terminal of a pickup (each pickup has only one reading terminal) by default; Multiple reading docks of a pickup are read, and the situation that multiple codewords are read by multiple reading docks of a pickup each time is equivalent to one codeword being read by one reading dock of a pickup every time. Because of these multiple reading docks can be regarded as a very large area of reading docks together, these multiple code words can be regarded as a very large code word of an area together. (the 'many' in the 'multiple codewords' mentioned in this paragraph can be the same or different from the 'many' in the 'multiple reading docks', because for example, one reading dock can read several codewords or one The code word is once read by several reading terminals, etc.). If there is no special meaning when mentioning, the reading terminal mentioned in this application document obviously refers to a single reading terminal by default. If there is no other meaning specified when mentioning, the pickups mentioned in this application document obviously refer to a single pickup by default.

在任一权利要求项中出现的‘前述’都是指本权利要求项内的前面所述。'Aforementioned' appearing in any claim refers to the preceding statement in this claim.

在权利要求书中各处提及的‘前述部诸码字’中的‘前述’与‘部诸码字’是默认搭配,即只要‘前述’后面出现‘部诸码字’,那此‘前述’就是只修饰这个‘部诸码字’的,而不修饰‘部诸码字’后面跟的其它内容,‘前述部诸码字’是指本权利要求项内前面述及的‘部诸码字’;与此类似,权利要求书中各处提及的‘前述诸个码字’,‘前述拾音器’,‘前述有声音乐读物’,‘前述读物’,‘前述读码头’,‘前述发音器件’,‘前述单片机芯片’,‘前述读码头’中的各提法中的‘前述’与此提法中此‘前述’后面的部分(比如‘前述’与‘拾音器’)都是默认搭配,即这些提法中的各提法中的‘前述’词是只用于修饰此提法中此‘前述’后面的部分(比如‘前述拾音器’中的‘前述’是只用于修饰此‘前述’后面的‘拾音器’的)。"Aforementioned" and "all codewords" in the "aforementioned codewords" mentioned everywhere in the claims are collocations by default, that is, as long as "partial codewords" appear behind the "aforementioned", then this' Aforesaid ' just only modifies this ' all code words ', and does not modify other content that follows behind ' all code words ', and ' all code words of the foregoing parts' refers to the ' all code words mentioned above' in the claims codeword'; similarly, the 'aforementioned codewords', 'the aforesaid pickup', 'the aforesaid audio music book', 'the aforesaid reading material', 'the aforesaid reading terminal', 'the aforesaid Pronunciation device', 'the aforementioned single-chip microcomputer chip', the 'forementioned' in each formulation of 'the aforementioned reading dock' and the part behind the 'forementioned' in this formulation (such as 'the aforementioned' and 'pickup') are all default Collocation, that is, the word "previous" in each of these formulations is only used to modify the part behind the "previous" in this formulation (for example, the "previous" in "previous pickup" is only used to modify this 'Pickup' behind 'predecessor').

如提及时无特别指明有他义,本申请文件各处提及的乐段默认是指单个乐段;如提及时无特别指明有他义,本申请文件各处提及的播放乐段发音默认是指播放单个乐段发音。If there is no special meaning when it is mentioned, the paragraphs mentioned in this application document refer to a single paragraph by default; if there is no special meaning when it is mentioned, the playback paragraphs mentioned in this application file are pronounced by default Refers to playing a single section pronunciation.

本申请文件各处所提到的‘数个’是指‘一个或多个’。References to 'several' throughout this application mean 'one or more'.

本申请文件说一个码字的码值是与拼合信息的值有关的,与说此码字的码值表示的信息是与拼合信息的值有关的信息的的说法是等价的。In this application document, saying that the code value of a codeword is related to the value of the combined information is equivalent to saying that the information represented by the code value of the codeword is related to the value of the combined information.

本申请文件所说的其前面没带数量词的‘乐段拼合信息的值’是默认指一个乐段拼合信息的值,本申请文件所说的其前面没带数量词的‘乐段谱数据’是默认指一个乐段谱数据。The "value of music segment information" without quantifiers in front of this application document refers to the value of a musical segment combination information by default, and the "music spectrum data" without quantifiers in front of it mentioned in this application document is By default it refers to a piece of score data.

本申请文件各处所提及的‘部分所述拾音器涉及的程序’中的‘部分’是用于修饰‘所述拾音器涉及的程序’的,‘部分所述拾音器涉及的程序’是指拾音器涉及使用的所有程序的一部分;本申请文件各处所提及的‘部分所述拾音器涉及的数据’中的‘部分’是用于修饰‘所述拾音器涉及的数据’的,‘部分所述拾音器涉及的数据’是指拾音器涉及使用的所有数据的一部分。The "part" in the "part of the program related to the pickup" mentioned throughout this application is used to modify the "program related to the pickup", and the "part of the program related to the pickup" means that the pickup is related to Part of all programs used; 'part' of 'part of said pickup-related data' mentioned throughout this document is used to modify 'said pickup-related data', 'part of said pickup-related 'Data' refers to all data part of the pickup involved in use.

如提及时无特别指明有他义,本申请文件各处提及的码字默认是指有声音乐读物的整页面上的诸个码字中的码字。如提及时无特别指明有他义,本申请文件各处提及的条形码,导电码,图形码或磁码默认也是属于有声音乐读物的整页面上的诸个码字中的码字。If there is no special indication of other meanings when mentioned, the codewords mentioned in this application document refer to the codewords in the codewords on the entire page of the audio music book by default. If there is no other meaning when mentioned, the barcodes, conductive codes, graphic codes or magnetic codes mentioned in this application document are by default also codewords among the codewords on the entire page of audio music books.

如提及时无特别指明有确切含义且不能从提及处的上下文推出有确切含义,本申请文件各处提及的码字默认可以是导电码,条形码,图形码或磁码等,(而如提及处的上下文已指明码字是条形码,即能从提及处的上下文推出有确切含义,即码字是条形码)。If the exact meaning is not specified when mentioned and the exact meaning cannot be deduced from the context of the mention, the codewords mentioned in this application document can be conductive codes, barcodes, graphic codes or magnetic codes by default, (and such as The context of the mention has indicated that the code word is a barcode, that is, the exact meaning can be deduced from the context of the mention, that is, the code word is a bar code).

本申请文件各处出现的‘某’,如其是修饰其后面名词的而且此‘某’后面紧跟着没出现数量词,那这个‘某’默认是指‘某一’的意思,这也是常识;比如说‘某乐段’就是指‘某一乐段’。本申请文件各处出现的‘此’,如其是修饰其后面名词的而且此‘此’后面紧跟着没出现数量词,那这个‘此’默认是指‘此一’的意思,这也是常识;比如说‘此乐段’就是指‘此一乐段’。If the "some" appearing everywhere in this application document modifies the nouns behind it and there is no quantifier immediately after the "some", then this "some" means "some" by default, which is also common sense; For example, 'a certain section' means 'a certain section'. The 'he' appearing everywhere in this application document, if it modifies the nouns behind it and there is no quantifier immediately after the 'he', then the 'he' means 'this' by default, which is also common sense; For example, 'this section' means 'this section'.

如提及时无特别指明有他义且不能从提及处的上下文推出有他义,本申请文件各处提及的其前面没带数量词的‘拾音器’默认是指一个拾音器。If there is no other meaning specified when mentioning and it cannot be inferred from the context of the mention, the 'sound picker' mentioned throughout this application document without a quantifier in front of it refers to a pick-up by default.

在本申请文件中多处提及的‘部诸码字中的某一码字’,是指有声音乐读物的整页面上的诸个码字中的部诸码字中的某一个码字,比如部诸码字中含有3个码字(码字1,码字2和码字3),那部诸码字中的某一码字就是指码字1或码字2或码字3。同理,在本申请文件中多处提及的‘诸个码字中的某一码字’,是指有声音乐读物的整页面上的诸个码字中的某一个码字,比如诸个码字中含有4个码字(码字1,码字2,码字3和码字4),那诸个码字中的某一码字就是指码字1或码字2或码字3或码字4。"A certain codeword in all codewords" mentioned in many places in this application document refers to a certain codeword among all codewords in the codewords on the entire page of the audio music book, Contain 3 codewords (codeword 1, codeword 2 and codeword 3) in all codewords such as part, a certain codeword in all codewords of that part refers to codeword 1 or codeword 2 or codeword 3 exactly. Similarly, "a certain codeword among the codewords" mentioned in many places in this application document refers to a certain codeword among the various codewords on the entire page of the audio music book, such as the various codewords Contain 4 codewords (codeword 1, codeword 2, codeword 3 and codeword 4) in the codeword, a certain codeword in that all codewords refers to codeword 1 or codeword 2 or codeword 3 or codeword 4.

在权利要求书中有多处提及的‘所述有声音乐读物的任一页面’,是指有声音乐读物的整页面的任一个页面,比如有声音乐读物的整页面含有4个页面(页面1,页面2,页面3和页面4),那有声音乐读物的任一页面就是指页面1或页面2或页面3或页面4;本说明书中多处提及的‘有声音乐读物的任一页面’也是按这样理解。The "any page of the audio music book" mentioned in many places in the claims refers to any page of the entire page of the audio music book. For example, the entire page of the audio music book contains 4 pages (page 1 , page 2, page 3 and page 4), any page of the audio music book refers to page 1 or page 2 or page 3 or page 4; "any page of the audio music book" mentioned in many places in this manual It is also understood in this way.

在权利要求书中各处提及的‘所述部诸码字中的某一码字’中的‘所述’是只用于修饰此‘所述’后面的‘部诸码字’的。在权利要求书中各处提及的‘所述部诸码字中的任一码字’中的‘所述’是只用于修饰此‘所述’后面的‘部诸码字’的。在权利要求书中各处提及的‘所述诸个码字中的某一码字’中的‘所述’是只用于修饰此‘所述’后面的‘诸个码字’的。在权利要求书中各处提及的‘所述诸个码字中的任一码字’中的‘所述’是只用于修饰此‘所述’后面的‘诸个码字’的。在权利要求书中各处提及的‘所述的一个码字的码值’中的‘所述’是只用于修饰此‘所述’后面的‘一个码字’的。The 'said' in 'a certain codeword in said codewords' mentioned everywhere in the claims is only used to modify the 'said codewords' behind this 'said'. The 'said' in 'any codewords in said codewords' mentioned everywhere in the claims is only used to modify 'the codewords' behind this 'said'. The 'said' in 'a certain codeword among the said codewords' mentioned in the claims is only used to modify the 'codewords' behind this 'said'. The 'said' in 'any of the codewords' mentioned in the claims is only used to modify the 'codewords' following the 'said'. The 'said' in 'the code value of the said one codeword' mentioned in the claims is only used to modify the 'one codeword' following the 'said'.

本申请文件各处所提及的码字的值就是码字的码值。The values of the codewords mentioned throughout the application documents are the code values of the codewords.

如提及时没特别指明有他义,本申请文件各处所提及的其前面无数量词的‘乐段拼合信息的值’(或‘表示乐段发音的拼合信息的值’)默认是指一个乐段的拼合信息的值。If there is no other meaning specified when mentioning, the "value of the combination information of the music segment" (or the value of the combination information indicating the pronunciation of the music segment) mentioned everywhere in this application document without quantifiers refers to a default The value of the piece's flattening information.

本申请文件各处所说的有声音乐读物的整页面上有某种码字,是指在整页面上的全部页面上都有这种码字或在整页面上的部分页面上(甚至只有一个页面上)有这种码字。Some kind of code word is arranged on the entire page of audiobook that this application document says everywhere, refers to that this code word is arranged on all pages on the whole page or on the partial page on the whole page (even only has one page) above) has this codeword.

如提及时无特别指明有他义也不能从提及处的上下文推出有他义,本申请文件各处所提及的存储器,默认是指在拾音器使用时,能被作为此拾音器的主处理芯片的单片机芯片读取使用的存储器。If there is no special indication of other meanings when mentioned, and other meanings cannot be deduced from the context of the mention, the memory mentioned in this application document refers to the main processing chip that can be used as the main processing chip of the pickup when it is used by default. The microcontroller chip reads the memory used.

本申请文件各处提及的把一个乐段拼合信息的值分散(或分开)放在多个码字里,一般是指把这个乐段拼合信息的值的BIT值分开成多个部分并分别放在多个码字里,这是显而易见的。Dispersing (or separating) the value of a piece of combined information mentioned in this application document and placing it in multiple codewords generally refers to separating the BIT value of the value of this piece of combined information into multiple parts and separately Put in multiple codewords, this is obvious.

本申请文件各处提及的‘诸个码字中的全部码字或部分码字’,是以整个码字为最小单位的,比如诸个码字中含3个码字(码字1,码字2,码字3),那诸个码字中的全部码字就是这3个码字,诸个码字中的部分码字的可能情况是(码字1)或(码字2)或(码字3)或(码字1,码字2)或(码字1,码字3)或(码字2,码字3);对于本申请文件各处提及的‘部诸码字中的全部码字或部分码字’或其它类似说法,也是按此类推理解,比如部诸码字中含3个码字(码字1,码字2,码字3),那部诸码字中的全部码字就是这3个码字,部诸码字中的部分码字的可能情况是(码字1)或(码字2)或(码字3)或(码字1,码字2)或(码字1,码字3)或(码字2,码字3)。The 'all codewords or part codewords in the codewords' mentioned throughout the application documents are based on the whole codeword as the minimum unit, such as containing 3 codewords in the codewords (codeword 1, Codeword 2, codeword 3), all codewords in that all codewords are exactly these 3 codewords, the possible situation of part codewords in all codewords is (codeword 1) or (codeword 2) Or (codeword 3) or (codeword 1, codeword 2) or (codeword 1, codeword 3) or (codeword 2, codeword 3); All codewords or part codewords' or other similar sayings in the word are also understood by analogy, such as containing 3 codewords (codeword 1, codeword 2, codeword 3) in all codewords of the department, the codewords of that department are all All codewords in codewords are exactly these 3 codewords, and the possibility situation of part codewords in all codewords is (codeword 1) or (codeword 2) or (codeword 3) or (codeword 1, Codeword 2) or (codeword 1, codeword 3) or (codeword 2, codeword 3).

显然,在任一权利要求项中提及的‘前述读物’是指本权利要求项内前面提及的‘读物’,而不是指本权利要求项内前面提及的‘有声音乐读物’。Obviously, the 'previous readings' mentioned in any claim refer to the 'readings' mentioned in this claim, not the 'audio music readings' mentioned in this claim.

本说明书中对本说明书和权利要求书的内容引用(比如有说:在权利要求2中提及的‘。。。’),其引用的内容可以是权利要求书或本说明书中的多句话或整句话,其引用的内容也可以是权利要求书或本说明书中的整句话的部分等。In this specification, the content of this specification and the claims is cited (for example: "..." mentioned in claim 2), and the content cited can be multiple sentences in the claims or this specification or A whole sentence, and the cited content may also be a part of a claim or a whole sentence in this specification.

本申请文件各处所说的‘数个’是指一个或多个。The 'several' mentioned throughout the application documents refers to one or more.

权利要求书里提及的‘前述部诸码字是所述诸个码字中的全部码字或部分码字;所述诸个码字可以是一个或多个码字;前述部诸码字可以是一个或多个码字;’,这里说的两个‘多个’可以是相同数目或不是相同数目,而且诸个码字包含的码字个数是大于或等于部诸码字包含的码字个数的(比如如诸个码字包含多个码字,部诸码字包含多个码字,那本括号内第1句里的‘多’就大于或等于本括号内第2句里的‘多’)。权利要求书里多处提及的‘所述部诸码字是所述诸个码字中的全部码字或部分码字;所述诸个码字可以是一个或多个码字;所述部诸码字可以是一个或多个码字’也是按这样理解。(本申请文件各处提及的诸个码字是多少个码字与诸个码字包含多少个码字是相同表达方式,本申请文件各处提及的部诸码字是多少个码字与部诸码字包含多少个码字是相同表达方式,比如‘诸个码字包含一个码字’与‘诸个码字是一个码字’是相同意思,比如‘诸个码字包含多个码字’与‘诸个码字是多个码字’是相同意思,比如‘部诸码字包含一个码字’与‘部诸码字是一个码字’是相同意思,比如‘部诸码字包含多个码字’与‘部诸码字是多个码字’是相同意思)All codewords of the aforementioned parts mentioned in the claims are all codewords or partial codewords in the described codewords; the described codewords can be one or more codewords; the codewords of the aforementioned parts Can be one or more codewords; ', the two 'multiple' mentioned here can be the same number or not the same number, and the number of codewords included in all codewords is greater than or equal to that contained in all codewords The number of code words (such as all code words comprise a plurality of code words, part all code words comprise a plurality of code words, ' many' in the 1st sentence in this bracket is just greater than or equal to the 2nd sentence in this bracket in the 'many'). The multiple codewords mentioned in the claims are all codewords or partial codewords in the codewords; the codewords can be one or more codewords; Part of all codewords can be one or more codewords' also be understood like this. (how many codewords and how many codewords are included in the codewords mentioned in this application document is the same expression, and how many codewords are the codewords mentioned in various places in this application document It is the same expression as how many codewords are included in all codewords, such as 'all codewords include a codeword' is the same meaning as 'all codewords are a codeword', such as 'all codewords include a plurality of codewords' Code word' and 'all code words are a plurality of code words' are the same meaning, such as 'part of all code words comprises a code word' and 'part of all code words is a code word' is the same meaning, such as 'part of all code words Word comprises a plurality of codewords' and ' all codewords are a plurality of codewords' is the same meaning)

本说明书各处提及的说明书句子中某一成分修饰另一成分,所说的修饰默认是指只修饰,即某一成分只限定另一成分,而不限定另一成分之外的内容,比如本说明书有说‘数个乐段拼合信息的值的变换值’中的‘数个’是修饰‘乐段拼合信息的值’的,那此‘数个’就是只用于限定这里的‘乐段拼合信息的值’的。In the description sentences mentioned in this manual, a certain component modifies another component, and the modification mentioned refers to only modification by default, that is, a certain component only limits another component, and does not limit the content other than the other component, such as This manual says that the "several" in the "transformation value of the value of the combination information of several sections" is to modify the "value of the combination information of the sections", then the "number" is only used to limit the value of the "music" here. The value of segment flattening information'.

如提及时无特别指明有他义,在本说明书中各处提及的第多少多少页,默认是指本说明书的第多少多少页。If there is no other meaning specified when mentioning, the number of pages mentioned in various places in this manual refers to the number of pages in this manual by default.

显然,本申请文件所说的有声音乐读物的整页面上的码字中,如任意两个码字,其码值都是数个乐段拼合信息的值,那这一个码字里的数个乐段拼合信息的值与这另一个码字里的数个乐段拼合信息的值一般是不同的。Obviously, among the codewords on the entire page of the audio music book mentioned in this application document, as any two codewords, the code values are all the values of the combined information of several music sections. The value of the piece combination information is generally different from the values of several pieces of piece combination information in the other codeword.

本申请文件所提到的解码程序及解码数据,是指用于对码字中的乐段拼合信息的值进行解码的解码程序及解码数据,且解码程序的程序代码及解码数据是按照对乐段拼合信息的值进行解码的解码规则来生成的(本说明书前面有对解码规则的举例)。The decoding program and decoding data mentioned in this application document refer to the decoding program and decoding data used to decode the value of the music segment combination information in the codeword, and the program code and decoding data of the decoding program are based on the music It is generated by the decoding rule for decoding the value of segment combination information (there is an example of the decoding rule at the beginning of this specification).

如提及时无特别指明有他义,本申请文件各处提及的权利要求书默认是指本申请文件的权利要求书。If there is no special indication of other meanings when mentioned, the claims mentioned everywhere in this application document refer to the claims of this application document by default.

权利要求书中各处提到的导电码,可以是通断式导电码或模数转换式导电码,可以是平面式导电码或非平面式导电码,可以是正贴形式导电码或反贴形式导电码,其码字区域内可以全是导电物质或既有导电物质又有非导电物质,权利要求书中各处提到的机电式导电码读码头,是可以读以上这些种导电码中任意一种或多种的读码头(机电式导电码读码头的阐述见2006102010785号发明申请)。The conductive codes mentioned everywhere in the claims can be on-off conductive codes or analog-to-digital conversion conductive codes, can be planar conductive codes or non-planar conductive codes, can be positively pasted conductive codes or reversely pasted. Conductive code, the code word area can be all conductive substances or both conductive substances and non-conductive substances. The electromechanical conductive code reading dock mentioned in the claims can read any of the above conductive codes. One or more types of reading docks (for the description of electromechanical conductive code reading docks, see invention application No. 2006102010785).

本申请文件提到的拾音器里的发音程序,可以是按谱数据播放音乐的程序,还可以是按其它形式播放音乐的程序。The pronunciation program in the pickup mentioned in this application document may be a program for playing music according to spectrum data, or a program for playing music in other forms.

本申请文件各处提到的有声音乐读物的所有乐段,是指此有声音乐读物各页面需播放的乐段的总和,也是指作为此有声音乐读物主体的读物的各页面需播放的乐段的总和;本申请文件各处提到的有声音乐读物的诸乐段,是指此有声音乐读物各页面需播放的乐段的总和的全部或部分,也是指作为此有声音乐读物主体的读物的各页面需播放的乐段的总和的全部或部分。本申请文件各处提到的诸乐段是指某一有声音乐读物的诸乐段。All the passages of the audio music book mentioned in this application document refer to the sum of the passages that need to be played on each page of the audio music book, and also refer to the passages that need to be played on each page of the reading that is the main body of the audio music book The sum of the audio music readings mentioned in this application document refers to all or part of the sum of the music sections that need to be played on each page of the audio music reading, and also refers to the reading that is the main body of the audio music reading. All or part of the sum of the sections to be played on each page. All music sections mentioned in this application document everywhere refer to all music sections of a certain audio music book.

本说明书各处述及拾音器读某码种的码字播音乐时,显然是默认此拾音器的读码头可以读此码种的码字。本说明书各处述及拾音器读数个码字并根据码值里的乐段拼合信息的值来播音乐时,显然是默认与此拾音器有关的存储器集里存有解码程序及解码数据,并且拾音器可按解码程序及解码数据来解码播音乐。When this instruction manual mentions that the pickup reads the code words of a certain code type to play music, it is obviously that the reading dock of this pickup can read the code words of this code type by default. When this manual mentions that the pickup reads several codewords and plays music according to the value of the segment combination information in the code value, it is obvious that there are decoding programs and decoding data in the memory set related to the pickup by default, and the pickup can be Decode and play music according to the decoding program and decoding data.

如有拾音器(包括本说明书前面说的电子产品拾音器),把读码头和MCU集成在一起(比如电子产品拾音器可读码播音乐,而此电子产品拾音器的主MCU与用于读码的照相电路是集成在一起的,这里照相电路相当于读码头),那这种情况应该是把读码头和MCU分开的等同替换。If there is a pickup (including the pickup of the electronic product mentioned earlier in this manual), integrate the reading terminal and the MCU (for example, the pickup of the electronic product can read the code and play music, and the main MCU of the pickup of the electronic product and the camera circuit for code reading are integrated together, where the camera circuit is equivalent to the reading dock), then this situation should be an equivalent replacement of separating the reading dock and the MCU.

本申请文件各处提到的某值涉及的BIT,是指某值本身或某值的变换值,比如说数个乐段拼合信息的值涉及的BIT,是指数个乐段拼合信息的值本身或数个乐段拼合信息的值的变换值。The BIT involved in a certain value mentioned in this application document refers to a certain value itself or the transformation value of a certain value. For example, the BIT involved in the value of several pieces of information combined refers to the value itself of the number of pieces combined information Or the conversion value of the value of the combination information of several sections.

如一个乐段拼合信息的值的BIT分开放在多个码字里(或数个乐段拼合信息的值的变换值的BIT分开放在多个码字里或类似情况),这多个码字又分别处于多本有声音乐读物的整页面上,使用时需由拾音器读这多本有声音乐读物上的这多个码字,而得到乐段拼合信息的值来播音乐,这种情况可把这多本有声音乐读物等同看成是一本有声音乐读物。As the BIT of the value of a music section splicing information is separately placed in multiple codewords (or the BIT of the transformed value of the value of several music section splicing information is separately placed in multiple codewords or the like), these multiple codewords Words are respectively located on the entire pages of many audio music books, and these multiple code words on these audio music books need to be read by the pickup during use, and the value of the music segment combination information is obtained to play music. This situation can be Think of these multiple audiobooks as one audiobook.

本申请文件所说的SNC710具备播音乐功能(四声道),是指具备按音乐合成方式播放四声道音乐的功能。The SNC710 mentioned in this application document has the function of playing music (four-channel), which means that it has the function of playing four-channel music in the way of music synthesis.

本申请文件各处所说的一个码字(比如导电码,条形码)对应播放的音乐发音可以是指此一个码字被拾音器读取后,拾音器根据此码字的码值携带的数个乐段拼合信息的值来播放的音乐发音(一般是这数个乐段的发音),也可以是指包括此码字在内的多个码字被拾音器读取后,拾音器根据这多个码字的码值组合起来得到的数个乐段拼合信息的值(比如一个乐段拼合信息的值,此一个乐段拼合信息的值是分散放置在这多个码字里的)来播放的音乐发音(一般是这数个乐段的发音),等。The music pronunciation corresponding to a code word (such as conductive code, bar code) mentioned in this application document can refer to that after this code word is read by the pickup, the pickup is assembled according to the number of music segments carried by the code value of the code word The value of the information to play the music pronunciation (generally the pronunciation of these several sections), it can also mean that after multiple code words including this code word are read by the pick-up, the pick-up will read the code according to the code of these multiple code words. The value of several pieces of information that are combined to obtain the value (such as the value of a piece of information, the value of this piece of information is scattered in these multiple codewords) to play the music pronunciation (generally is the pronunciation of these several passages), etc.

另外,可能某有声音乐读物整页面上的某码字,其码值与数个乐段拼合信息的值有关系,但其不能对应播放音乐,比如是把此有声音乐读物的一乐段的拼合信息的值的一部分BIT放在此码字里(此码字里不再放别的值),没有作将此乐段拼合信息的值的另一部分BIT往其它任何码字里放的任何处理,用任一拾音器读此一个码字后再读其它任意多个码字后都不能得到此乐段拼合信息的值的全部BIT,不能按此乐段拼合信息的值解码播乐段发音,此码字也相当于是废码字,本申请把这样的码字归入码字的值不是与拼合信息的值有关的的情况(因为码字里的那些BIT可看成是些无意义的垃圾BIT),象这样不能对应播放乐段发音的码字不属于此有声音乐读物整页面上诸个码字中的码字。In addition, there may be a certain code word on the entire page of an audio music book, and its code value is related to the value of the combination information of several music sections, but it cannot correspond to playing music, such as combining a music section of this audio music book A part of the BIT of the value of the information is placed in this codeword (no other values are placed in this codeword), and there is no processing of putting another part of the BIT of the value of this music segment into any other codeword. After reading this code word with any pick-up and then reading any other code words, all BITs of the value of this music segment combination information cannot be obtained, and the playback segment pronunciation cannot be decoded according to the value of this segment combination information. The word is also equivalent to a waste code word, and the application classifies such a code word into the situation that the value of the code word is not related to the value of the combined information (because those BITs in the code word can be regarded as some meaningless garbage BITs) , the code word that can not correspond to the pronunciation of the playing section like this does not belong to the code word in the code words on the whole page of this audio music book.

本申请文件各处提到的乐段发音(或乐段的发音),是指乐段的音乐声。The section pronunciation (or the pronunciation of the section) mentioned in this application document everywhere refers to the musical sound of the section.

权利要求书中各处提到的‘所述单片机芯片外所述拾音器内’,是指所述单片机芯片外且所述拾音器内的意思。本说明书中提到的‘单片机芯片外拾音器内’是指单片机芯片外且拾音器内的意思。"Inside the pickup outside the chip of the single-chip microcomputer" mentioned everywhere in the claims refers to the meaning outside the chip of the single-chip microcomputer and inside the pickup. The "single-chip microcomputer chip outside the pickup inside" mentioned in this manual refers to the meaning outside the single-chip microcomputer chip and inside the pickup.

本申请文件提到的‘其它电子类产品’是指MP3播放机,复读机,手机等。The 'other electronic products' mentioned in this application document refer to MP3 players, repeaters, mobile phones, etc.

本申请文件提到的手机,包括GSM,CDMA等手机,也包括小灵通,市话通等。The mobile phones mentioned in this application document include GSM, CDMA and other mobile phones, as well as PHS, local telephone and so on.

在权利要求书里多处有‘用于对乐段拼合信息的值进行解码的且是按照解码规则来解码的解码程序及解码数据’的说法,这里‘用于对乐段拼合信息的值进行解码’是修饰‘解码程序及解码数据’的,‘且是按照解码规则来解码’也是修饰‘解码程序及解码数据’的。In the claims, there are many places in the claim that "the decoding program and the decoded data are used to decode the value of the music segment combination information and are decoded according to the decoding rules". Decoding' is a modification of 'decoding program and decoding data', and 'decoding according to decoding rules' is also a modification of 'decoding program and decoding data'.

如把一本有声音乐读物的一些乐段称为此有声音乐读物的诸乐段,那这些乐段也是读物的诸乐段(此读物是此有声音乐读物的主体);同理,对于一本有声音乐读物的一些乐段,那这些乐段也是读物的一些乐段(此读物是此有声音乐读物的主体)。如把作为一本有声音乐读物主体的读物的一些乐段称为此读物的诸乐段,那这些乐段也是此有声音乐读物的诸乐段;同理,对于作为一本有声音乐读物主体的读物的一些乐段,那这些乐段也是此有声音乐读物的一些乐段。If some passages of an audio music reading are called the passages of this audio music reading, then these passages are also the passages of the reading (this reading is the main body of this audio music reading); Some passages of the audio music book, then these passages are also some passages of the reading (this reading is the main body of this audio music book). If some passages of the reading as the main body of an audio music book are called the passages of this reading, then these passages are also the passages of this audio music reading; Some passages of the reading, and these passages are also some passages of this audio music book.

本申请文件各处提及的有关乐段拼合信息的值涉及的编码规则可以是全部成文的(比如编码规则的内容都写成WORD文件形式),或全部不成文的(比如编码者按照他脑子里想好的编码规则来编码),或部分成文部分不成文的,本申请文件各处提及的有关乐段拼合信息的值涉及的解码规则也可以是全部成文的,或全部不成文的,或部分成文部分不成文的,比如编码时可以由编码者凭他脑里想好的一些编法来编码,这些编法也就是编码规则,只是此编码规则没形成文件而已,所以本申请文件各处所说的编码规则也可以是这种全部不成文的编码规则,也可以是部分成文部分不成文的编码规则;同样,程序员凭他脑里想好的一些解法来写拾音器的对乐段拼合信息的值执行解码的解码程序,这些解法也就是解码规则,只是此解码规则没形成文件而已,所以本申请文件各处所说的解码规则也可以是这种不成文的解码规则,也可以是部分成文部分不成文的解码规则。The coding rules involved in the value of the information about the combination of music pieces mentioned in this application document can be all written (such as the content of the coding rules is written in the form of a WORD file), or all unwritten (such as the coder according to his mind. Encoding with a well-thought-out encoding rule), or partly written and partly unwritten, the decoding rules involved in the value of the music segment combination information mentioned throughout this application file can also be all written, or all unwritten, or Some are written and some are unwritten. For example, when coding, the coder can code according to some coding methods in his mind. These coding methods are also coding rules, but this coding rule has not been formed into a document. The coding rules mentioned can also be all unwritten coding rules, or partially written and partially unwritten coding rules; similarly, the programmer writes the matching information of the pickup with some solutions in his mind These solutions are also the decoding rules, but the decoding rules have not been documented, so the decoding rules mentioned in this application document can also be this kind of unwritten decoding rules, or they can be partially written. Partially unwritten decoding rules.

如从提及处的上下文不能推出有他意,本申请文件各处提及的编码规则,默认是指有关乐段拼合信息的值涉及的编码规则;如从提及处的上下文不能推出有他意,本申请文件各处提及的解码规则,默认是指有关乐段拼合信息的值涉及的解码规则。If there is no other meaning from the context of the mention, the encoding rules mentioned throughout this application document refer to the encoding rules involved in the value of the information about the piece combination by default; if there is no other meaning from the context of the mention, The decoding rules mentioned in various parts of this application document refer to the decoding rules involved in the value of the music segment combination information by default.

权利要求书中有几处提到‘有多个页面而且在多个页面上都有码字’,其中后面那个‘多’是小于或等于前面那个‘多’的。There are several places in the claims that 'there are multiple pages and there are codewords on multiple pages', wherein the latter 'many' is less than or equal to the former 'many'.

本说明书各处述及到存储器集(其前面没加‘与拾音器有关的’)时,是默认此存储器集是与拾音器有关的存储器集(此拾音器是在提到存储器集那里涉及的拾音器)。When this specification sheet mentions storage set (do not add ' relevant to the pick-up' in front of it), it is acquiescence that this storage set is the storage set relevant to the pickup (this pickup is the pickup that is mentioned there related to the storage set).

本申请文件各处提到的‘乐段的拼合信息的值’与‘乐段拼合信息的值’同义。The 'value of the piece combination information' mentioned throughout this application document is synonymous with the 'value of the piece combination information'.

在权利要求3中提到的‘所述部诸码字中的每个码字的值,是此诸乐段中一个乐段的拼合信息的值或含有此诸乐段中一个乐段的拼合信息的值或。。。或含有包括此诸乐段中一个乐段的拼合信息的值在内的多个不同类型的乐段发音信息值的变换值的部分’,这种写法是指部诸码字中的每个码字都有这样的特征:码字的值,是此诸乐段中一个乐段的拼合信息的值或含有此诸乐段中一个乐段的拼合信息的值或。。。或含有包括此诸乐段中一个乐段的拼合信息的值在内的多个不同类型的乐段发音信息值的变换值的部分。对于权利要求书中其它类似写法也按此类推理解。(本段里,因为引用的句子太长,所有中间用‘。。。’来作了省略)Mentioned in claim 3, the value of each code word in the said part code words is the value of the mosaic information of a music section in these music sections or the combination of a music section containing a music section in these music sections The value of the message or . . . Or contain the part of the transformation value of the music section pronunciation information value of a plurality of different types including the value of the mosaic information of a music section in these all sections ', this way of writing refers to each code word in all codewords Word all has such feature: the value of code word is the value or the value of the splicing information of a music section in these all music sections or contains the value of the piece together information of a music section in these all music sections. . . Or the part that contains the transformation value of the pronunciation information value of a plurality of different types of music sections including the value of the combined information of one of the music sections. Other similar wordings in the claims should also be understood by analogy. (In this paragraph, because the quoted sentences are too long, all of them are omitted with '...')

本申请文件述及的拾音器,也可以用手机采用的照相头来读码,那一般就可把照相头当成读码头(这种情况是照相机图象处理电路在主MCU内且一般是照相头连线到主MCU的,这种情况照相涉及的读码软件一般放在主MCU内),或把照相头加照相机图象处理集成电路当成读码头(这种情况照相机图象处理电路是做成照相机图象处理集成电路的,照相头和此集成电路都在主MCU芯片外,一般是照相头连线到此集成电路且此集成电路连线到主MCU,这种情况照相涉及的读码软件一般可放在照相机图象处理集成电路内,由照相机图象处理集成电路得到读码结果后,再由此集成电路把读码结果传给主MCU),用照相功能读的相应的有声音乐读物整页面上的码字一般是二维条形码(并且因二维条形码的BIT容量大,所以可在二维条形码上放乐段拼合信息的值,由拾音器读码字并按码值里的乐段拼合信息的值解码播音乐,当然此拾音器需有按乐段拼合信息的值解码播音乐的能力),也可修改照相涉及的读码软件使此种照相方式的读码头可以读导电码或一维条形码或图形码等。本申请文件述及的拾音器的读码头可以是本段所述的读码头,可把这种读码头称为照相式读码头。本段所说的拾音器既可以是本说明书前面说到的电子产品拾音器,也可以是本申请文件提到的除电子产品拾音器以外的拾音器。The pickup mentioned in this application document can also read the code with the camera head that the mobile phone adopts, then generally the camera head can be used as the reading dock (this situation is that the camera image processing circuit is in the main MCU and generally the camera head is connected In this case, the code reading software involved in photography is generally placed in the main MCU), or the camera head plus camera image processing integrated circuit is used as a reading terminal (in this case, the camera image processing circuit is made into a camera For image processing integrated circuits, the camera head and the integrated circuit are both outside the main MCU chip. Generally, the camera head is connected to the integrated circuit and the integrated circuit is connected to the main MCU. In this case, the code reading software involved in photography is generally It can be placed in the camera image processing integrated circuit, after the code reading result is obtained by the camera image processing integrated circuit, and then the integrated circuit transmits the code reading result to the main MCU), and the corresponding audio music book read with the camera function is complete The code word on the page is generally a two-dimensional barcode (and because the BIT capacity of the two-dimensional barcode is large, so the value of the segment combination information can be placed on the two-dimensional bar code, the code word is read by the pickup and combined according to the segment in the code value The value of the information is decoded to play music, of course, the pickup needs to have the ability to decode and play music according to the value of the information according to the music segment), and the code reading software involved in the camera can also be modified so that the reading terminal of this camera can read conductive codes or one-dimensional Bar code or graphic code, etc. The reading terminal of the pickup mentioned in this application document can be the reading terminal described in this paragraph, and this reading terminal can be called a photographic type reading terminal. The pickup mentioned in this paragraph can be the electronic product pickup mentioned earlier in this specification, and it can also be a pickup other than the electronic product pickup mentioned in this application document.

本申请文件提到的作为码字载体的小块平面材料,其面积多大并无特别限制,如果用一面积比较大的平面材料,上有一个或多个码字,把此平面材料粘贴在作为有声音乐读物主体的读物的页面上,一般不会把读物页面盖完,此平面材料也等同是小块平面材料。The small piece of plane material mentioned in this application document as a code word carrier has no special limitation on its area. If a plane material with a relatively large area has one or more code words on it, this plane material is pasted on as a code word carrier. On the page of the reading material of the main body of the audio music book, the page of the reading material is generally not covered, and this plane material is also equivalent to a small piece of plane material.

本申请文件各处提到的‘数个乐段拼合信息的值的变换值’,其中的‘数个’是修饰‘乐段拼合信息的值’的;本申请文件各处提到的‘数个乐段谱数据的变换值’,其中的‘数个’是修饰‘乐段谱数据’的。The "transformation value of the value of the combination information of several pieces" mentioned in this application document, the "several" is to modify the "value of the combination information"; the "number" mentioned in this application document Transformation values of music spectrum data', among which 'several' are modified 'music spectrum data'.

本申请文件各处提到的‘一部诸码字’或‘某部诸码字’或‘此部诸码字’,都是指一部诸码字(此部诸码字中有数个码字),而不是指部诸码字中的一个码字。The 'all codewords of a part' or 'all codewords of a certain part' or 'all codewords of this part' mentioned in this application document all refer to a part of all codewords (there are several codewords in all codewords of this part) word), rather than referring to a codeword in all codewords.

一个乐段的谱数据就是谱格式的用于产生这个乐段发音的那些数据集合。The spectral data of a musical piece is the collection of those data in spectral format used to generate the pronunciation of this musical piece.

本申请文件所说的‘数个乐段的谱数据’与‘数个乐段谱数据’是相同表达,‘一个乐段的谱数据’与‘一个乐段谱数据’是相同表达,‘一些乐段的谱数据’与‘一些乐段谱数据’是相同表达,‘多个乐段的谱数据’与‘多个乐段谱数据’是相同表达。本申请文件所说的‘数个乐段的拼合信息的值’与‘数个乐段拼合信息的值’是相同表达,‘一个乐段的拼合信息的值’与‘一个乐段拼合信息的值’是相同表达,‘一些乐段的拼合信息的值’与‘一些乐段拼合信息的值’是相同表达,‘多个乐段的拼合信息的值’与‘多个乐段拼合信息的值’是相同表达。The 'spectrum data of several musical sections' mentioned in this application document is the same expression as 'several musical section spectral data', 'the spectral data of one musical section' is the same expression as 'a musical section spectral data', and 'some Spectral data of a musical section' is the same expression as 'spectral data of some musical sections', and 'spectral data of multiple musical sections' is the same expression as 'spectral data of multiple musical sections'. The "value of the combined information of several sections" and "the value of the combined information of several sections" mentioned in this application document are the same expressions, and the value of "the value of the combined information of one section" is the same as "the value of the combined information of one section". Value' is the same expression, 'the value of the combined information of some sections' is the same expression as 'the value of the combined information of some sections', and the 'value of the combined information of multiple sections' is the same as 'the value of the combined information of several sections' Value' is the same expression.

如从提到处的上下文不能推出有他义,本申请文件提到的其前面无数量词的‘乐段拼合信息的值’(或‘乐段的拼合信息的值’)默认是指一个乐段拼合信息的值;如从提到处的上下文不能推出有他义,本申请文件提到的其前面无数量词的‘乐段谱数据’(或‘乐段的谱数据’)默认是指一个乐段谱数据。If other meanings cannot be deduced from the context at which it is mentioned, the "value of the music segment combination information" (or "the value of the segment combination information") mentioned in this application document without quantifiers in front of it refers to a music segment combination by default The value of the information; if there is no other meaning from the context of the mention, the 'music spectrum data' (or 'music spectrum data') mentioned in this application document without quantifiers in front of it refers to a music spectrum by default data.

显然,本申请文件各处提到的拾音器内存储有某数据,默认是指与此拾音器有关的存储器集里存储有某数据。本申请文件各处提到的把某数据存储在拾音器内,默认是指把某数据存储在与此拾音器有关的存储器集里。本申请文件各处提到的拾音器内的某数据,默认是指与此拾音器有关的存储器集里的某数据。如与某拾音器有关的存储器集里只含一个存储器,那此存储器集里的各存储器显然就是指这一个存储器。本段各处提到的‘某数据’,都是泛指,比如是解码涉及的数据。Apparently, certain data is stored in the pickup mentioned in various places in this application document, and by default it means that certain data is stored in the memory set related to this pickup. Storing certain data in the pickup mentioned throughout this application document refers to storing certain data in the memory set relevant to this pickup by default. The certain data in the pick-up that mentions everywhere in this application document refers to certain data in the memory set relevant to this pick-up by default. Only contain a storer in the memory set relevant with certain pickup, then each memory store in this memory set obviously refers to this storer. The 'certain data' mentioned throughout this paragraph refers generally, such as the data involved in decoding.

在汉语里‘及’是可作为连词的,相当于‘和’,而且‘及’作为连词还有这种含义:如说‘某甲及某乙’,而某甲和某乙都存在,那说‘某甲及某乙’就等价于说‘某甲和某乙’,而如某甲存在而某乙不存在,那说‘某甲及某乙’就等价于说‘某甲’。在一个单片机系统中,主MCU是要使用程序和数据的,不使用数据的情况很少见,因为比如象汇编程序中用DB,DW定义的数据就应归属于前一句说的数据范围,而汇编程序中很少出现没有用DB,DW这样来定义的数据的情况,但极端情况也可能不用数据,所以本申请文件是采用‘程序及数据’,‘拾音器涉及的程序及拾音器涉及的数据’,‘所有程序及所有数据’这样有‘及’的写法,本申请文件约定,对于本申请文件各处所说的‘程序及数据’,如数据存在,那说‘程序及数据’就相当于说‘程序和数据’,如数据不存在,那说‘程序及数据’就相当于说‘程序’;对于本申请文件各处所说的‘拾音器涉及的程序及拾音器涉及的数据’,如拾音器涉及的数据存在,那说‘拾音器涉及的程序及拾音器涉及的数据’就相当于说‘拾音器涉及的程序和拾音器涉及的数据’,如拾音器涉及的数据不存在,那说‘拾音器涉及的程序及拾音器涉及的数据’就相当于说‘拾音器涉及的程序’;对于本申请文件各处所说的‘所有程序及所有数据’,‘解码程序及解码数据’也按此类推理解。(本申请文件是包括本次申请的权利要求书和说明书的)In Chinese, 'and' can be used as a conjunction, which is equivalent to 'and', and 'and' has this meaning as a conjunction: for example, 'some A and a certain B', and both a certain A and a certain B exist, then Saying 'A and B' is equivalent to saying 'A and B', and saying 'A and B' is the same as saying 'A' if A exists and B does not . In a single-chip microcomputer system, the main MCU needs to use programs and data, and it is rare to not use data, because for example, the data defined by DB and DW in assembler programs should belong to the data range mentioned in the previous sentence, and In the assembler program, it is rare that the data defined by DB and DW is not used, but in extreme cases, the data may not be used, so this application document uses "programs and data", "programs related to pickups and data related to pickups" , "all programs and all data" is written with "and". This application document stipulates that for the "programs and data" mentioned in this application document, if data exists, then saying "programs and data" is equivalent to saying "Programs and data", if the data does not exist, then saying "programs and data" is equivalent to saying "programs"; for the "programs related to the pickup and data related to the pickup" mentioned throughout this application document, if the pickup is related If the data exists, saying 'programs involved in the pickup and data related to the pickup' is equivalent to saying 'programs involved in the pickup and data related to the pickup', if the data involved in the pickup does not exist, then say 'programs involved in the pickup and data related to the pickup' The data' is equivalent to saying 'programs involved in the pickup'; for 'all programs and all data' mentioned throughout this application document, 'decoding program and decoding data' can also be understood by analogy. (This application document includes the claims and description of this application)

本申请文件各处说到的从多个码字得到某值,是指用拾音器读这多个码字并把这多个码字的值组合起来后得到某值,比如把一个乐段拼合信息的值的BIT分开放在多个码字里时,读码使用时就要用拾音器读这多个码字并把这多个码字的值组合起来后得到此乐段拼合信息的值的BIT,即从这多个码字得到此乐段拼合信息的值,或是比如把一个乐段拼合信息的值的变换值的BIT分开放在多个码字里,读码使用时要读这多个码字并把这多个码字的值组合起来后得到此乐段拼合信息的值的变换值的BIT,并对此变换值作逆变换,这样可从这多个码字得到此乐段拼合信息的值。Getting a certain value from multiple codewords mentioned in this application document refers to reading these multiple codewords with a pickup and combining the values of these multiple codewords to obtain a certain value, such as combining a piece of information When the BIT of the value is placed in multiple codewords separately, when reading the code, it is necessary to use a pickup to read the multiple codewords and combine the values of the multiple codewords to obtain the BIT of the value of the segment information , that is, to obtain the value of the piece combination information from these multiple codewords, or for example, separate the BIT of the transformation value of a piece piece together information value into multiple codewords, and read the code when using it. codewords and combine the values of these multiple codewords to obtain the BIT of the transformation value of the value of this music section stitching information, and do inverse transformation to this transformation value, so that this music section can be obtained from these multiple codewords The value of the flatten information.

本说明书中各处提到的码字里的数个乐段拼合信息的值等类似提法,是指可从这码字得到(比如直接得到或转换得到)的数个乐段拼合信息的值等。The values of the several pieces of combination information in the codeword mentioned everywhere in this manual, etc., refer to the value of the several pieces of combination information that can be obtained (such as obtained directly or converted) from this codeword wait.

本申请文件各处所说的一个码字的值是多个乐段拼合信息的值,一般是指把这多个乐段拼合信息的值的BIT顺序放在此码字里,或交叉放在此码字里。其它类似说法也按这样类推理解。The value of a code word mentioned in this application document is the value of a plurality of music segment combination information, and generally refers to putting the BIT sequence of the value of the plurality of music segment combination information in this code word, or interleaved here In the code word. Other similar statements can also be understood by analogy.

本申请文件各处说到的拾音器读取码字以播音乐(或类似说法),实际是指拾音器被人操纵以读取码字以播音乐,这是显然的,之所以说成拾音器读取码字以播音乐是为了叙述简洁。显然在权利要求1,2,6中提到的有声音乐读物,读物,拾音器都是指一个的。The pickup mentioned in this application document reads the codeword to play music (or similar sayings), actually refers to that the pickup is manipulated by people to read the codeword to play music. This is obvious, so it is said that the pickup reads The purpose of codewords to play music is to describe concisely. Obviously in claim 1, 2, the audio music reading material mentioned in 6, reading material, pickup all refers to one.

拾音器每次根据读得的一个码字的码值使发音器件播放音乐发音,或每次根据读得的多个码字的码值使发音器件播放音乐发音;然后下一次又可重复此动作,去读别的数个码字并播放别的音乐发音。The pickup makes the pronunciation device play the music pronunciation according to the code value of a code word that is read at every turn, or makes the pronunciation device play the music pronunciation according to the code values of a plurality of code words that are read; then this action can be repeated next time, Go read other codewords and play other music pronunciations.

本申请文件所说的拾音器读数个码字后播放音乐发音(或播放乐段发音),可能是人用拾音器读了数个码字后拾音器自己就播放数个乐段发音(播放数个乐段发音包括播放乐段发音的情况),也可能是人用拾音器读数个码字后拾音器再根据人的按键操作来播放数个乐段发音(播放数个乐段发音包括播放乐段发音的情况),这些方式互相是等同替换。The said pickup of this application document reads several code words and then plays the music pronunciation (or plays the pronunciation of the music section), it may be that after the people read several code words with the pickup, the pickup itself plays the pronunciation of several sections (playing several sections of music) Pronunciation includes the situation of playing the pronunciation of the passage), and it may also be that the human reads a number of code words with the pickup and then the pickup plays several passages of pronunciation according to the key operation of the person (playing the pronunciation of several passages includes the situation of playing the pronunciation of the passage) , these methods are equivalent replacements for each other.

如提及时无特别指明有他义且不能从提及处的上下文推出有他义,本申请文件各处提到的其前面没加数量词的喇叭,喇叭电路,音乐芯片都默认是指一个的,这也是显然的。If there is no other meaning specified when mentioning and it cannot be deduced from the context of the mention, the speakers, speaker circuits, and music chips mentioned throughout this application document without quantifiers in front of them all refer to one by default. This is also obvious.

本申请文件所说的带喇叭的音乐芯片,是指其外接有喇叭的音乐芯片。The music chip with a speaker mentioned in this application document refers to a music chip with a speaker connected externally.

除非可从提及处上下文推出有他义,本申请文件各处提及的‘播’,默认与‘播放’等同;比如说播音乐就是指播放音乐,说播乐段发音就是指播放乐段发音。Unless other meanings can be deduced from the context of the mention, the "broadcast" mentioned in this application document is the same as "play" by default; pronounce.

本申请文件描述读码字播音乐的过程,主要以先读数个码字,从这数个码字得到数个乐段拼合信息的值,再按这数个乐段拼合信息的值播音乐为例来说的(假设这3句话说的方式是方式1),实际中可能会在使用过程中读数个码字后,每取出一个乐段拼合信息的值就播一个乐段的发音(即比如从这数个码字中共可得到2个乐段拼合信息的值,那先从这数个码字中取得第1个乐段拼合信息的值并解码播此乐段的音,再从这数个码字中取得第2个乐段拼合信息的值并解码播此乐段的音),甚至每取出乐段拼合信息的值的一部分就播乐段的一部分发音(即比如从这数个码字中共可得到1个乐段的拼合信息的值,那先从这数个码字中取得这个乐段的拼合信息的值的前一部分并根据其解码播音乐,再从这数个码字中取得这个乐段的拼合信息的值的后一部分并根据其解码播后面的音乐,这2步播的音乐在听觉上就组成这个乐段的发音),这些种方式与本段前面说的方式1是等同替换。This application document describes the process of reading code words and playing music. It mainly reads several code words first, obtains the value of several pieces of information combined from these codes, and then plays music according to the value of these several pieces of information. For example (assuming that the way these 3 sentences are said is mode 1), in practice, after reading several codewords during use, the pronunciation of a music section is played every time the value of a piece of combination information is taken out (that is, for example From these several codewords, the value of 2 pieces of music segment combination information can be obtained, then first obtain the value of the first music piece combination information from these codewords and decode and play the sound of this piece of music, and then from these numbers Obtain the value of the 2nd music segment combination information in a code word and decode and play the sound of this segment), and even play a part of the music segment pronunciation every time a part of the value of the segment combination information is taken out (that is, for example, from these codes Words can get the value of the mosaic information of 1 paragraph, then first get the first part of the value of the mosaic information of this paragraph from these codewords and decode and play music according to it, and then from these codewords Obtain the latter part of the value of the combined information of this section and play the following music according to its decoding. The music played in these 2 steps constitutes the pronunciation of this section in terms of hearing), these methods are the same as the method 1 mentioned earlier in this paragraph is an equivalent replacement.

权利要求书和本说明书中各处提到的拾音器读部诸码字中的数个码字后播放音乐以及类似说法,除了是本说明书前面所说的一些具体情况外,还可能是:一个乐段的拼合信息的值的BIT分开放在部诸码字中的多个码字里,按常规读码使用时应该用拾音器利用读码头依次读这多个码字以得到此乐段的拼合信息的值的全部BIT,再按此乐段的拼合信息的值的全部BIT解码播放此乐段的发音,实际中也可能是用拾音器利用读码头读这多个码字中其中一个或几个码字以得到此乐段的拼合信息的值的部分BIT,并按此乐段的拼合信息的值的部分BIT解码播放此乐段的部分发音(即部分乐段的发音),如这里说的此乐段的拼合信息的值的部分BIT符合本说明书前面述的拼合信息的值的条件,比如是表示如何用多个音素来拼合本段前述部分乐段的发音的,那可把本段前述部分乐段看成是另一乐段,把这里说的此乐段的拼合信息的值的部分BIT看成是这另一乐段的拼合信息的值,是对此拼合信息的值解码发音的,权利要求书和本说明书中各处提到的拾音器读部诸码字中的数个码字后播放音乐就可以是上述的这种具体情况,而且还可能是别的更多本说明书没提及的具体情况。Play music and similar sayings after several code words in the code words of the pick-up reading part mentioned everywhere in the claims and this specification, except some specific situations mentioned earlier in this specification, it may also be: a music The BIT of the value of the combined information of the section is placed separately in multiple codewords in the codewords. When using the conventional code reading, you should use the pickup to read the multiple codewords in order to obtain the combined information of this music section. All the BITs of the value, and then decode and play the pronunciation of this section according to all the BITs of the value of the combined information of this section. In practice, it is also possible to use the pickup to read one or several of the multiple codewords by using the reading dock. Word to obtain the part BIT of the value of the merging information of this music section, and decode and play the part pronunciation of this music section (that is, the pronunciation of a part of the music section) according to the part BIT decoding of the value of the merging information of this music section, as said here Part of the BIT of the value of the combined information of the paragraph meets the conditions of the value of the combined information mentioned above in this manual, for example, it indicates how to use multiple phonemes to combine the pronunciation of the preceding part of this paragraph, then you can put the preceding part of this paragraph A section is regarded as another section, and the BIT part of the value of the combined information of this section mentioned here is regarded as the value of the combined information of this other section, and the value of the combined information is decoded and pronounced. Play music just can be above-mentioned this specific situation after the several codewords in all codewords of the pick-up that mentions in the claims and this specification, and it may be other more that this specification does not mention specific circumstances.

本申请文件所说的用拾音器读多个码字,一般是指用拾音器利用读码头依次读这多个码字(每次是读一个码字),或用拾音器利用读码头按约定次序读这多个码字。What this application document says reads a plurality of codewords with pickup, generally refers to utilizes reading terminal to read these multiple codewords (reading a codeword at a time) with pickup in turn, or utilizes reading terminal to read these in agreed order with pickup multiple codewords.

本申请文件所说的可把有声音乐读物整页面上的诸个码字中的部分能被某拾音器读取以用于播放音乐的码字,定义为此拾音器能读的部诸码字,那可能在此部诸码字之外的此诸个码字中的一些码字中的各码字,也能被此拾音器读取以用于播放音乐。The part of the codewords on the entire page of the audio music book that can be read by a certain pickup for playing music is defined in this application as the codewords that can be read by the pickup, then Each code word in some of these code words other than the code words of this part may also be read by the pickup for playing music.

如从提及处的上下文不能推出有他意,本说明书各处提及的某某数据的BIT默认是指某某数据的全部BIT,比如提及一个乐段拼合信息的值的BIT时,如从提及处的上下文不能推出有他意,那提及的一个乐段拼合信息的值的BIT就是指此乐段拼合信息的值的全部BIT。If there is no other meaning from the context of the mention, the BIT of a certain data mentioned in this manual refers to all the BITs of a certain data by default. The context of the mention can not be deduced to have other meanings, and the BIT of the value of a piece of music piece information mentioned refers to all the BITs of the value of this piece of piece piece information.

本说明书除本段外的其它段所说的拾音器内只有一个发音器件,比如一个喇叭或带喇叭的音乐芯片等,而如在拾音器内有多个本说明书其它段所说的发音器件,拾音器有的时候用其中一个本说明书其它段所说的发音器件播放音乐,有的时候用其中另外的一个本说明书其它段所说的发音器件播放音乐,那可把这多个本说明书其它段所说的发音器件等同当成一个发音器件,权利要求2,6里提到的发音器件就可以是这种由多个本说明书其它段所说的发音器件(或本说明书没提到的功能类似于本说明书其它段所说的发音器件的发音器件)组合成的,这种情况时仍是这种组合形式的发音器件与MCU意向连接,且是指这多个本说明书其它段所说的发音器件(或本说明书没提到的功能类似于本说明书其它段所说的发音器件的发音器件)中的每个本说明书其它段所说的发音器件(或本说明书没提到的功能类似于本说明书其它段所说的发音器件的发音器件)与MCU直接连接或间接连接,拾音器在使用其中一个或同时使用其中多个本说明书其它段所说的发音器件(或本说明书没提到的功能类似于本说明书其它段所说的发音器件的发音器件)播放音乐时,等同就是拾音器在使用这种组合形式的发音器件播放音乐;比如在一个拾音器内既有带喇叭的音乐芯片1也有带喇叭的音乐芯片2,拾音器有的时候是用带喇叭的音乐芯片1播放音乐,有的时候是用带喇叭的音乐芯片2播放音乐,那可把这带喇叭的音乐芯片1和带喇叭的音乐芯片2合起来当成一个发音器件,权利要求2,6里提到的发音器件可就是这种发音器件,拾音器在使用其中的带喇叭的音乐芯片1播放音乐时,等同就是拾音器在使用这种发音器件(带喇叭的音乐芯片1加带喇叭的音乐芯片2)播放音乐。所以本申请文件所说的拾音器内只有一个发音器件,此发音器件可以是一个喇叭,也可以是一个带喇叭的音乐芯片,也可以是由数个喇叭组成的,或由数个带喇叭的音乐芯片组成的,或由数个喇叭和数个带喇叭的音乐芯片组成的,或更多组成的情况。如把本说明书其它段所说的发音器件或本说明书没提到的功能类似于本说明书其它段所说的发音器件的发音器件定义为子发音器件,那本段所说的组合形式的发音器件是由多个子发音器件组成的,此组合形式的发音器件与MCU意向连接,且这里意向连接一般包括几种情况:1是这多个子发音器件都各与MCU直接连接,2是这多个子发音器件都各与MCU间接连接,3是这多个子发音器件中部分子发音器件各与MCU直接连接,另一部分子发音器件各与MCU间接连接。There is only one pronunciation device in the pickup mentioned in other paragraphs of this manual except this paragraph, such as a horn or a music chip with a horn, etc., and if there are multiple pronunciation devices mentioned in other paragraphs of this manual in the pickup, the pickup has Sometimes use one of the pronunciation devices mentioned in other paragraphs of this manual to play music, and sometimes use another pronunciation device mentioned in other paragraphs of this manual to play music, then you can use these multiple pronunciation devices mentioned in other paragraphs of this manual. The sounding device is equivalent to a sounding device, and the sounding device mentioned in claim 2 and 6 can be the sounding device (or the function not mentioned in this specification is similar to other parts of this specification) by other paragraphs of this specification. The sounding device of the sounding device mentioned in the paragraph) is combined, in this case, the sounding device of this combined form is still connected with the MCU intention, and refers to the sounding device (or the sounding device in this manual) mentioned in other paragraphs of this manual The function not mentioned in the specification is similar to the pronunciation device of the pronunciation device mentioned in other paragraphs of this specification) in each of the pronunciation devices mentioned in other paragraphs of this specification (or the function not mentioned in this specification is similar to that of other paragraphs in this specification) The pronunciation device of the said pronunciation device) is directly connected or indirectly connected with the MCU, and the pickup is using one or more of the pronunciation devices mentioned in other paragraphs of this manual (or the functions not mentioned in this manual are similar to other parts of this manual. When the sounding device of the sounding device mentioned in the paragraph) plays music, it is equivalent to that the pickup is playing music using this combined sounding device; The pickup sometimes uses the music chip 1 with the horn to play music, and sometimes uses the music chip 2 with the horn to play music, then the music chip 1 with the horn and the music chip 2 with the horn can be combined as one Sounding device, claim 2, the sounding device mentioned in the 6th can be exactly this sounding device, and when the pickup uses the music chip 1 with the loudspeaker therein to play music, it is equal to that the pickup is using this sounding device (music with the loudspeaker) Chip 1 plus music chip 2) playing music with speakers. Therefore, there is only one sounding device in the pickup mentioned in this application document. This sounding device can be a speaker, or a music chip with a speaker, or can be composed of several speakers, or a music chip with a speaker. Chips, or composed of several speakers and several music chips with speakers, or more. If the pronunciation device mentioned in other paragraphs of this specification or the pronunciation device not mentioned in this specification is similar to the pronunciation device mentioned in other paragraphs of this specification, it is defined as a sub-sounding device, then the pronunciation device of the combined form mentioned in this paragraph It is composed of multiple sub-pronunciation devices. The combined pronunciation device is connected to the MCU intentionally, and the intentional connection generally includes several situations: 1. The multiple sub-pronunciation devices are directly connected to the MCU. 2. The multiple sub-pronunciation devices are directly connected to the MCU. The devices are all indirectly connected to the MCU. 3, some of the sub-pronunciation devices are directly connected to the MCU, and the other part of the sub-pronunciation devices are indirectly connected to the MCU.

一般情况,拾音器播放乐段发音,播放一个乐段发音,播放此乐段发音,或其它类似说法比如说把此乐段播出来或使拾音器的发音器件播放乐段发音或码字对应播放的乐段发音等,播放的是一个乐段的全部发音,一般情况,拾音器播放多个乐段发音,或其它类似说法比如说把多个乐段播出来或使拾音器的发音器件播放多个乐段发音等,播放的是多个乐段中每个乐段的全部发音;但实际中也可能出现是按约定(比如在码字的BIT中有标志BIT)只播某乐段的发音的一部分(比如有声音乐读物的一个码字的值有一乐段的拼合信息的值,此乐段的音乐长度为10秒,拾音器读此码字后,按码字里的此乐段的拼合信息的值解码,但按约定只播其前面5秒的音,或是按约定只播其前面2秒的音和后面2秒的音)。In general, the pickup plays the pronunciation of the passage, plays a pronunciation of the passage, plays the pronunciation of the passage, or other similar sayings such as broadcasting the passage or making the pronunciation device of the pickup play the pronunciation of the passage or the code word corresponding to the broadcast Section pronunciation, etc., what is played is the entire pronunciation of a section. In general, the pickup plays multiple section pronunciations, or other similar sayings such as broadcasting multiple sections or making the pronunciation device of the pickup play multiple sections. section pronunciation, etc., what is played is all the pronunciations of each section in multiple sections; but in practice, it may also appear that only a part of the pronunciation of a section is played according to the agreement (for example, there is a sign BIT in the BIT of the codeword) (for example, the value of a code word of audio music book has the value of the combined information of a section, and the music length of this section is 10 seconds. After the pickup reads this code word, press the value of the combined information of this section in the code word Decoding, but only play the sound of the preceding 5 seconds according to the agreement, or only play the sound of the preceding 2 seconds and the sound of the following 2 seconds according to the agreement).

本申请文件约定象上一段说的那样把一个乐段的部分发音播放出来的情况也叫做播放乐段发音;如不能从提及处的上下文推出有特定含义,本说明书前面各处提及的拾音器播放乐段发音,播放一个乐段发音,播放此乐段发音,或其它类似说法比如说把此乐段播出来或使拾音器的发音器件播放乐段发音或码字对应播放的乐段发音等,默认既可以是播放此乐段的全部发音,也可以是播放此乐段的部分发音;如不能从提及处的上下文推出有特定含义,本说明书前面各处提及的拾音器播放多个乐段发音,或其它类似说法比如说把多个乐段播出来或使拾音器的发音器件播放多个乐段发音等,默认既可以是播放此多个乐段中每个乐段的全部发音,也可以是播放此多个乐段中每个乐段的部分发音,也可以是播放此多个乐段中部分乐段中每个乐段的全部发音以及此多个乐段中另一部分乐段中每个乐段的部分发音。而且本段说的播放一个乐段发音是本段说的播放多个乐段发音的下位概念,因为播放多个乐段发音包括播放一个乐段发音的情况;至于本申请文件各处说的播放一个或多个乐段发音(或播放数个乐段发音),是指播放一个乐段发音或播放多个乐段发音,显然分别按本段内以上所述播放一个乐段发音,播放多个乐段发音的情况理解即可;在权利要求书里各处提到的播放乐段发音,既可以是播放乐段的全部发音,也可以是播放乐段的部分发音,(以上3句话说的乐段都是指单个的乐段)。本段所说的播放发音都是指解码播放发音。This application document stipulates that the situation of playing part of the pronunciation of a piece of music as said in the previous paragraph is also called playing the pronunciation of a piece of music; if it cannot be deduced from the context of the mention that there is a specific meaning, the pickup mentioned in the front of this specification Play the pronunciation of a passage, play the pronunciation of a passage, play the pronunciation of this passage, or other similar expressions such as broadcasting this passage or making the pronunciation device of the pickup play the pronunciation of a passage or the pronunciation of a passage played by the codeword, etc. , the default can be to play all the pronunciation of this section, or to play part of the pronunciation of this section; if the specific meaning cannot be deduced from the context of the mention, the pickups mentioned in the front of this manual play multiple music Paragraph pronunciation, or other similar expressions such as broadcasting multiple passages or making the pronunciation device of the pickup play multiple passages, etc., the default can be to play all the pronunciations of each passage in the multiple passages, It can also be to play part of the pronunciation of each of the multiple sections, or to play the entire pronunciation of each of some of the multiple sections and another part of the multiple sections Partial pronunciation of each section in . And the pronunciation of playing a passage said in this paragraph is the subordinate concept of the pronunciation of playing a plurality of passages said in this paragraph, because the pronunciation of playing a plurality of passages includes the situation of playing a pronunciation of a passage; The pronunciation of one or more passages (or the pronunciation of several passages) refers to the pronunciation of one passage or the pronunciation of multiple passages. Obviously, the pronunciation of one passage is played and the pronunciation of multiple passages is played as described above in this paragraph. It is enough to understand the pronunciation of the section; the pronunciation of the playback section mentioned in the claims can be the entire pronunciation of the playback section, or part of the pronunciation of the playback section, (the above three sentences say Sections refer to individual sections). The playback pronunciation mentioned in this paragraph refers to the decoding playback pronunciation.

注:在本申请文件的说明书中,有些地方有提及的‘本段以上’,‘本段上面’,‘本段前面’,‘本段内前面’等,显然都是指本段范围内前面所述;同理,如有些地方有提及到的‘本段以下’,‘本段下面’,‘本段后面’,是指本段范围内后面所述。本说明书中有些地方提及的‘前面两段’,是不包括此‘前面两段’几个词本身所处的那段的,其它比如前面多少段这样的类似提法也是这样的。本说明书中有些地方提及的‘后面两段’,是不包括此‘后面两段’几个词本身所处的那段的,其它比如后面多少段这样的类似提法也是这样的。Note: In the description of this application document, there are references to 'above this paragraph', 'above this paragraph', 'before this paragraph', 'before this paragraph', etc., which obviously refer to the scope of this paragraph As mentioned above; similarly, if there are references to 'below this paragraph', 'below this paragraph', and 'behind this paragraph' in some places, it refers to what is described later within the scope of this paragraph. The "first two paragraphs" mentioned in some places in this manual does not include the paragraph where the words "first two paragraphs" are located, and the same is true for other similar references such as the first few paragraphs. The "two paragraphs in the back" mentioned in some places in this specification does not include the paragraph in which the words "two paragraphs in the back" are located, and the same is true for other similar references such as several paragraphs in the back.

注:在本说明书中所称‘本申请文件’包括本次申请的权利要求书,说明书和说明书附图;如提到时无特别指明有他义,本申请文件的说明书中各处提到的‘说明书’和‘权利要求书’和权利要求项(比如提到的‘权利要求1’)默认是指针对本申请文件的,这是显然的。本说明书各处说到的多少号专利申请,都是指中国国内的专利申请。Note: The term "this application document" in this specification includes the claims of this application, the specification and the drawings of the specification; if there is no special meaning when it is mentioned, the specification mentioned in this application document is everywhere It is obvious that 'description' and 'claims' and claim items (such as the mentioned 'claim 1') refer to this application document by default. The number of patent applications mentioned in this manual refers to the domestic patent applications in China.

Claims (10)

1. sound music reading material, its main body is a reading matter, on the whole page of aforementioned reading matter all code word is arranged, this all code word belongs to aforementioned sound music reading material, it is characterized in that:
The code value of each code word in aforementioned all the code word is altogether how to come the value of the amalgamation information that the amalgamation period pronounces relevant with phoneme with several expressions.
2. sound music reading material according to claim 1 is characterized in that:
The all code words of portion are arranged in described all the code word; Aforementioned all code word is whole code words or the part code word in described all the code word; Described all code word can be one or more code words; Aforementioned all code word can be one or more code words;
Each code word in aforementioned all code word can both be read by same acoustic pickup, each code word in aforementioned all code word is that the reading terminal by aforementioned acoustic pickup is read by aforementioned acoustic pickup, and each code word in aforementioned all code word can be read to be used for aforementioned acoustic pickup by aforementioned acoustic pickup broadcasts the period pronunciation;
Aforementioned acoustic pickup includes an aforementioned reading terminal, a pronunciation device, a singlechip chip, this singlechip chip is the main process chip of aforementioned acoustic pickup, this singlechip chip is connected with aforementioned reading terminal purpose in aforementioned acoustic pickup, this singlechip chip is connected with aforementioned pronunciation device purpose in aforementioned acoustic pickup, aforementioned acoustic pickup in use, need use the storer collection, this storer collection is used to deposit the data that program that aforementioned acoustic pickup relates to and aforementioned acoustic pickup relate to, and each storer in this storer collection can be read use by the aforementioned singlechip chip of aforementioned acoustic pickup;
Aforementioned acoustic pickup is when being used, the aforementioned singlechip chip of aforementioned acoustic pickup utilizes the aforementioned reading terminal of aforementioned acoustic pickup can read each code word in aforementioned all code word, aforementioned acoustic pickup when being used, the aforementioned singlechip chip of aforementioned acoustic pickup according to read aforementioned all code word in the code value of code word make the aforementioned pronunciation device of aforementioned acoustic pickup broadcast the period pronunciation;
For a code word in aforementioned all code word, its code value is how to come the value of amalgamation information of amalgamation period pronunciation relevant with phoneme with several expressions together, how this several expression comes in the value of amalgamation information of amalgamation period pronunciation one to represent how to come with phoneme the value of the amalgamation information of amalgamation period pronunciation with phoneme, it is the value of representing how to come the amalgamation information of amalgamation period 1 pronunciation with phoneme, the value of amalgamation information how this expression comes this period 1 pronunciation of amalgamation with phoneme is according to generations of encoding of a kind of coding rule, and how this expression comes the value of the amalgamation information that this period 1 of amalgamation pronounces with phoneme also is the value 1 of amalgamation information.
3. sound music reading material according to claim 2 is characterized in that:
The value 1 of described amalgamation information is according to the generation of encoding of described coding rule, containing compressed encoding in this coding rule handles, there is compressed encoding to handle when encoding the value 1 that generates described amalgamation information with this coding rule, only needing maybe needs a plurality of code words from described all code word that comprises a described code word just can obtain the value 1 of described amalgamation information from the value 1 that a described code word can obtain described amalgamation information, the described singlechip chip of described acoustic pickup utilizes described reading terminal to can read a described code word or described a plurality of code word and obtains the value 1 of described amalgamation information, the described singlechip chip of described acoustic pickup is to make the described pronunciation device of described acoustic pickup broadcast described period 1 pronunciation to reading the 1 execution decoding of yard value of the described amalgamation information that obtains by a kind of rule of decoding, and contain the compression coding processing in this decoding rule, the compression coding processing is arranged when with this decoding rule value 1 of described amalgamation information being carried out decoding.
4. sound music reading material according to claim 3 is characterized in that:
The value 1 of described amalgamation information is the value of amalgamation information that how to represent to come with a plurality of phonemes the pronunciation of the described period 1 of amalgamation, and has at least in these a plurality of phonemes and include different pronunciations values constantly in the phoneme;
For certain phoneme of the related use of described coding rule, can be used for the amalgamation pronunciation of different periods;
Described decoding rule is special corresponding or general corresponding with described coding rule.
5. according to claim 1,2,3 or 4 described sound music reading materials is characterized in that:
Described broadcast period pronunciation is non-whole pronunciations of broadcasting whole pronunciations of period or broadcasting period;
Described period 1 pronunciation of described broadcast is non-whole pronunciations of broadcasting whole pronunciations of period 1 or broadcasting period 1;
Aforementioned period 1 pronunciation of described broadcast is non-whole pronunciations of broadcasting whole pronunciations of period 1 or broadcasting period 1;
Described reading matter can be a book, and books are printed on the card of picture and text, papers and magazines or map;
Described sound music reading material has one or more pages and only on a page code word is arranged, and perhaps, a plurality of pages is arranged and code word is all arranged on a plurality of pages; On arbitrary page of described sound music reading material or do not have code word, or a code word is arranged, or a plurality of code words that are in any diverse location are arranged;
A certain code word in described all the code word is to be in the position that need broadcast several periods pronunciations that this code word correspondence broadcasts on the whole page of described reading matter;
A certain code word in described all the code word can be a conduction code, perhaps can be bar code, perhaps can be graphic code, perhaps can be magnetic code; For a certain code word in described all the code word, the carrier of this code word can be a page or leaf of described reading matter, the bearer plane of this code word is a page of described reading matter, perhaps, the carrier of this code word can be a fritter planar materials, the bearer plane of this code word is the surface of planar materials, and it is to stick on the page of described reading matter that this code word, this planar materials are arranged on the surface of this planar materials.
6. one kind can be used the acoustic pickup that broadcasts the period pronunciation with the described sound music reading material of claim 1, this acoustic pickup is acoustic pickup just, this acoustic pickup includes a reading terminal, a pronunciation device, a singlechip chip, this singlechip chip is the main process chip of aforementioned acoustic pickup, this singlechip chip is connected with aforementioned reading terminal purpose, this singlechip chip is connected with aforementioned pronunciation device purpose, aforementioned acoustic pickup in use, need use the storer collection, this storer collection is in order to deposit program and data, each storer in this storer collection can be read use by the aforementioned singlechip chip of aforementioned acoustic pickup, and the main body of aforementioned sound music reading material is a reading matter, on the whole page of aforementioned reading matter all code word is arranged, this all code word belongs to aforementioned sound music reading material, aforementioned acoustic pickup is when being used, and the aforementioned singlechip chip of aforementioned acoustic pickup can utilize the aforementioned reading terminal of aforementioned acoustic pickup to read each code word in all code words of portion in aforementioned all the code word, it is characterized in that:
The code value of each code word in aforementioned all the code word is altogether how to come the value of the amalgamation information that the amalgamation period pronounces relevant with phoneme with several expressions; Aforementioned acoustic pickup when being used, by the aforementioned singlechip chip of aforementioned acoustic pickup according to read aforementioned all code word in the code value of code word make the aforementioned pronunciation device of aforementioned acoustic pickup broadcast the period pronunciation.
7. acoustic pickup according to claim 6 is characterized in that:
Described singlechip chip and described reading terminal are connected directly or indirectly; Described singlechip chip and described pronunciation device are connected directly or indirectly;
Described storer collection is the set of all storeies that can be used by described singlechip chip in the described acoustic pickup; The data storage that program that described acoustic pickup relates to and described acoustic pickup relate to is in described storer collection;
Described all code word is whole code words or the part code word in described all the code word; Described all code word can be one or more code words; Described all code word can be one or more code words;
For a code word in described all code word, its code value is how to come the value of amalgamation information of amalgamation period pronunciation relevant with phoneme with several expressions together, how this several expression comes in the value of amalgamation information of amalgamation period pronunciation one to represent how to come with phoneme the value of the amalgamation information of amalgamation period pronunciation with phoneme, it is the value of representing how to come the amalgamation information of amalgamation period 1 pronunciation with phoneme
How this expression comes the value of amalgamation information of this period 1 pronunciation of amalgamation according to the generation of encoding of a kind of coding rule with phoneme, how this expression comes the value of the amalgamation information of this period 1 pronunciation of amalgamation with phoneme also is the value 1 of amalgamation information, only needing maybe needs a plurality of code words from described all code word that comprises an aforementioned code word just can obtain the value 1 of aforementioned amalgamation information from the value 1 that an aforementioned code word can obtain aforementioned amalgamation information, the described singlechip chip of described acoustic pickup utilizes described reading terminal to can read an aforementioned code word or aforementioned a plurality of code word and obtains the value 1 of aforementioned amalgamation information, and the described singlechip chip of described acoustic pickup is to come that by a kind of rule of decoding the value 1 of reading the aforementioned amalgamation information that sign indicating number obtains is carried out the described pronunciation device that decoding makes described acoustic pickup to broadcast aforementioned period 1 pronunciation.
8. acoustic pickup according to claim 7 is characterized in that:
The value 1 of described amalgamation information is according to the generation of encoding of described coding rule, containing compressed encoding in this coding rule handles, there is compressed encoding to handle when encoding the value 1 that generates described amalgamation information with this coding rule, the described singlechip chip of described acoustic pickup is to make the described pronunciation device of described acoustic pickup broadcast described period 1 pronunciation to reading the 1 execution decoding of yard value of the described amalgamation information that obtains by described decoding rule, and contain the compression coding processing in this decoding rule, the compression coding processing is arranged when with this decoding rule value 1 of described amalgamation information being carried out decoding.
9. acoustic pickup according to claim 8 is characterized in that:
Described decoding rule is special corresponding or general corresponding with described coding rule;
The value 1 of described amalgamation information is the value of amalgamation information that how to represent to come with a plurality of phonemes the pronunciation of the described period 1 of amalgamation, and has at least in these a plurality of phonemes and include different pronunciations values constantly in the phoneme;
When described singlechip chip comes value 1 decoding to described amalgamation information by described decoding rule, be to come the pronunciation of the described period 1 of amalgamation with a plurality of phonemes, and have at least in these a plurality of phonemes and include different pronunciation moment values in the phoneme;
Described singlechip chip is to come the value 1 of the described amalgamation information described pronunciation device that makes described acoustic pickup of decoding is broadcasted the period pronunciation by described decoding rule;
For certain phoneme of the related use of described coding rule, can be used for the amalgamation pronunciation of different periods;
For certain phoneme of the regular related use of described decoding, can be used for the amalgamation pronunciation of different periods;
Can be stored in the described storer collection according to decoding program and the decoded data that described decoding rule is decoded to the value 1 of described amalgamation information.
10. according to claim 6,7,8 or 9 described acoustic pickups is characterized in that:
Described broadcast period pronunciation is non-whole pronunciations of broadcasting whole pronunciations of period or broadcasting period;
Described period 1 pronunciation of described broadcast is non-whole pronunciations of broadcasting whole pronunciations of period 1 or broadcasting period 1;
Aforementioned period 1 pronunciation of described broadcast is non-whole pronunciations of broadcasting whole pronunciations of period 1 or broadcasting period 1;
Described reading matter can be a book, and books are printed on the card of picture and text, papers and magazines or map;
Described sound music reading material has one or more pages and only on a page code word is arranged, and perhaps, a plurality of pages is arranged and code word is all arranged on a plurality of pages; On arbitrary page of described sound music reading material or do not have code word, or a code word is arranged, or a plurality of code words that are in any diverse location are arranged;
A certain code word in described all the code word is to be in the position that need broadcast several periods pronunciations that this code word correspondence broadcasts on the whole page of described reading matter;
Described reading terminal is the electromechanical conduction code reading terminal that can be used for reading the conduction code on the whole page of described sound music reading material, perhaps described reading terminal is to can be used for reading the bar code on the whole page of described sound music reading material or the photo-electric reading terminal of conduction code, perhaps described reading terminal is the photo-electric graphic code reading terminal that can be used for reading the graphic code on the whole page of described sound music reading material, perhaps described reading terminal is the magnetic code reading terminal that can be used for reading the magnetic code on the whole page of described sound music reading material, and perhaps described reading terminal is to can be used for reading the two-dimensional bar on the whole page of described sound music reading material or the photographic-type reading terminal of one-dimensional bar code or conduction code;
For a certain code word in described all the code word, the carrier of this code word is a page or leaf of described reading matter, and perhaps, the carrier of this code word is the nonconducting planar materials in fritter surface, and this planar materials is to stick on the page of described reading matter;
Can make the acoustic pickup of big body band reading terminal form to described acoustic pickup and the external form of body is suitable for allowing sound music reading material be placed in the body, during use, described sound music reading material put in the body use, perhaps, can make the acoustic pickup of big body band reading terminal form to described acoustic pickup and the external form of body is suitable for allowing sound music reading material be placed on the body, during use, described sound music reading material is placed on the body uses, perhaps, described acoustic pickup can be made the acoustic pickup of hand-held, perhaps, can be placed on the desktop acoustic pickup and the small shell that described acoustic pickup is made small shell band reading terminal form; Described acoustic pickup is not integrated with other electronic product, and perhaps, described acoustic pickup is that acoustic pickup that integrate with mobile phone or described is that acoustic pickup that integrate with the MP3 player or described integrates with language repeater.
CNA2007102019784A 2007-10-09 2007-10-09 Sound music reading material and pick-up device Withdrawn CN101409021A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2007102019784A CN101409021A (en) 2007-10-09 2007-10-09 Sound music reading material and pick-up device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2007102019784A CN101409021A (en) 2007-10-09 2007-10-09 Sound music reading material and pick-up device

Publications (1)

Publication Number Publication Date
CN101409021A true CN101409021A (en) 2009-04-15

Family

ID=40572020

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2007102019784A Withdrawn CN101409021A (en) 2007-10-09 2007-10-09 Sound music reading material and pick-up device

Country Status (1)

Country Link
CN (1) CN101409021A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102884546A (en) * 2010-05-05 2013-01-16 林晖 A card and a business method for promotion using the card as a digital data carrier
CN108352124A (en) * 2015-12-21 2018-07-31 大连新锐天地传媒有限公司 Card early education machine system
CN108682422A (en) * 2018-04-24 2018-10-19 上海思依暄机器人科技股份有限公司 A kind of audio-frequency information processing method and system

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102884546A (en) * 2010-05-05 2013-01-16 林晖 A card and a business method for promotion using the card as a digital data carrier
CN108352124A (en) * 2015-12-21 2018-07-31 大连新锐天地传媒有限公司 Card early education machine system
CN108682422A (en) * 2018-04-24 2018-10-19 上海思依暄机器人科技股份有限公司 A kind of audio-frequency information processing method and system

Similar Documents

Publication Publication Date Title
US5565641A (en) Relativistic electronic musical instrument
CN102568261B (en) Auxiliary musical instrument learning machine
US20080167739A1 (en) Autonomous robot for music playing and related method
CN101313477A (en) Music generating device and operating method thereof
WO2021151344A1 (en) Somethod and apparatus for song synthesis, and computer readable storage medium
CN104361884B (en) An electronic device capable of playing vocal notation and its operating method
CN101388149A (en) Keyboard musical instrument learning device and method
CN1160694C (en) Method for realizing music playing on mobile phone and mobile phone capable of realizing music playing
CN101409021A (en) Sound music reading material and pick-up device
CN103425901B (en) original sound data collator
CN1750116A (en) Automatic rendition style determining apparatus and method
CN116434725B (en) Music blind character conversion system for synchronous presentation of embedded audio-visual touch
CN100437442C (en) Chord score input method for hand-held terminal
CN116778904A (en) Audio synthesis methods and devices, training methods and devices, electronic equipment and media
CN101201892A (en) Voice coding talking book and pickup main body
JP5847048B2 (en) Piano roll type score display apparatus, piano roll type score display program, and piano roll type score display method
KR100635455B1 (en) Sheet music barcode generating device and method, sheet music barcode reproducing device and method
CN205541912U (en) Intelligence piano system with suggestion location of key and video function that grades
Laws Beckett in New Musical Composition
US20220310046A1 (en) Methods, information processing device, performance data display system, and storage media for electronic musical instrument
CN1755795B (en) Method of constructing digital voice database of Chinese characters, Chinese digital series synthesis system and method
JPS61140995A (en) Automatic performer
US7214869B2 (en) Method for generating and playing a musical file and a computer-readable media storing the musical file
CN118942431A (en) Twelve-semitone isomorphic musical instruments and their corresponding twelve-semitone isomorphic notation
JP2021096436A (en) Sound element input medium, reading converter, musical instrument system, and musical sound generation method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C04 Withdrawal of patent application after publication (patent law 2001)
WW01 Invention patent application withdrawn after publication