CN104392237B - Fuzzy sign language identification method for data gloves - Google Patents
Fuzzy sign language identification method for data gloves Download PDFInfo
- Publication number
- CN104392237B CN104392237B CN201410655579.5A CN201410655579A CN104392237B CN 104392237 B CN104392237 B CN 104392237B CN 201410655579 A CN201410655579 A CN 201410655579A CN 104392237 B CN104392237 B CN 104392237B
- Authority
- CN
- China
- Prior art keywords
- state
- sign language
- mrow
- mtr
- mtd
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 39
- 238000012545 processing Methods 0.000 claims abstract description 9
- 238000005452 bending Methods 0.000 claims description 36
- 230000006870 function Effects 0.000 claims description 25
- 238000012549 training Methods 0.000 claims description 4
- 210000003811 finger Anatomy 0.000 description 19
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 210000004247 hand Anatomy 0.000 description 1
- 210000004932 little finger Anatomy 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 210000003813 thumb Anatomy 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/20—Movements or behaviour, e.g. gesture recognition
- G06V40/28—Recognition of hand or arm movements, e.g. recognition of deaf sign language
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Health & Medical Sciences (AREA)
- Psychiatry (AREA)
- Social Psychology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Image Analysis (AREA)
Abstract
本发明公开了一种数据手套的模糊手语识别方法,包括以下步骤:获取手部动作数据并对其进行模糊处理,得到手势帧序列;根据手语数据库和概率数据库,对得到的手势帧序列进行识别处理,得到手势帧序列识别结果。本发明通过对手部动作数据进行模糊处理,有效避免了因手掌大小不一造成的识别率较低的问题,而且通过结合手语数据库和概率数据库,使得本发明能根据前后手势选取当前手势最优的识别,大大提高手势识别的准确率。本发明作为一种数据手套的模糊手语识别方法可广泛应用于手语识别产品中。
The present invention discloses a fuzzy sign language recognition method for a data glove, comprising the following steps: obtaining hand motion data and performing fuzzy processing on the data to obtain a gesture frame sequence; performing recognition processing on the obtained gesture frame sequence according to a sign language database and a probability database to obtain a gesture frame sequence recognition result. The present invention effectively avoids the problem of low recognition rate caused by different palm sizes by performing fuzzy processing on the hand motion data, and by combining the sign language database and the probability database, the present invention can select the optimal recognition of the current gesture according to the previous and next gestures, thereby greatly improving the accuracy of gesture recognition. As a fuzzy sign language recognition method for a data glove, the present invention can be widely used in sign language recognition products.
Description
技术领域technical field
本发明涉及手势识别领域,尤其涉及一种数据手套的模糊手语识别方法。The invention relates to the field of gesture recognition, in particular to a fuzzy sign language recognition method for data gloves.
背景技术Background technique
手势识别国内外科学家已经进行过了大量研究。1994年,Ramon M S和Dannil T研制了一种基于物理约束的手抓取过程的手动作合成的控制与抓取系统。1995年,Lee Jintae和Kunii Tosiyasv L研究用摄像机获得手的运动图像数据来自动分析三维手势,实现三维手势重构。1997年,加拿大多伦多大学的Sidney S F研究的Glove TalkII系统是目前最有影响的手势接口系统,他采用神经网络将用户手势转换成手势语言参数,通过语言合成器合成为语言输出。我国高文等人,也进行了基于手势和人的行为动作识别的手语合成技术的研究。Scientists at home and abroad have done a lot of research on gesture recognition. In 1994, Ramon M S and Dannil T developed a hand motion synthesis control and grasping system based on the physical constraints of the hand grasping process. In 1995, Lee Jintae and Kunii Tosiyasv L researched the motion image data of the hand obtained by the camera to automatically analyze the three-dimensional gestures and realize the reconstruction of the three-dimensional gestures. In 1997, the Glove Talk II system researched by Sidney S F of the University of Toronto in Canada is currently the most influential gesture interface system. He uses a neural network to convert user gestures into gesture language parameters, which are synthesized into language output through a speech synthesizer. my country's Gao Wen and others have also carried out research on sign language synthesis technology based on gesture and human behavior recognition.
目前基于传感器数据手套的手势识别是从传感器直接获得手指运动特征数据,然后通过匹配的算法将其翻译成人可直接识别的文字或声音。但是目前手语翻译技术多数受限于词语的识别,并且随着手语库的增多容易造成翻译混淆,并且由于人手的大小不一,也比较容易造成不同人之间的手势识别效率差别较大,识别准确率较低。At present, gesture recognition based on sensor data gloves directly obtains finger motion feature data from sensors, and then translates them into words or sounds that can be directly recognized by humans through matching algorithms. However, most of the current sign language translation technologies are limited to the recognition of words, and as the number of sign language databases increases, it is easy to cause confusion in translation, and due to the different sizes of hands, it is also easy to cause large differences in gesture recognition efficiency between different people. The accuracy rate is lower.
发明内容Contents of the invention
为了解决上述技术问题,本发明的目的是提供一种能提高手势识别的准确率的一种数据手套的模糊手语识别方法。In order to solve the above technical problems, the object of the present invention is to provide a fuzzy sign language recognition method for a data glove that can improve the accuracy of gesture recognition.
本发明所采用的技术方案是:The technical scheme adopted in the present invention is:
一种数据手套的模糊手语识别方法,包括以下步骤:A fuzzy sign language recognition method for data gloves, comprising the following steps:
A、获取手部动作数据并对其进行模糊处理,得到手势帧序列;A. Obtain hand movement data and perform blurring processing on it to obtain a sequence of gesture frames;
B、根据手语数据库和概率数据库,对得到的手势帧序列进行识别处理,得到手势帧序列识别结果。B. Perform recognition processing on the obtained gesture frame sequence according to the sign language database and the probability database, and obtain the recognition result of the gesture frame sequence.
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述的步骤A包括:As a further improvement of the fuzzy sign language recognition method of a kind of data glove, described step A includes:
A1、获取手部动作数据中的各个手指的弯曲角度,并根据预设的弯曲隶属度函数,得出对应的各个手指的弯曲状态;A1. Obtain the bending angle of each finger in the hand movement data, and obtain the corresponding bending state of each finger according to the preset bending membership function;
A2、获取手部动作数据中的手掌俯仰角,并根据预设的俯仰隶属度函数进行计算,得出结果值最大的即为对应的俯仰状态;A2. Obtain the palm pitch angle in the hand movement data, and calculate it according to the preset pitch membership function, and obtain the largest result value as the corresponding pitch state;
A3、获取手部动作数据中的手掌倾斜角,并根据预设的倾斜隶属度函数进行计算,得出结果值最大的即为对应的倾斜状态;A3. Obtain the palm inclination angle in the hand movement data, and calculate it according to the preset inclination membership function, and obtain the largest result value as the corresponding inclination state;
A4、获取手部动作数据中的手掌偏航角,并根据预设的偏航隶属度函数进行计算,得出结果值最大的即为对应的偏航状态;A4. Obtain the palm yaw angle in the hand movement data, and calculate it according to the preset yaw membership function, and the one with the largest result value is the corresponding yaw state;
A5、根据计算得到的俯仰状态、倾斜状态和偏航状态,结合预设的规则,得出对应的手掌朝向;A5. According to the calculated pitch state, tilt state and yaw state, combined with the preset rules, the corresponding palm orientation is obtained;
A6、根据手掌朝向和各个手指的弯曲状态,得出手势帧,并进而得出手势帧序列。A6. According to the orientation of the palm and the bending state of each finger, a gesture frame is obtained, and then a sequence of gesture frames is obtained.
作为所述的一种数据手套的模糊手语识别方法的进一步改进,其特征在于:所述的步骤B包括:As a further improvement of the fuzzy sign language recognition method of the data glove, it is characterized in that: the step B includes:
B1、获取手势帧序列,依照从头到尾的顺序提取手势帧;B1. Obtain a sequence of gesture frames, and extract the gesture frames in sequence from beginning to end;
B2、将提取的手势帧分别依次放入对应的结点中;B2. Put the extracted gesture frames into the corresponding nodes in turn;
B3、依次从手语数据库中提取各个手势帧对应的所有字词,并将其附加进对应的结点中,直到手势帧序列上所有手势帧均完成手语数据库的检索;B3. Extract all the words corresponding to each gesture frame from the sign language database in turn, and attach them to the corresponding nodes until all the gesture frames on the gesture frame sequence complete the sign language database retrieval;
B4、将相邻两个结点所附带的字词按照结点的顺序分别两两组合,组合中由上一个结点的字词指向下一个结点的字词;B4, the words attached to two adjacent nodes are combined in pairs according to the order of the nodes, and the words of the previous node point to the words of the next node in the combination;
B5、将所有组合分别在概率数据库中索引出各组合的概率;B5. Indexing all combinations in the probability database to obtain the probability of each combination;
B6、查找出各组合组成的句子中概率和最大的一个句子,得出手势帧序列识别结果。B6. Find out the sentence with the largest probability sum among the sentences formed by each combination, and obtain the gesture frame sequence recognition result.
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述的步骤A1中的弯曲隶属度函数为:As a further improvement of the fuzzy sign language recognition method of a data glove, the bending membership function in the step A1 is:
其中,X∈U0,U0表示手指弯曲角度,U0=[0,120],在U0上建立手指弯曲角度的三个模糊集A0表示弯曲状态为“伸直”的状态,A1=表示弯曲状态为“半弯曲”的状态,A2=表示弯曲状态为“紧握”的状态。Among them, X∈U 0 , U 0 represents the bending angle of the finger, U 0 = [0, 120], three fuzzy sets A 0 of the bending angle of the finger are established on U 0 to represent the state of the bending state as "straightening", A 1 = indicates that the bending state is a "semi-bent" state, and A 2 = indicates that the bending state is a "tight grip" state.
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述的步骤A2中的俯仰隶属度函数为:As a further improvement of the fuzzy sign language recognition method of a kind of data glove, the pitch membership function in the described step A2 is:
其中,x∈U1,U1表示手掌俯仰角,U1=[-90,90],在U1上建立俯仰角的三个模糊集B0表示俯仰角为“俯”的状态,B1=表示俯仰角为“水平”的状态,B2=表示俯仰角为“仰”的状态。Among them, x∈U 1 , U 1 represents the pitch angle of the palm, U 1 = [-90, 90], three fuzzy sets B 0 of the pitch angle are established on U 1 to represent the state that the pitch angle is "prostrate", B 1 = indicates the state where the pitch angle is "horizontal", and B 2 = indicates the state where the pitch angle is "upward".
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述的步骤A3中的倾斜隶属度函数为:As a further improvement of the fuzzy sign language recognition method of a kind of data glove, the oblique membership function in the step A3 is:
其中,y∈U2,U2表示手掌倾斜角,U2=[-180,180],在U2上建立倾斜角的三个模糊集C0表示倾斜角为“左倾”的状态,C1=表示倾斜角为“水平”的状态,C2=表示倾斜角为“右倾”的状态,C3=表示倾斜角为“翻转水平”的状态。Among them, y∈U 2 , U 2 represents the inclination angle of the palm, U 2 = [-180, 180], and the three fuzzy sets C 0 establishing the inclination angle on U 2 represent the state that the inclination angle is "left leaning", C 1 = Indicates the state where the inclination angle is "horizontal", C 2 = indicates the state where the inclination angle is "right-inclined", C 3 = indicates the state where the inclination angle is "flip horizontal".
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述的步骤A3中的偏航隶属度函数为:As a further improvement of the fuzzy sign language recognition method of a kind of data glove, the yaw membership function in the described step A3 is:
其中,z∈U3,U3表示手掌偏航角,U3=[0,360],在U3上建立偏航角的三个模糊集D0表示偏航角为“前”的状态,D1=表示偏航角为“右”的状态,D2=表示偏航角为“后”的状态,D3=表示偏航角为“左”的状态。Among them, z∈U 3 , U 3 represents the yaw angle of the palm, U 3 =[0, 360], and the three fuzzy sets D 0 of the yaw angle established on U 3 represent the state that the yaw angle is "front", D 1 = indicates the state where the yaw angle is "right", D 2 = indicates the state where the yaw angle is "back", D 3 = indicates the state where the yaw angle is "left".
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述手语数据库以手指弯曲状态和手掌朝向形成的手势帧作为索引,手势对应的字词作为被索引的内容。As a further improvement of the fuzzy sign language recognition method of a data glove, the sign language database uses gesture frames formed by finger bending state and palm orientation as indexes, and words corresponding to gestures as indexed content.
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述概率数据库中的字词组合的概率为利用语言模型训练工具SRILM得到。As a further improvement of the fuzzy sign language recognition method of a data glove, the probability of word combinations in the probability database is obtained by using the language model training tool SRILM.
本发明的有益效果是:The beneficial effects of the present invention are:
本发明一种数据手套的模糊手语识别方法通过对手部动作数据进行模糊处理,有效避免了因手掌大小不一造成的识别率较低的问题,而且通过结合手语数据库和概率数据库,使得本发明能根据前后手势选取当前手势最优的识别,大大提高手势识别的准确率。A fuzzy sign language recognition method for data gloves in the present invention effectively avoids the problem of low recognition rate caused by different palm sizes by performing fuzzy processing on the hand movement data, and by combining the sign language database and the probability database, the present invention can Select the optimal recognition of the current gesture according to the front and back gestures, which greatly improves the accuracy of gesture recognition.
附图说明Description of drawings
下面结合附图对本发明的具体实施方式作进一步说明:The specific embodiment of the present invention will be further described below in conjunction with accompanying drawing:
图1是本发明一种数据手套的模糊手语识别方法的步骤流程图;Fig. 1 is the step flow chart of the fuzzy sign language recognition method of a kind of data glove of the present invention;
图2是本发明一种数据手套的模糊手语识别方法步骤A的步骤流程图;Fig. 2 is a flow chart of step A of the fuzzy sign language recognition method of a data glove according to the present invention;
图3是本发明一种数据手套的模糊手语识别方法步骤B的步骤流程图;Fig. 3 is a step flow chart of step B of a fuzzy sign language recognition method of a data glove according to the present invention;
图4是本发明一种数据手套的模糊手语识别方法中手势帧的构成示意图。Fig. 4 is a schematic diagram of the composition of a gesture frame in a fuzzy sign language recognition method for a data glove according to the present invention.
具体实施方式detailed description
参考图1,本发明一种数据手套的模糊手语识别方法,包括以下步骤:With reference to Fig. 1, the fuzzy sign language recognition method of a kind of data glove of the present invention comprises the following steps:
A、获取手部动作数据并对其进行模糊处理,得到手势帧序列;A. Obtain hand movement data and perform blurring processing on it to obtain a sequence of gesture frames;
B、根据手语数据库和概率数据库,对得到的手势帧序列进行识别处理,得到手势帧序列识别结果。B. Perform recognition processing on the obtained gesture frame sequence according to the sign language database and the probability database, and obtain the recognition result of the gesture frame sequence.
参考图2,作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述的步骤A包括:With reference to Fig. 2, as the further improvement of the fuzzy sign language recognition method of a kind of data glove, described step A comprises:
A1、获取手部动作数据中的各个手指的弯曲角度,并根据预设的弯曲隶属度函数,得出对应的各个手指的弯曲状态;A1. Obtain the bending angle of each finger in the hand movement data, and obtain the corresponding bending state of each finger according to the preset bending membership function;
A2、获取手部动作数据中的手掌俯仰角,并根据预设的俯仰隶属度函数进行计算,得出结果值最大的即为对应的俯仰状态;A2. Obtain the palm pitch angle in the hand movement data, and calculate it according to the preset pitch membership function, and obtain the largest result value as the corresponding pitch state;
A3、获取手部动作数据中的手掌倾斜角,并根据预设的倾斜隶属度函数进行计算,得出结果值最大的即为对应的倾斜状态;A3. Obtain the palm inclination angle in the hand movement data, and calculate it according to the preset inclination membership function, and obtain the largest result value as the corresponding inclination state;
A4、获取手部动作数据中的手掌偏航角,并根据预设的偏航隶属度函数进行计算,得出结果值最大的即为对应的偏航状态;A4. Obtain the palm yaw angle in the hand movement data, and calculate it according to the preset yaw membership function, and the one with the largest result value is the corresponding yaw state;
A5、根据计算得到的俯仰状态、倾斜状态和偏航状态,结合预设的规则,得出对应的手掌朝向;A5. According to the calculated pitch state, tilt state and yaw state, combined with the preset rules, the corresponding palm orientation is obtained;
A6、根据手掌朝向和各个手指的弯曲状态,得出手势帧,并进而得出手势帧序列。A6. According to the orientation of the palm and the bending state of each finger, a gesture frame is obtained, and then a sequence of gesture frames is obtained.
参考图3,作为所述的一种数据手套的模糊手语识别方法的进一步改进,其特征在于:所述的步骤B包括:With reference to Fig. 3, as the further improvement of the fuzzy sign language recognition method of a kind of data glove, it is characterized in that: described step B comprises:
B1、获取手势帧序列,依照从头到尾的顺序提取手势帧;B1. Obtain a sequence of gesture frames, and extract the gesture frames in sequence from beginning to end;
B2、将提取的手势帧分别依次放入对应的结点中;B2. Put the extracted gesture frames into the corresponding nodes in turn;
B3、依次从手语数据库中提取各个手势帧对应的所有字词,并将其附加进对应的结点中,直到手势帧序列上所有手势帧均完成手语数据库的检索;B3. Extract all the words corresponding to each gesture frame from the sign language database in turn, and attach them to the corresponding nodes until all the gesture frames on the gesture frame sequence complete the sign language database retrieval;
B4、将相邻两个结点所附带的字词按照结点的顺序分别两两组合,组合中由上一个结点的字词指向下一个结点的字词;B4, the words attached to two adjacent nodes are combined in pairs according to the order of the nodes, and the words of the previous node point to the words of the next node in the combination;
B5、将所有组合分别在概率数据库中索引出各组合的概率;B5. Indexing all combinations in the probability database to obtain the probability of each combination;
B6、查找出各组合组成的句子中概率和最大的一个句子,得出手势帧序列识别结果。B6. Find out the sentence with the largest probability sum among the sentences formed by each combination, and obtain the gesture frame sequence recognition result.
比如,手势序列S有两个手势帧依次为A和B,假设手势A有字词“你”和“那”,手势B有字词“好”和“正”,假设“你好”的概率为0.0052,“你正”的概率为“0.00045”,“那好”的概率为0.0078,“那正”的概率为0.00032,则手势序列S的识别结果为概率最大的那个句子,即识别结果为“那好”。For example, gesture sequence S has two gesture frames A and B in turn, assuming gesture A has the words "you" and "that", and gesture B has the words "good" and "positive", assuming the probability of "hello" is 0.0052, the probability of "you are right" is "0.00045", the probability of "that's good" is 0.0078, and the probability of "that is right" is 0.00032, then the recognition result of gesture sequence S is the sentence with the highest probability, that is, the recognition result is "Well".
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述的步骤A1中的弯曲隶属度函数为:As a further improvement of the fuzzy sign language recognition method of a data glove, the bending membership function in the step A1 is:
其中,X∈U0,U0表示手指弯曲角度,U0=[0,120],在U0上建立手指弯曲角度的三个模糊集A0表示弯曲状态为“伸直”的状态,A1=表示弯曲状态为“半弯曲”的状态,A2=表示弯曲状态为“紧握”的状态。Among them, X∈U 0 , U 0 represents the bending angle of the finger, U 0 = [0, 120], three fuzzy sets A 0 of the bending angle of the finger are established on U 0 to represent the state of the bending state as "straightening", A 1 = indicates that the bending state is a "semi-bent" state, and A 2 = indicates that the bending state is a "tight grip" state.
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述的步骤A2中的俯仰隶属度函数为:As a further improvement of the fuzzy sign language recognition method of a kind of data glove, the pitch membership function in the described step A2 is:
其中,x∈U1,U1表示手掌俯仰角,U1=[-90,90],在U1上建立俯仰角的三个模糊集B0表示俯仰角为“俯”的状态,B1=表示俯仰角为“水平”的状态,B2=表示俯仰角为“仰”的状态。Among them, x∈U 1 , U 1 represents the pitch angle of the palm, U 1 = [-90, 90], three fuzzy sets B 0 of the pitch angle are established on U 1 to represent the state that the pitch angle is "prostrate", B 1 = indicates the state where the pitch angle is "horizontal", and B 2 = indicates the state where the pitch angle is "upward".
若手掌俯仰角为42,将x=42代入公式二中的隶属度函数计算,得出B0(42)=0.1,B1(42)=0.3,B2(42)=0,B1>B0>B2,则该次输入的x值的俯仰角为水平。If the palm pitch angle is 42, substituting x=42 into the membership function calculation in Formula 2, it is obtained that B 0 (42)=0.1, B 1 (42)=0.3, B 2 (42)=0, B 1 > B 0 >B 2 , then the pitch angle of the x value input this time is horizontal.
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述的步骤A3中的倾斜隶属度函数为:As a further improvement of the fuzzy sign language recognition method of a kind of data glove, the oblique membership function in the step A3 is:
其中,y∈U2,U2表示手掌倾斜角,U2=[-180,180],在U2上建立倾斜角的三个模糊集C0表示倾斜角为“左倾”的状态,C1=表示倾斜角为“水平”的状态,C2=表示倾斜角为“右倾”的状态,C3=表示倾斜角为“翻转水平”的状态。Among them, y∈U 2 , U 2 represents the inclination angle of the palm, U 2 = [-180, 180], and the three fuzzy sets C 0 establishing the inclination angle on U 2 represent the state that the inclination angle is "left leaning", C 1 = Indicates the state where the inclination angle is "horizontal", C 2 = indicates the state where the inclination angle is "right-inclined", C 3 = indicates the state where the inclination angle is "flip horizontal".
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述的步骤A3中的偏航隶属度函数为:As a further improvement of the fuzzy sign language recognition method of a kind of data glove, the yaw membership function in the described step A3 is:
其中,z∈U3,U3表示手掌偏航角,U3=[0,360],在U3上建立偏航角的三个模糊集D0表示偏航角为“前”的状态,D1=表示偏航角为“右”的状态,D2=表示偏航角为“后”的状态,D3=表示偏航角为“左”的状态。Among them, z∈U 3 , U 3 represents the yaw angle of the palm, U 3 =[0, 360], and the three fuzzy sets D 0 of the yaw angle established on U 3 represent the state that the yaw angle is "front", D 1 = indicates the state where the yaw angle is "right", D 2 = indicates the state where the yaw angle is "back", D 3 = indicates the state where the yaw angle is "left".
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述手语数据库以手指弯曲状态和手掌朝向形成的手势帧作为索引,手势对应的字词作为被索引的内容。As a further improvement of the fuzzy sign language recognition method of a data glove, the sign language database uses gesture frames formed by finger bending state and palm orientation as indexes, and words corresponding to gestures as indexed content.
作为所述的一种数据手套的模糊手语识别方法的进一步改进,所述概率数据库中的字词组合的概率为利用语言模型训练工具SRILM得到。As a further improvement of the fuzzy sign language recognition method of a data glove, the probability of word combinations in the probability database is obtained by using the language model training tool SRILM.
参考图4,其中,本发明中的一个手势帧序列是指一个完整的手语句子,包含N个手势帧。一个手势帧是由4个字节(32位)组成,其中第32位保留,第26~30位和第11~15位分别为左手和右手的“手掌朝向”状态,第16~25位和第1位~第10位分别为存储左手和右手的手指弯曲状态,其中一个手指弯曲状态占2个位,拇指、食指、中指、无名指和小指分别从高位到低位排序,第0位为校验位。Referring to FIG. 4 , a sequence of gesture frames in the present invention refers to a complete gesture sentence, including N gesture frames. A gesture frame is composed of 4 bytes (32 bits), of which the 32nd bit is reserved, the 26th to 30th and the 11th to 15th are the "palm facing" status of the left hand and the right hand respectively, and the 16th to 25th and The 1st to 10th digits store the bending state of the fingers of the left hand and the right hand respectively. One of the finger bending states occupies 2 digits. The thumb, index finger, middle finger, ring finger and little finger are sorted from high to low respectively, and the 0th digit is the checksum. bit.
手语帧序列是由多个手语帧按时间顺序进行排列的,一个手语帧对应多个字或词,因为一个手势在不同的语境景代表着不同的意思,并且在本方法中已经对手势进行了抽象,使一些只有轻微差别的手势抽象成相同的手语帧。而手势识别部分就是将手语帧序列中的手语帧根据上下文的统计概率将最合适的字词提取出来并与手语帧序列中的其他手语帧形成最理想的句子。The sign language frame sequence is arranged in chronological order by multiple sign language frames, and one sign language frame corresponds to multiple words or words, because a gesture represents different meanings in different contexts, and gestures have been analyzed in this method abstraction, so that some gestures with only slight differences are abstracted into the same sign language frame. The gesture recognition part is to extract the most suitable words from the sign language frames in the sign language frame sequence according to the statistical probability of the context and form the most ideal sentence with other sign language frames in the sign language frame sequence.
手语识别之前必须建立好手语数据库和概率数据库。Sign language database and probability database must be established before sign language recognition.
手语数据库的建立是指参照《中国手语》上下册,将里面的内容利用第一部分介绍方法提出去手指的弯曲状态和手掌的朝向状态并形成手语帧,并以这个4个字节(32位)的手语帧作为索引,而手势对应的字词作为被索引的内容,相同的手势对应的字词放在一个索引下,当以该索引搜索时,将引出该索引的全部内容。如:“你”和“那”的手势是一样的,则它们的索引是一样的,假设该索引为A,搜索A时将引出“你”,“那”。The establishment of the sign language database refers to referring to the first and second volumes of "Chinese Sign Language", using the first part of the introduction method to propose the bending state of the fingers and the orientation state of the palm to form a sign language frame, and use this 4 bytes (32 bits) The sign language frame is used as the index, and the words corresponding to the gesture are used as the indexed content. The words corresponding to the same gesture are placed under an index. When searching with this index, the entire content of the index will be drawn. For example: the gestures of "you" and "that" are the same, then their indexes are the same, assuming that the index is A, when searching for A, "you" and "that" will be drawn.
概率数据库是指字词在语料库中单个字词的出现的频率,以及每一个词后面跟着出现另一个词的频率的集合;所谓的语料库是指手语方面日常用语句子或文章。对于单个字词和一个词跟住出现另一个词的概率的则是利用语言模型训练工具SRILM得到。并将它们以下面的格式来存储。以字词或字词组合索引出该字词或字词组合的概率,如“绿叶”索引出概率为0.000000147332,以“至于”“外貌”索引出概率“0.000145517086”。概率数据库的索引是以两个词为基础,如果在\2-gram\中找不到这两个词的组合,则在\1-gram\分别找到这两个词单独的概率,设为P1和P2,则该两个词的组合概率为P=P1*P2*e,e为自然常数约2.71828182845905。The probability database refers to the frequency of occurrence of a single word in the corpus, and the collection of the frequency of each word followed by another word; the so-called corpus refers to daily sentences or articles in sign language. For a single word and the probability of a word followed by another word, it is obtained by using the language model training tool SRILM. and store them in the following format. The probability of the word or word combination is indexed by word or word combination. For example, the probability of "green leaf" is 0.000000147332, and the probability of "as for" and "appearance" is "0.000145517086". The index of the probability database is based on two words. If the combination of these two words cannot be found in \2-gram\, then find the individual probability of these two words in \1-gram\, set to P1 and P2, then the combined probability of the two words is P=P1*P2*e, where e is a natural constant of about 2.71828182845905.
以上是对本发明的较佳实施进行了具体说明,但本发明创造并不限于所述实施例,熟悉本领域的技术人员在不违背本发明精神的前提下还可做作出种种的等同变形或替换,这些等同的变形或替换均包含在本申请权利要求所限定的范围内。The above is a specific description of the preferred implementation of the present invention, but the invention is not limited to the described embodiments, and those skilled in the art can also make various equivalent deformations or replacements without violating the spirit of the present invention. , these equivalent modifications or replacements are all within the scope defined by the claims of the present application.
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410655579.5A CN104392237B (en) | 2014-11-17 | 2014-11-17 | Fuzzy sign language identification method for data gloves |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410655579.5A CN104392237B (en) | 2014-11-17 | 2014-11-17 | Fuzzy sign language identification method for data gloves |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104392237A CN104392237A (en) | 2015-03-04 |
CN104392237B true CN104392237B (en) | 2017-10-10 |
Family
ID=52610138
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410655579.5A Expired - Fee Related CN104392237B (en) | 2014-11-17 | 2014-11-17 | Fuzzy sign language identification method for data gloves |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104392237B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018158753A1 (en) * | 2017-03-03 | 2018-09-07 | Symbiosis International University | System and method enabling an interactive wearable as an educational supplement for hearing impaired individuals |
CN109558826B (en) * | 2018-11-23 | 2021-04-20 | 武汉灏存科技有限公司 | Gesture recognition method, system, equipment and storage medium based on fuzzy clustering |
CN110362195A (en) * | 2019-06-10 | 2019-10-22 | 东南大学 | Gesture identification and interactive system based on bistable state coding and Flexiable angular transducer |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0777186A1 (en) * | 1995-11-28 | 1997-06-04 | Nec Corporation | Language data storage and reproduction apparatus |
CN102193633A (en) * | 2011-05-25 | 2011-09-21 | 广州畅途软件有限公司 | dynamic sign language recognition method for data glove |
CN103175502A (en) * | 2013-02-07 | 2013-06-26 | 广州畅途软件有限公司 | Attitude angle detecting method based on low-speed movement of data glove |
CN103309448A (en) * | 2013-05-31 | 2013-09-18 | 华东师范大学 | Gesture recognition method with symbol sequence matching based on three-dimensional acceleration |
CN103488287A (en) * | 2013-09-06 | 2014-01-01 | 广州畅途软件有限公司 | Glove-based data input method |
-
2014
- 2014-11-17 CN CN201410655579.5A patent/CN104392237B/en not_active Expired - Fee Related
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0777186A1 (en) * | 1995-11-28 | 1997-06-04 | Nec Corporation | Language data storage and reproduction apparatus |
CN102193633A (en) * | 2011-05-25 | 2011-09-21 | 广州畅途软件有限公司 | dynamic sign language recognition method for data glove |
CN103175502A (en) * | 2013-02-07 | 2013-06-26 | 广州畅途软件有限公司 | Attitude angle detecting method based on low-speed movement of data glove |
CN103309448A (en) * | 2013-05-31 | 2013-09-18 | 华东师范大学 | Gesture recognition method with symbol sequence matching based on three-dimensional acceleration |
CN103488287A (en) * | 2013-09-06 | 2014-01-01 | 广州畅途软件有限公司 | Glove-based data input method |
Also Published As
Publication number | Publication date |
---|---|
CN104392237A (en) | 2015-03-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2023029351A1 (en) | Self-supervised learning-based method, apparatus and device for predicting properties of drug small molecules | |
CN108427670A (en) | A kind of sentiment analysis method based on context word vector sum deep learning | |
CN110232186A (en) | The knowledge mapping for merging entity description, stratification type and text relation information indicates learning method | |
CN107609121A (en) | Newsletter archive sorting technique based on LDA and word2vec algorithms | |
CN112183435B (en) | A two-stage hand object detection method | |
CN108108699A (en) | Merge deep neural network model and the human motion recognition method of binary system Hash | |
CN107133213A (en) | A kind of text snippet extraction method and system based on algorithm | |
CN104573630B (en) | Online Recognition Method of Multiple EEG Patterns Based on Probability Output of Dual Support Vector Machines | |
CN105512692A (en) | BLSTM-based online handwritten mathematical expression symbol recognition method | |
CN105631468A (en) | RNN-based automatic picture description generation method | |
CN107742095A (en) | Chinese sign language recognition method based on convolutional neural network | |
CN109408833A (en) | A kind of interpretation method, device, equipment and readable storage medium storing program for executing | |
CN103246891A (en) | Chinese sign language recognition method based on kinect | |
CN110321566A (en) | Chinese name entity recognition method, device, computer equipment and storage medium | |
CN104392237B (en) | Fuzzy sign language identification method for data gloves | |
CN110135277B (en) | Human behavior recognition method based on convolutional neural network | |
CN112347761B (en) | BERT-based drug relation extraction method | |
CN101577062A (en) | Space encoding-based method for realizing interconversion between sign language motion information and text message | |
CN103150383B (en) | A kind of event evolution analysis method of short text data | |
CN111125396B (en) | Image retrieval method of single-model multi-branch structure | |
CN106097835A (en) | A kind of deaf mute exchanges the method for intelligent assistance system and exchange | |
CN109993131A (en) | A design intent discrimination system and method based on multimodal signal fusion | |
CN105912525A (en) | Sentiment classification method for semi-supervised learning based on theme characteristics | |
CN112489790B (en) | Method, device, equipment and storage medium for determining key data | |
CN105740354A (en) | Adaptive potential Dirichlet model selection method and apparatus |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20190211 Address after: 523808 Room 224, Building D-1, Innovation City, Xincheng Road University, Songshan Lake High-tech Industrial Development Zone, Dongguan City, Guangdong Province Patentee after: DONGGUAN YILIAN INTERATION INFORMATION TECHNOLOGY Co.,Ltd. Address before: 510663 Colorful Frequency Road No. 9, 801C, 801D, Guangzhou High-tech Industrial Development Zone, Guangdong Province Patentee before: GUANGZHOU CHANGTU SOFTWARE Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20231011 Address after: Room 2430, building 526, Songhu Park, Dongguan Province Patentee after: DONGGUAN YILIAN INTERATION INFORMATION TECHNOLOGY Co.,Ltd. Patentee after: Fantasy (Zhuhai) Technology Co.,Ltd. Address before: 523808 Room 224, Building D-1, Innovation City, Xincheng Road University, Songshan Lake High-tech Industrial Development Zone, Dongguan City, Guangdong Province Patentee before: DONGGUAN YILIAN INTERATION INFORMATION TECHNOLOGY Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20171010 |
|
CF01 | Termination of patent right due to non-payment of annual fee |