EP0905644A2 - Hand gesture recognizing device - Google Patents
Hand gesture recognizing device
- Publication number
- EP0905644A2 (application EP98118008A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- hand gesture
- user
- hand
- category
- movement
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/20—Movements or behaviour, e.g. gesture recognition
- G06V40/28—Recognition of hand or arm movements, e.g. recognition of deaf sign language
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N13/20—Image signal generators
- H04N13/204—Image signal generators using stereoscopic image cameras
- H04N13/239—Image signal generators using stereoscopic image cameras using two 2D image sensors having a relative position equal to or related to the interocular distance
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N13/20—Image signal generators
- H04N13/204—Image signal generators using stereoscopic image cameras
- H04N13/243—Image signal generators using stereoscopic image cameras using three or more 2D image sensors
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N2013/0074—Stereoscopic image analysis
- H04N2013/0081—Depth or disparity estimation from stereoscopic image signals
Definitions
- the present invention relates to hand gesture recognizing devices, and more particularly to a hand gesture recognizing device which can automatically recognize hand gestures.
- a first method is to measure movement of the body by sensing movement of a sensor attached to the body.
- For example, refer to "Hand Gesture Recognizing Method and Applications," Yuichi Takahashi et al., The Institute of Electronics, Information and Communication Engineers, Papers D-2, Vol. J73-D-2, No. 12, 1990, n.pag., and Japanese Patent Laying-Open No. 8-115408.
- the hands are sensed by a camera with multi-colored gloves put on the hands so as to measure movements of the fingers by extracting information about the outline of the hands through color information.
- the first to third methods described above require that users be equipped with a sensor, gloves, or an optical fiber, which gives an uncomfortable feeling to the users and limits movements of the users.
- the conventional methods which recognize movements by using absolute coordinate values of body parts obtained in advance from a particular person are susceptible to recognition errors due to differences in body size among actual users, movement of the body during performance, and the like. It may be suggested that coordinate values of the body parts be recorded for a plurality of users. However, this method will encounter the problem that an enormous amount of data must be recorded in proportion to the number of users.
- measured hand movements are compared with hand movements corresponding to hand gesture words recorded in a dictionary, word by word, for recognition. This raises the problem that the time required for recognition exponentially increases as the number of words to be recognized increases.
- an object of the present invention is to provide a hand gesture recognizing device which can recognize and translate hand gestures without requiring that users be equipped with any tools.
- Another object of the present invention is to provide a hand gesture recognizing device which can correctly recognize and translate hand gestures regardless of differences in body size among users, movements of the body during performance, and the like.
- Still another object of the present invention is to provide a hand gesture recognizing device which can achieve recognizing and translating processing in a short time even with an increased number of words to be recognized.
- a first aspect is directed to a hand gesture recognizing device for recognizing hand gestures performed by a user.
- the device extracts features of body parts from stereoscopic image data obtained by taking pictures of a user and detects three-dimensional movement of a hand gesture by utilizing parallax of the stereoscopic image data, and recognizes the hand gesture word on the basis of the detected result.
- This allows the hand gesture to be recognized without requiring the user to be equipped with any tool, and without contact.
- the device further divides the space surrounding the user into a plurality of regions corresponding to the body of the user and detects how the three-dimensional spatial positions of the user's hands move with respect to the divided regions. Accordingly, it can always perform recognition properly in accordance with the user's body, independently of the user's body size and body movement, which remarkably improves the recognition accuracy.
- a position of a body part which does not appear in the feature image is estimated and the space surrounding the user is divided into still smaller regions on the basis of the estimated position, which enables more accurate recognition.
- the device divides regions only when a difference value between feature images adjacent in time reaches or exceeds a predetermined threshold, which reduces the calculating load for the region dividing.
- hierarchically checking hand gesture words allows the recognition to be achieved in a shorter time as compared with the conventional method of calculating the degrees of similarity for every single word.
- the device extracts, as a control point, a sampling point at which the direction of movement largely changes from among a plurality of sampling points existing between the start point and the end point of the movement, and represents the movement of the user's hands by using the start point, end point, and control points. Accordingly, the hand movement of the user can be represented more simply than with the method in which the hand movement is represented by using all sampling points, and as a result the hand gesture words can be determined more quickly.
- the precision in representing movement can be freely changed by changing the threshold.
- hierarchically determining the hand gesture word enables recognition to be made in a shorter time as compared with the conventional method in which degrees of similarity are calculated for all words.
- Fig.1 is a block diagram showing the structure of a sign language recognizing device according to a first embodiment of the present invention.
- the sign language recognizing device of this embodiment includes a photographing device 1, an image storage device 2, a feature image extracting device 3, a spatial position calculating device 4, a region dividing device 5, a hand gesture detecting device 6, a category detecting device 8, a word recognizing device 9, a category dictionary 10, a word dictionary 11, an output device 12, and a color transformation table creating device 13.
- the photographing device 1 includes a plurality of TV cameras, which take stereoscopic pictures of movements of a user.
- the image storage device 2 stores a plurality of frames of stereoscopic image data outputted from the photographing device 1.
- the color transformation table creating device 13 creates three color transformation tables respectively corresponding to first to third channels on the basis of colors of picture elements specified on a representative image selected by an operator from among the plurality of frames of stereoscopic image data stored in the image storage device 2.
- the feature image extracting device 3 reads the stereoscopic image data in order from the image storage device 2 and transforms the color data of the picture elements in the read stereoscopic image data according to the color transformation tables created by the color transformation table creating device 13 to extract a stereoscopic feature image showing body features of the user, which is disassembled and outputted in the first to third channels.
- the spatial position calculating device 4 calculates three-dimensional spatial positions of blobs (images each regarded as a lump) included in the individual channels by utilizing parallax of the stereoscopic images outputted in the individual channels from the feature image extracting device 3.
- the region dividing device 5 divides the three-dimensional space surrounding the body on the basis of the stereoscopic feature image outputted from the feature image extracting device 3 and the three-dimensional spatial positions of the blobs calculated in the spatial position calculating device 4 and creates spatial region codes for defining the divided regions.
- the hand gesture detecting device 6 detects how the blobs corresponding to the hands move in space in relation to the spatial region codes created by the region dividing device 5, on the basis of the stereoscopic image outputted from the feature image extracting device 3, the three-dimensional spatial positions of the blobs calculated in the spatial position calculating device 4, and those spatial region codes.
- the category dictionary 10 contains features of sign language gestures classified into categories (groups each including similar sign language gestures).
- the category detecting device 8 detects which of the categories included in the category dictionary 10 the features of the sign language gesture detected by the hand gesture detecting device 6 belong to.
- the word dictionary 11 contains features of gestures for sign language words belonging to the individual categories.
- the word recognizing device 9 detects which of the sign language words belonging to the category detected by the category detecting device 8 the features of the sign language gesture detected by the hand gesture detecting device 6 correspond to.
- the output device 12 outputs the result detected by the word recognizing device 9 in the form of image, letters, speech, etc.
- Figs.2 and 3 are flowcharts showing the operation of the embodiment shown in Fig.1. Referring to Figs.2 and 3, the operation of this embodiment will now be described.
- the photographing device 1 starts taking pictures (Step S1).
- two TV cameras on the right and left sides included in the photographing device 1 stereoscopically sense the upper half of the user's body from different angles.
- the stereoscopic image data outputted from the photographing device 1 is stored into the image storage device 2 at a proper sampling cycle.
- the image storage device 2 stores the stereoscopic image data at a sampling interval of 1/30 second according to the NTSC standard.
- another sampling interval, e.g., 1/10 sec or 1/5 sec, may be used instead.
- individual frames of the stereoscopic image data stored in the image storage device 2 are serially numbered in a time series manner (as IMG1, IMG2, ...).
- next, it is determined whether a table setting flag (not shown) provided therein is set (Step S2). As will be described later, this table setting flag is set when the color transformation tables are set (see Step S11). At first, the color transformation tables are not set and the table setting flag is therefore in a reset state; the process then proceeds to Step S3.
- in Step S3, as shown in Fig.5, the operator selects an arbitrary frame of image data from among the plurality of frames of stereoscopic image data stored in the image storage device 2 as a representative image for feature extraction. While the image data outputted from the photographing device 1 is stored in the image storage device 2, it is also displayed on a display device (not shown).
- the operator gives a selecting instruction to the color transformation table creating device 13 at the proper timing, while watching the contents displayed on the display device, to specify the representative image.
- the color transformation table creating device 13 then reads the image data of the operator-selected representative image from the image storage device 2. Subsequently, the color transformation table creating device 13 performs the process of setting the color transformation tables (Steps S4 to S11). The processing made in Steps S4 to S11 will now be described in detail.
- the first channel corresponds to the R signal
- the second channel corresponds to the G signal
- the third channel corresponds to the B signal.
- the first channel may correspond to the G or B signal.
- the operator specifies the first channel as the output channel.
- the operator specifies colors to be taken out into the first channel (Step S4).
- the operator specifies the part "a" in the hair and the part "b" in the eye in the representative image (Fig.5) displayed on a display device (not shown) by using a pointing device such as a mouse.
- the color transformation table creating device 13 determines the RGB values representing the colors of the specified parts a and b as colors to be taken out in the first channel and sets the maximum value "255" in the corresponding color space regions in the color transformation table 131 for the first channel (see Fig.6).
- the color information acquired at this time may be expressed in any color system, such as HSI, YUV, or YIQ.
- the operator specifies colors not to be outputted into the first channel (Step S5).
- the operator specifies the parts "c" and "e" in the clothes and the part "d" in the face in the representative image (Fig.5) by using a mouse or the like.
- the operator can specify a plurality of portions.
- the color transformation table creating device 13 determines the RGB values representing the colors of the specified parts c, d and e as colors not to be outputted in the first channel and sets the minimum value "0" in the corresponding color space regions in the first-channel color transformation table 131.
- the color transformation table creating device 13 determines that the output channel specified at this time is the first channel (Step S6), performs a given interpolating operation between the colors specified in Step S4 and the colors specified in Step S5 to calculate transform values for colors not specified in Steps S4 and S5, and sets the calculated transform values in the corresponding color space regions in the first-channel color transformation table 131 (Step S7).
- the given interpolating operation performed in Step S7 above may be the color space transformation operation described in "A Method of Color Correction by using the Color Space Transformation," Jun Ikeda et al., The Institute of Image Information and Television Engineers, 1995 annual meeting, n.pag., for example. This transformation operation will now be described.
- in this interpolation, for an arbitrary unspecified color, the minimum distance between that color and the colors to be extracted and the minimum distance between that color and the colors not to be extracted (taken as Si 1 min) are obtained and used to calculate the transform value.
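The interpolating operation itself is only referenced above, not reproduced. As a rough illustration, the sketch below builds one channel's color transformation table from the operator-specified colors, assuming a 32-level quantization of the RGB cube and a simple distance-ratio interpolation between the "extract" and "suppress" colors; the function name, the quantization, and the interpolation formula are illustrative assumptions rather than the patent's actual equations.

```python
import numpy as np

def build_channel_table(extract_rgbs, suppress_rgbs, levels=32):
    """Sketch of Steps S7/S9/S10: build one channel's color transformation table.

    extract_rgbs:  RGB triples the operator marked as colors to take out (-> 255)
    suppress_rgbs: RGB triples the operator marked as colors not to output (-> 0)
    All other colors receive a value interpolated from their distances to the two
    sets of specified colors (a simple ratio; the patent only refers to a
    "given interpolating operation").
    """
    step = 256 // levels
    table = np.zeros((levels, levels, levels), dtype=np.uint8)
    extract = np.asarray(extract_rgbs, dtype=float)
    suppress = np.asarray(suppress_rgbs, dtype=float)
    for r in range(levels):
        for g in range(levels):
            for b in range(levels):
                c = np.array([r, g, b], dtype=float) * step
                d_ext = np.linalg.norm(extract - c, axis=1).min()   # distance to nearest "extract" color
                d_sup = np.linalg.norm(suppress - c, axis=1).min()  # distance to nearest "suppress" color
                if d_ext == 0.0:
                    table[r, g, b] = 255
                elif d_sup == 0.0:
                    table[r, g, b] = 0
                else:
                    # closer to an extracted color -> transform value nearer 255
                    table[r, g, b] = int(round(255.0 * d_sup / (d_ext + d_sup)))
    return table

# Hypothetical first-channel example: extract dark hair/eye colors, suppress clothes/face colors
table_ch1 = build_channel_table([(20, 15, 10), (30, 25, 20)],
                                [(180, 60, 60), (200, 150, 120)])
```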
- the operator specifies the second channel as the output channel and specifies colors to be taken out and not to be outputted in the second channel (Steps S4 and S5).
- the operator specifies the part "d" in the face in the selected representative image (Fig.5) with a mouse or the like as a color to be taken out in the second channel.
- the operator also specifies the parts other than the face as colors not to be outputted in the second channel by using a mouse or the like.
- the color transformation table creating device 13 sets the maximum value "255" and the minimum value "0" in the corresponding color space regions in the second-channel color transformation table 131 (see Fig.6).
- the color transformation table creating device 13 determines that the output channel specified at this time is the second channel (Step S8), performs a given interpolating operation between the color specified in Step S4 and the colors specified in Step S5 to calculate transform values for colors not specified in Steps S4 and S5, and sets the transform values obtained by calculation in the corresponding color space regions in the second-channel color transformation table 131 (Step S9).
- the operator specifies the third channel as the output channel and specifies colors to be taken out and not to be outputted in the third channel (Steps S4 and S5).
- the operator specifies the parts "c" and "e" in the clothes in the representative image (Fig.5) as colors to be taken out in the third channel by using a mouse or the like.
- the operator also specifies a part other than the clothes (e.g., the background part) as colors not to be outputted in the third channel by using a mouse or the like.
- the color transformation table creating device 13 sets the maximum value "255" and the minimum value "0" in the corresponding color space regions in the third-channel color transformation table 131 (see Fig.6).
- the color transformation table creating device 13 determines that the output channel specified at this time is the third channel (Step S8), performs a given interpolating operation between the colors specified in Step S4 and the colors specified in Step S5 to calculate transform values for colors not specified in Steps S4 and S5, and sets the calculated values in the corresponding color space regions in the third-channel color transformation table 131 (Step S10).
- the color transformation table creating device 13 sets the table setting flag (Step S11) and ends the processing of setting the color transformation tables 131.
- the feature image extracting device 3 transforms the picture elements included in the stereoscopic image data read from the image storage device 2 by using the three color transformation tables 131 created by the color transformation table creating device 13. The feature image extracting device 3 then outputs only those picture elements whose transform values are equal to or larger than a predetermined threshold.
- the stereoscopic feature images (see Figs.7a to 7c) showing the body features of the present user are thus outputted, disassembled into the first to third channels (Step S12).
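For illustration only, a minimal sketch of the transformation in Step S12, assuming a table built as above: each picture element is looked up in one channel's table and kept only if its transform value reaches a threshold (the threshold value of 128 is an assumption, not taken from the text).

```python
import numpy as np

def extract_feature_image(image_rgb, table, threshold=128, levels=32):
    """Sketch of Step S12 for one channel: transform every picture element through
    the channel's color transformation table and suppress weak responses.

    image_rgb: HxWx3 uint8 array; table: (levels, levels, levels) uint8 array.
    """
    step = 256 // levels
    idx = (image_rgb // step).astype(int)                  # quantize RGB into table bins
    values = table[idx[..., 0], idx[..., 1], idx[..., 2]]  # per-pixel transform value
    return np.where(values >= threshold, values, 0)        # keep only strong responses
```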
- Fig.7a shows the feature image outputted in the first channel, which includes, as blobs (images each treated as a lump), a blob 71 corresponding to the hair of the head, blobs 72 and 73 corresponding to the eyebrows, and blobs 74 and 75 corresponding to the eyes.
- Fig.7b shows the feature image outputted in the second channel, which includes a blob 76 corresponding to the face, and blobs 77 and 78 corresponding to the hands.
- Fig.7c shows the feature image outputted in the third channel, which includes a blob 79 corresponding to the whole region of the body.
- the spatial position calculating device 4 obtains the on-image center of gravity positions of the blobs included in the feature images in the first to third channels shown in Figs.7a to 7c (Step S13).
- referring to Fig.8, the method for obtaining the center of gravity position of the blob corresponding to the right hand will be described.
- the circumscribed rectangle of the objective blob is obtained, where the coordinates of the two diagonal vertices of the circumscribed rectangle are taken as (Xst, Yst) and (Xend, Yend), respectively.
- the origin in the coordinates is taken at the upper left of the image as shown in Figs.7a to 7c.
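A minimal sketch of Step S13, assuming each blob is available as a binary mask with the image origin at the upper left; the circumscribed rectangle is located first, and the center of gravity is then taken as the mean of the blob pixels (the text leaves open whether the rectangle center or the pixel mean is used, so this is one reasonable reading).

```python
import numpy as np

def blob_centroid(mask):
    """Sketch of Step S13: on-image center of gravity of one blob.

    mask: 2-D boolean array, True on the blob's pixels (origin at the upper left).
    Returns the center of gravity and the circumscribed rectangle (Xst, Yst, Xend, Yend).
    """
    ys, xs = np.nonzero(mask)
    x_st, x_end = xs.min(), xs.max()   # circumscribed rectangle, diagonal vertices
    y_st, y_end = ys.min(), ys.max()
    return xs.mean(), ys.mean(), (x_st, y_st, x_end, y_end)
```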
- the spatial position calculating device 4 calculates the three-dimensional spatial positions of the blobs by utilizing the parallax of the feature images outputted from the feature image extracting device 3.
- the spatial position calculating device 4 records the three-dimensional spatial positions of the blobs calculated in Step S14 in such a three-dimensional spatial position table as shown in Fig.9.
- the right and left cameras can be set in arbitrary position.
- the equations (6) to (8) can be modified in accordance with the positional relation between the right and left cameras.
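Equations (6) to (8) are referenced above but not reproduced in this text. As a stand-in, the sketch below uses the standard parallel-axis stereo triangulation, assuming rectified cameras with a horizontal baseline; the patent's actual equations may differ depending on how the right and left cameras are placed.

```python
def blob_3d_position(xl, yl, xr, yr, focal_px, baseline_mm):
    """Sketch of Step S14: 3-D position of a blob from its center-of-gravity
    coordinates (xl, yl) in the left image and (xr, yr) in the right image.

    focal_px is the focal length in pixels and baseline_mm the camera spacing;
    both are assumed calibration values.
    """
    disparity = xl - xr                      # parallax between the two views
    if disparity == 0:
        raise ValueError("zero parallax: point at infinity or mismatched blobs")
    z = focal_px * baseline_mm / disparity   # depth
    x = z * xl / focal_px                    # lateral position
    y = z * yl / focal_px                    # vertical position
    return x, y, z
```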
- the region dividing device 5 extracts the outline of the body as shown in Fig.10 from the feature image of the third channel shown in Fig.7c (Step S15).
- the region dividing device 5 detects representative lines representing body features (see Fig.11) from the extracted outline (Step S16).
- the line HUL is a line parallel to the X axis and touching the uppermost end of the person's outline, which represents the top of the head of the body.
- the lines FRL and FLL are lines parallel to the Y axis and touching the right and left ends of the upper part (the upper one-third) of the body outline, which represent the right and left sides of the face.
- the point at which the vertical extension of the line FRL intersects the outline is taken as frlp (Xf, Yf).
- the first intersection with the outline found when the image is searched from the left side is taken as tempp (Xt, Yt).
- the point with the maximum curvature found when the outline is searched from the point frlp to the point tempp is the point shp, which represents the right shoulder.
- the line SUL is parallel to the X axis and passes through the point shp.
- the line SHRL is parallel to the Y axis and passes through the point shp.
- the line MCL is parallel to the Y axis and located at the midpoint between the line FRL and the line FLL, which represents the center axis of the body.
- the line SHLL is a line symmetric to the line SHRL about the line MCL.
- the line ERL is a line symmetric to the line MCL about the line SHRL.
- the line ELL is a line symmetric to the line ERL about the line MCL.
- the line NEL is a line parallel to the X axis and located at the three-fourths position between the line SUL and the line HUL.
- the line BML is a line parallel to the X axis and located at the midpoint between the line SUL and the bottom end of the image.
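For illustration, a few of these representative lines can be derived directly from a binary outline image along the lines described above; the sketch below is an assumed reading covering only HUL, FRL, FLL and MCL, with the image origin at the upper left.

```python
import numpy as np

def representative_lines(outline_mask):
    """Sketch of Step S16: derive HUL, FRL, FLL and MCL from a body outline.

    outline_mask: 2-D boolean array, True on the outline pixels.
    """
    ys, xs = np.nonzero(outline_mask)
    hul = ys.min()                                   # top of the head
    upper = ys <= hul + (ys.max() - hul) // 3        # upper one-third of the outline
    frl = xs[upper].min()                            # one side of the face (image left)
    fll = xs[upper].max()                            # other side of the face (image right)
    mcl = (frl + fll) // 2                           # center axis of the body
    return {"HUL": hul, "FRL": frl, "FLL": fll, "MCL": mcl}
```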
- the region dividing device 5 obtains the intersections 0 to 21 of the representative lines (see Fig.12). Next, regarding the points with the same intersection numbers in the images sensed by the right camera and the left camera as corresponding right and left points, the region dividing device 5 calculates the three-dimensional spatial positions about the intersections 0 to 21 similarly to the spatial position calculating device 4 by utilizing the parallax (Step S17).
- the region dividing device 5 substitutes those coordinate values into the above-presented equations (6) to (8) to calculate its three-dimensional spatial position.
- the three-dimensional spatial positions are calculated in the same way for other intersections.
- the region dividing device 5 defines spatial region codes (0 to 24) for a first world as shown in Fig.13 on the basis of the results calculated in Step S17.
- the region dividing device 5 defines the region extending from the first world in the distance between the line MCL and the line SHRL ahead of the person as second world spatial region codes (25-49) and defines the region further ahead as third world spatial region codes (50-74).
- Fig.14 visually shows the positional relation among the first to third worlds defined by the region dividing device 5.
- the region dividing device 5 stores the defined spatial region codes and the three-dimensional coordinate values of the intersections for defining them in a spatial region code table (not shown) (Step S18).
- the regions can be divided in correspondence with the body parts of the user (the face, neck, chest, belly, sides of the face, etc.), with the spatial region codes indicating the correspondence with the body parts of the user.
- the region dividing device 5 may receive the three-dimensional spatial positions of the blobs corresponding to the hair of the head and the eyes from the spatial position calculating device 4. Then it estimates positions of other elements (the nose, mouth, ears, etc.) constituting the face from the positional relation between the hair and eyes, and divides the spatial region (i.e., the spatial region corresponding to the spatial region code (11) of Fig.13) into smaller ones on the basis of the estimated positions of other elements.
- the region dividing device 5 contains previously recorded general positional relation among the nose, mouth, ears, etc. with respect to the hair and the eyes.
- the region dividing device 5 divides the space into smaller regions on the basis of the estimated positions of the nose, mouth, ears, etc. in the three-dimensional space and defines spatial region codes for defining them.
- the region dividing device 5 may be configured to calculate difference values between images adjacent in time in a certain channel (e.g., the third channel) so that it creates the spatial region codes of Step S18 only when the difference value is at or over a predetermined threshold. In this case, since the spatial region codes are created only when the user moves largely, the calculating load on the region dividing device 5 is reduced. As shown in Fig.15, the region dividing device 5 may also define the spatial region codes more roughly in higher-numbered worlds, i.e., from the first to the second to the third world, as the regions move forward, away from the user.
- the hand gesture detecting device 6 specifies blobs having size corresponding to the hands from among the blobs obtained in the second channel as the hands, and determines which of the spatial region codes created in Step S18 (see Fig.13) the three-dimensional spatial positions of the corresponding blobs recorded in the three-dimensional spatial position table of Fig.9 belong to (Step S19).
- the results of the determination made at this time are recorded in such a region transition table as shown in Fig.16.
- the region transition table shown in Fig.16 contains data recorded when a sign language gesture meaning "postcard" is performed as an example.
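A sketch of the lookup performed in Step S19, assuming the spatial region codes are stored as axis-aligned boxes keyed by code number (the actual layout of the spatial region code table is not given in this text); each frame appends one row to a region transition table like Fig.16.

```python
def region_code_of(point, region_table):
    """Sketch of Step S19: find which spatial region code a hand blob's 3-D
    position falls into.  region_table is assumed to map each code to the two
    opposite corners of its box, e.g. {11: ((x0, y0, z0), (x1, y1, z1)), ...}.
    """
    x, y, z = point
    for code, ((x0, y0, z0), (x1, y1, z1)) in region_table.items():
        if x0 <= x < x1 and y0 <= y < y1 and z0 <= z < z1:
            return code
    return None  # outside all defined regions

def append_region_transition(transition_table, frame_no, right_pos, left_pos, region_table):
    """Append one row (frame number, right-hand code, left-hand code) to the
    region transition table, as in Fig.16."""
    transition_table.append((frame_no,
                             region_code_of(right_pos, region_table),
                             region_code_of(left_pos, region_table)))
```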
- the hand gesture detecting device 6 determines blobs which satisfy the condition given by the following expression (9) to be the hands and determines blobs which satisfy the condition given by the following expression (10) to be blobs representing other parts, where La denotes the size of a blob: La > TH SM and La < TH BG (9); La ≤ TH SM or La ≥ TH BG (10).
- the blobs 77 and 78 shown in Fig.7b are determined to be blobs corresponding to the hands, and then the right hand and the left hand are specified.
- the hand gesture detecting device 6 determines whether the movement of the blobs corresponding to the hands has rested for a predetermined time period or longer (Step S20). While the movement of those blobs is continuing, the operations in Steps S12 to S19 are repeated, and the spatial region codes to which those blobs belong are thus recorded in a time series manner in the region transition table shown in Fig.16. Accordingly, it can be seen how the hands move with respect to the body of the user by looking at the region transition table.
- the hand gesture detecting device 6 analyzes the spatial region codes recorded in the region transition table (see Fig.16) and disassembles the movement of the hands into elements to detect the features (Step S21). The following features are detected from the spatial region codes stored in the region transition table of Fig.16.
- Fig.17a shows a hand shape No.4 corresponding to " "(which is a phonogram pronounced as [hi]).
- Fig.17b shows a hand shape No.2 corresponding to " "(which is a phonogram pronounced as [te]).
- the operation of detecting movement codes executed in Step S21 will now be described in greater detail.
- the start point of the gesture is taken as ST (xs, ys, zs), and the end point of the gesture is taken as END (xe, ye, ze).
- the hand gesture detecting device 6 first obtains the straight line L1 connecting the start point ST and the end point END (Step S101).
- the hand gesture detecting device 6 obtains the perpendicular lines from individual sampling points n1 to n9 to the straight line L1 and obtains the lengths d1 to d9 of the perpendicular lines (Step S102).
- in Step S102, the lengths of the perpendicular lines from the individual sampling points n1 to n9 to the straight line L1 are obtained by using the above equation (11).
- the hand gesture detecting device 6 takes the sampling point having the longest perpendicular line as a control candidate point (Step S103). In this case, the sampling point n3 having the maximum distance d3 to the straight line L1 is regarded as the control candidate point.
- the hand gesture detecting device 6 determines whether the maximum distance d3 is not smaller than a predetermined threshold THC (Step S104). When the maximum distance d3 is equal to or larger than the predetermined threshold THC, the hand gesture detecting device 6 defines this point n3 as a control point (Step S105). In this case, the maximum distance d3 is equal to or larger than the threshold THC and therefore the sampling point n3 is defined as a control point c1.
- the hand gesture detecting device 6 detects a new control point between the start point ST and the end point END (Step S106). This operation of detecting a new control point is repeatedly performed until no new control point is detected between the start point ST and the end point END any longer (Step S107).
- the hand gesture detecting device 6 obtains the straight line L2 connecting the start point ST and the control point c1 and the straight line L3 connecting the control point c1 and the end point END, and then calculates the distances between the straight line L2 and the individual sampling points n1 and n2 existing between the start point ST and the control point c1, and the distances between the straight line L3 and the individual sampling points n4 to n9 existing between the control point c1 and the end point END, by using the above-presented equation (11). Between the start point ST and the control point c1, the sampling point n2 has the maximum distance d2 to the straight line L2 and is regarded as a control candidate point.
- since the distance d2 is smaller than the threshold THC, the sampling point n2 is not defined as a control point. Hence no control point exists between the start point ST and the control point c1. Between the control point c1 and the end point END, the sampling point n8 has the maximum distance d8 to the straight line L3 and is regarded as a control candidate point. Since this distance d8 is equal to or larger than the threshold THC, the sampling point n8 is defined as a control point c2.
- the hand gesture detecting device 6 obtains the straight line L4 connecting the control point c1 and the control point c2 and calculates the distances between the straight line L4 and the individual sampling points n4 to n7 existing therebetween by using the above equation (11).
- the sampling point n7 having the maximum distance d7 is regarded as a control candidate point.
- since the distance d7 is smaller than the threshold THC, the sampling point n7 is not defined as a control point. Accordingly no control point exists between the control point c1 and the control point c2.
- the hand gesture detecting device 6 obtains the straight line L5 connecting the control point c2 and the end point END and calculates the distance d9 between the straight line L5 and the sampling point n9 existing therebetween by using the equation (11).
- the sampling point n9 is regarded as the control candidate point but not defined as a control point, for the distance d9 is shorter than the threshold THC. Accordingly no control point exists between the control point c2 and the end point END. That is to say, in the movement of the hand from the start point ST to the end point END, there are two control points c1 and c2.
- the hand gesture detecting device 6 creates movement codes by using the start point, the control points, and the end point (Step S108). That is to say, in the case of the locus of the hand shown in Figs.19a to 19c, it can be disassembled into the movements of ST → c1, c1 → c2, c2 → END.
- ST → c1 corresponds to [1. right], c1 → c2 to [4. down], and c2 → END to [2. left], respectively. Accordingly the movement codes are "right → down → left" in this case.
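The control-point extraction of Steps S101 to S108 is essentially a recursive split of the locus at the sampling point farthest from the current chord, much like Douglas-Peucker line simplification. The sketch below follows that reading; the direction labels produced at the end are a reduced, assumed subset of the movement code table of Fig.21.

```python
import math

def point_line_distance(p, a, b):
    """Length of the perpendicular from sampling point p to the straight line
    through a and b (the patent's equation (11)), for 3-D points."""
    ax, ay, az = a
    bx, by, bz = b
    px, py, pz = p
    ab = (bx - ax, by - ay, bz - az)
    ap = (px - ax, py - ay, pz - az)
    cross = (ab[1] * ap[2] - ab[2] * ap[1],
             ab[2] * ap[0] - ab[0] * ap[2],
             ab[0] * ap[1] - ab[1] * ap[0])
    return math.sqrt(sum(c * c for c in cross)) / math.sqrt(sum(c * c for c in ab))

def control_points(points, thc):
    """Steps S101-S107: recursively keep, as a control point, the sampling point
    farthest from the chord between the current start and end, whenever its
    distance reaches the threshold THC."""
    if len(points) <= 2:
        return []
    start, end = points[0], points[-1]
    dists = [point_line_distance(p, start, end) for p in points[1:-1]]
    i = max(range(len(dists)), key=dists.__getitem__)   # control candidate point
    if dists[i] < thc:
        return []
    split = i + 1
    return (control_points(points[:split + 1], thc)
            + [points[split]]
            + control_points(points[split:], thc))

def movement_codes(points, thc):
    """Step S108: turn the start point, control points and end point into a
    sequence of direction labels.  Only four in-plane labels are shown here; the
    movement code table of Fig.21 is assumed to contain more directions."""
    knots = [points[0]] + control_points(points, thc) + [points[-1]]
    codes = []
    for (x0, y0, z0), (x1, y1, z1) in zip(knots, knots[1:]):
        dx, dy = x1 - x0, y1 - y0
        if abs(dx) >= abs(dy):
            codes.append("right" if dx > 0 else "left")
        else:
            codes.append("down" if dy > 0 else "up")   # y assumed to grow downward
    return codes
```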
- the category detecting device 8 determines which of the categories recorded in the category dictionary 10 the features of the sign language gesture detected by the hand gesture detecting device 6 in Step S19 belong to (Step S22).
- the categories are groups each including a plurality of sign language gestures with similar movements.
- a plurality of sign language gestures as objects of recognition by this device are classified into a plurality of categories in advance.
- the category dictionary 10 contains features of the hand gestures in the individual categories recorded in advance. In this embodiment, it is assumed that the category dictionary 10 contains features in categories 1 to 7, for example.
- the category 1 includes hand gestures in which both hands first come closer and then move symmetrically on the right and left.
- the category 2 includes hand gestures in which both hands move independently while keeping certain or larger interval.
- the category 3 includes hand gestures in which the hands identically move in contact or while being coupled.
- the category 4 includes hand gestures in which one hand stands still and the other hand moves within a given region from the resting hand.
- the category 5 includes hand gestures in which one hand stands still and the other moves closer to and comes in contact with the resting hand from an interval equal to or larger than a given region.
- the category 6 includes hand gestures made by both hands other than those mentioned above.
- the category 7 includes hand gestures made by one hand only.
- the word dictionary 11 contains more detailed features of the movements for sign language words in the individual categories.
- Figs.22a to 22c show examples of sign language words belonging to the category 1.
- although sign language words which satisfy the above-mentioned conditions include not only those shown in Figs.22a to 22c but also other words, it is assumed here for simplicity that only the three sign language words satisfying these conditions, i.e., "postcard," "all," and "almost," belong to the category 1.
- the word dictionary 11 contains information showing features of the movements for the three sign language words belonging to the category 1. That is to say, the word dictionary 11 records information such as "movement code,” “gesture start position code,” “gesture end position code,” “indicated particular part,” “positional relation between hands,” “hand shape,” etc.
- the word recognizing device 9 reads, from the word dictionary 11, the feature information about the movements for the sign language words belonging to the category detected by the category detecting device 8 (Step S23). Next, the word recognizing device 9 compares the features of the sign language gesture detected in Step S21 and the feature information about the sign language words read in Step S23 to calculate the degree of coincidence for each sign language word (Step S24).
- when a detected feature completely coincides with the corresponding feature information recorded in the word dictionary 11, the degree of similarity is 100%.
- the degree of similarity is given in accordance with the degree of closeness. For example, as shown in Fig.16, while the gesture end position code for the left hand detected in Step S19 is "13,” the gesture end position code for the left hand for “postcard” shown in Fig.23 is “38.” In this case, as shown in Fig.24, the degree of similarity for the spatial position code "13" with respect to the spatial position code "38" is 89%.
- note that the degrees of similarity shown in Fig.24 are just examples and can be arbitrarily changed.
- lower degrees of similarity (e.g., a degree of similarity of 20%) are given to spatial position codes not shown in Fig.24, i.e., to spatial position codes separated in space from the spatial position code "38."
- a movement code recorded in the word dictionary 11 when a movement code recorded in the word dictionary 11 is taken as a reference movement code, four movement codes corresponding to the ridges of a quadrangular pyramid (the lines on which planes intersect on the sides of the quadrangular pyramid) formed around that reference movement code as a center axis are regarded as near-by codes for that reference movement code. Given degrees of similarity (e.g., a degree of similarity of 90%) are assigned to these four near-by codes. Lower degrees of similarity (e.g., a degree of similarity of 20%) are given to other movement codes.
- Fig.25 shows part of a movement near-by code table storing a list of near-by codes for reference movement codes.
- Fig.26 visually shows four near-by codes (shown by the dotted lines) for a reference movement code directed downward (shown by the solid line).
- the word recognizing device 9 refers to the near-by code table shown in Fig.25 to determine whether an actually detected movement code is a near-by code for a reference movement code recorded in the word dictionary 11.
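A sketch of how the per-feature degrees of similarity of Step S24 might be combined into one degree of coincidence per word, using the example values given in the text (100% for an exact movement-code match, 90% for a near-by code from Fig.25, 20% otherwise); the dictionary entry layout and the simple averaging are assumptions for illustration.

```python
def movement_code_similarity(detected, reference, near_by_table):
    """Score one detected movement code against a reference code from the word
    dictionary 11: 100 for an exact match, 90 for one of the reference code's
    near-by codes (Fig.25), 20 otherwise (example values from the text)."""
    if detected == reference:
        return 100
    if detected in near_by_table.get(reference, ()):
        return 90
    return 20

def word_coincidence(detected, entry, near_by_table, position_similarity):
    """Average the per-feature degrees of similarity for one sign language word
    (Step S24).  'detected' and 'entry' are assumed to be dicts holding movement
    codes and start/end position codes; position_similarity is a lookup like
    Fig.24 between two spatial position codes."""
    scores = [movement_code_similarity(d, r, near_by_table)
              for d, r in zip(detected["movement"], entry["movement"])]
    scores.append(position_similarity(detected["start_code"], entry["start_code"]))
    scores.append(position_similarity(detected["end_code"], entry["end_code"]))
    return sum(scores) / len(scores)
```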
- the output device 12 outputs the sign language word "postcard" specified by the word recognizing device 9 in speech, letters, image, or in an arbitrary combination thereof (Step S26). This enables the operator to know the result of recognition.
- the feature image extracting device 3 determines whether it has received an instruction to end the recognizing operation from the operator (Step S27). In the case of no instruction, it performs the operation in Step S12 again. The operations in Steps S13 to S26 are then repeated. If it receives an instruction to end from the operator, the color transformation table creating device 13 resets the table setting flag (Step S28). Then the sign language recognizing device shown in Fig.1 ends the operation.
- the word recognizing device 9 in the above-described first embodiment outputs a sign language word with the highest degree of coincidence as the recognized result, it may be configured to output one or a plurality of sign language words having degrees of similarity equal to or larger than a predetermined threshold as the recognized result.
- movement codes for hand gestures are uniquely detected in the first preferred embodiment described above, another embodiment in which the movement codes are hierarchically detected and the sign language words are hierarchically recognized on the basis of the hierarchically detected movement codes will be described below as a second embodiment.
- Fig.27 is a block diagram showing the structure of a sign language recognizing device according to the second embodiment of the present invention.
- the structure and operation of this embodiment are the same as those of the first embodiment shown in Fig.1 except in the following respects, and the corresponding parts are shown by the same reference numerals and are not described again.
- Fig.28 is a flowchart showing the operation of detecting movement codes executed in the hand gesture detecting device 60 of the second embodiment.
- the movement code detecting operation performed by the hand gesture detecting device 60 will be described on the basis of the hand locus shown in Figs.29a and 29b and Figs.30a to 30c as an example.
- the hand gesture detecting device 60 detects movement codes on the basis of a low resolution threshold THC1 (Step S201). At this time, the hand gesture detecting device 60 detects the movement codes by using the algorithm shown in Fig.18. That is to say, the hand gesture detecting device 60 obtains the straight line L1 connecting the start point ST and the end point END as shown in Fig.29a and then calculates the distances d1 to d4 between the straight line L1 and the individual sampling points n1 to n4 by using the above-presented equation (11).
- the sampling point n3 having the maximum distance d3 to the straight line L1 is regarded as a control candidate point.
- the hand gesture detecting device 60 compares the maximum distance d3 and the low resolution threshold THC1.
- since the maximum distance d3 is smaller than the low resolution threshold THC1, the sampling point n3 is not defined as a control point. Accordingly, as shown in Fig.29b, no control point exists when the low resolution threshold THC1 is used.
- the hand gesture detecting device 60 represents the hand locus shown in Fig.29b detected by using the low resolution threshold THC1 as ST → END, and defines the movement code as "down" from the movement code table shown in Fig.21.
- the hand gesture detecting device 60 detects the movement code on the basis of a high resolution threshold THC2 (Step S202). At this time, the hand gesture detecting device 60 detects the movement codes by using the algorithm shown in Fig.18.
- the value of the high resolution threshold THC2 is selected to be smaller than the value of the low resolution threshold THC1. That is to say, the hand gesture detecting device 60 obtains the straight line L1 connecting the start point ST and the end point END as shown in Fig.30a, and then calculates the distances d1 to d4 between the straight line L1 and the individual sampling points n1 to n4 by using the equation (11).
- the maximum distance d3 is larger than the distance threshold THC2, and therefore the sampling point n3 is detected as a control point c1.
- the hand gesture detecting device 60 detects a new control point between the start point ST and the control point c1, and further between the control point c1 and the end point END.
- a new control point c2 is detected between the start point ST and the control point c1. Accordingly, there exist two control points c1 and c2 when the high resolution threshold THC2 is used.
- the hand gesture detecting device 60 represents the hand locus shown in Fig.30c detected by using the high resolution threshold THC2 as ST → c2, c2 → c1, c1 → END, and defines the movement codes as "down to right → down to left → down to right" from the movement code table shown in Fig.21.
- the category detecting device 80 selects the corresponding category by using the movement code "down" detected with the low resolution threshold THC1.
- both of "write” and “refreshing” in Fig.31 are selected as candidates for recognition objects.
- the word recognizing device 90 selects the corresponding word by using the movement codes "down to right → down to left → down to right" detected with the high resolution threshold THC2.
- the hand gesture word "write” in Fig.31 is selected.
- the low resolution threshold THC1 and the high resolution threshold THC2 can arbitrarily be selected as long as the relation THC1>THC2 holds. Three or more thresholds may be used.
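Putting the two stages together, the sketch below shows the hierarchical lookup of the second embodiment with assumed dictionary layouts: the movement codes detected with the low resolution threshold THC1 select the category, and the codes detected with the high resolution threshold THC2 pick the word within it, mirroring the "write"/"refreshing" example of Fig.31.

```python
def recognize_hierarchically(coarse_codes, fine_codes, category_dict, word_dict):
    """Sketch of the second embodiment's two-stage recognition.

    category_dict maps a coarse movement-code sequence to candidate words;
    word_dict maps each word to its reference fine movement codes.  Both layouts
    are assumptions made for illustration only.
    """
    candidates = category_dict.get(tuple(coarse_codes), [])
    best_word, best_score = None, -1.0
    for word in candidates:
        ref = word_dict[word]
        hits = sum(d == r for d, r in zip(fine_codes, ref))
        score = hits / max(len(ref), 1)        # crude degree of coincidence
        if score > best_score:
            best_word, best_score = word, score
    return best_word

# Example mirroring Fig.31: "down" selects the category, the fine codes pick "write"
category_dict = {("down",): ["write", "refreshing"]}
word_dict = {"write": ["down to right", "down to left", "down to right"],
             "refreshing": ["down"]}
print(recognize_hierarchically(["down"],
                               ["down to right", "down to left", "down to right"],
                               category_dict, word_dict))
```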
- Fig.32 is a block diagram showing the structure of a sign language recognizing device according to a third embodiment of the present invention.
- the sign language recognizing device of this embodiment additionally has a start-of-gesture informing device 14 between the photographing device 1 and the image storage device 2.
- This structure is the same as that of the first embodiment shown in Fig.1 in other respects, and corresponding parts are shown by the same reference numerals and are not described again.
- This start-of-gesture informing device 14 usually gates image frames outputted from the photographing device 1 to inhibit supply of image frames to the image storage device 2.
- the start-of-gesture informing device 14 informs the user when to start the recognizing operation by light, speech, image, or the like. This allows the user to appropriately time the start of the sign language gesture.
- the start-of-gesture informing device 14 supplies image frames outputted from the photographing device 1 to the image storage device 2 in response to the starting instruction from the operator. Then image frames are accumulated in the image storage device 2 and the processing for recognizing sign language gestures starts.
- this computer device includes a photographing device 1, an image storage device 2, a CPU 21, a RAM 22, a program storage device 23, an input device 24, and a display device 25.
- the program storage device 23 contains program data for realizing operations like those shown in the flowcharts in Figs.2 and 3.
- the CPU 21 executes the operations shown in Figs.2 and 3 in accordance with the program data.
- the RAM 22 stores work data generated during the processing by the CPU 21.
- the input device 24 includes a keyboard, a mouse, and the like, which enters various instructions and data into the CPU 21 in response to operation by an operator.
- the photographing device 1 and the image storage device 2 have the same configurations as the photographing device 1 and the image storage device 2 shown in Fig.1.
- the method for storing the program data into the program storage device 23 includes various methods.
- the program data is read from a storage medium (a floppy disk, a CD-ROM, a DVD, etc.) containing the program data and stored in the program storage device 23.
- program data transferred by on-line communication are received and stored in the program storage device 23.
- program data is stored in the program storage device 23 in advance at the time of shipment of the device.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- General Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Health & Medical Sciences (AREA)
- Psychiatry (AREA)
- Social Psychology (AREA)
- User Interface Of Digital Computer (AREA)
- Image Analysis (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP26148397 | 1997-09-26 | ||
JP261483/97 | 1997-09-26 | ||
JP26148397 | 1997-09-26 |
Publications (2)
Publication Number | Publication Date |
---|---|
EP0905644A2 true EP0905644A2 (fr) | 1999-03-31 |
EP0905644A3 EP0905644A3 (fr) | 2004-02-25 |
Family
ID=17362541
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP98118008A Ceased EP0905644A3 (fr) | 1997-09-26 | 1998-09-23 | Dispositif de reconnaissance de gestes de la main |
Country Status (3)
Country | Link |
---|---|
US (1) | US6215890B1 (fr) |
EP (1) | EP0905644A3 (fr) |
CN (1) | CN1139893C (fr) |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001069365A1 (fr) * | 2000-03-13 | 2001-09-20 | Ab In Credoble | Systeme de reconnaissance de gestuelle |
WO2002007839A2 (fr) * | 2000-07-24 | 2002-01-31 | Jestertek, Inc. | Systeme de controle d'images video |
WO2002029722A2 (fr) * | 2000-10-03 | 2002-04-11 | Jestertek, Inc. | Systeme de controle a cameras multiples |
WO2003071410A2 (fr) * | 2002-02-15 | 2003-08-28 | Canesta, Inc. | Systeme de reconnaissance de geste utilisant des capteurs de perception de profondeur |
FR2847357A1 (fr) * | 2002-11-19 | 2004-05-21 | Simag Dev | Methode de commande d'une machine au moyen de la position d'un objet mobile |
WO2005078558A1 (fr) * | 2004-02-16 | 2005-08-25 | Simone Soria | Procede permettant de generer des signaux de commande, notamment pour des utilisateurs handicapes |
WO2008084053A1 (fr) * | 2007-01-12 | 2008-07-17 | International Business Machines Corporation | Adaptation d'une expérience de consommateur basée sur un flux d'images capturées 3d d'une réponse de comsommateur |
US7725547B2 (en) | 2006-09-06 | 2010-05-25 | International Business Machines Corporation | Informing a user of gestures made by others out of the user's line of sight |
US7792328B2 (en) | 2007-01-12 | 2010-09-07 | International Business Machines Corporation | Warning a vehicle operator of unsafe operation behavior based on a 3D captured image stream |
US7801332B2 (en) | 2007-01-12 | 2010-09-21 | International Business Machines Corporation | Controlling a system based on user behavioral signals detected from a 3D captured image stream |
US7840031B2 (en) | 2007-01-12 | 2010-11-23 | International Business Machines Corporation | Tracking a range of body movement based on 3D captured image streams of a user |
US7877706B2 (en) | 2007-01-12 | 2011-01-25 | International Business Machines Corporation | Controlling a document based on user behavioral signals detected from a 3D captured image stream |
US7971156B2 (en) | 2007-01-12 | 2011-06-28 | International Business Machines Corporation | Controlling resource access based on user gesturing in a 3D captured image stream of the user |
US8269834B2 (en) | 2007-01-12 | 2012-09-18 | International Business Machines Corporation | Warning a user about adverse behaviors of others within an environment based on a 3D captured image stream |
WO2012128399A1 (fr) * | 2011-03-21 | 2012-09-27 | Lg Electronics Inc. | Dispositif d'affichage et procédé de commande associé |
CN103226692A (zh) * | 2012-11-22 | 2013-07-31 | 广东科学中心 | 一种视频流图像帧的识别系统及其方法 |
TWI469101B (zh) * | 2009-12-23 | 2015-01-11 | Chi Mei Comm Systems Inc | 手語識別系統及方法 |
TWI501205B (zh) * | 2014-07-04 | 2015-09-21 | Sabuz Tech Co Ltd | 手語圖像輸入方法及裝置 |
EP3043238A1 (fr) * | 2011-09-15 | 2016-07-13 | Koninklijke Philips N.V. | Interface utilisateur à base de gestes avec rétroaction |
US9704350B1 (en) | 2013-03-14 | 2017-07-11 | Harmonix Music Systems, Inc. | Musical combat game |
Families Citing this family (426)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8352400B2 (en) | 1991-12-23 | 2013-01-08 | Hoffberg Steven M | Adaptive pattern recognition based controller apparatus and method and human-factored interface therefore |
US6560281B1 (en) * | 1998-02-24 | 2003-05-06 | Xerox Corporation | Method and apparatus for generating a condensed version of a video sequence including desired affordances |
US7904187B2 (en) | 1999-02-01 | 2011-03-08 | Hoffberg Steven M | Internet appliance system and method |
US20020071277A1 (en) * | 2000-08-12 | 2002-06-13 | Starner Thad E. | System and method for capturing an image |
JP2002077661A (ja) * | 2000-09-05 | 2002-03-15 | Fujitsu Ltd | 色信号値抽出方法および色変換テーブル作成方法 |
JP3784289B2 (ja) * | 2000-09-12 | 2006-06-07 | 松下電器産業株式会社 | メディア編集方法及びその装置 |
US7274800B2 (en) * | 2001-07-18 | 2007-09-25 | Intel Corporation | Dynamic gesture recognition from stereo sequences |
US7680295B2 (en) * | 2001-09-17 | 2010-03-16 | National Institute Of Advanced Industrial Science And Technology | Hand-gesture based interface apparatus |
US6564144B1 (en) * | 2002-01-10 | 2003-05-13 | Navigation Technologies Corporation | Method and system using a hand-gesture responsive device for collecting data for a geographic database |
US6990639B2 (en) * | 2002-02-07 | 2006-01-24 | Microsoft Corporation | System and process for controlling electronic components in a ubiquitous computing environment using multimodal integration |
TW554293B (en) * | 2002-03-29 | 2003-09-21 | Ind Tech Res Inst | Method for extracting and matching hand gesture features of image |
US20030212552A1 (en) * | 2002-05-09 | 2003-11-13 | Liang Lu Hong | Face recognition procedure useful for audiovisual speech recognition |
US7165029B2 (en) | 2002-05-09 | 2007-01-16 | Intel Corporation | Coupled hidden Markov model for audiovisual speech recognition |
US7209883B2 (en) * | 2002-05-09 | 2007-04-24 | Intel Corporation | Factorial hidden markov model for audiovisual speech recognition |
US20040012643A1 (en) * | 2002-07-18 | 2004-01-22 | August Katherine G. | Systems and methods for visually communicating the meaning of information to the hearing impaired |
US7616784B2 (en) * | 2002-07-29 | 2009-11-10 | Robert William Kocher | Method and apparatus for contactless hand recognition |
JP3996015B2 (ja) * | 2002-08-09 | 2007-10-24 | 本田技研工業株式会社 | 姿勢認識装置及び自律ロボット |
US7774194B2 (en) * | 2002-08-14 | 2010-08-10 | Raanan Liebermann | Method and apparatus for seamless transition of voice and/or text into sign language |
US7171043B2 (en) | 2002-10-11 | 2007-01-30 | Intel Corporation | Image recognition using hidden markov models and coupled hidden markov models |
US7472063B2 (en) * | 2002-12-19 | 2008-12-30 | Intel Corporation | Audio-visual feature fusion and support vector machine useful for continuous speech recognition |
US7203368B2 (en) * | 2003-01-06 | 2007-04-10 | Intel Corporation | Embedded bayesian network for pattern recognition |
US8745541B2 (en) | 2003-03-25 | 2014-06-03 | Microsoft Corporation | Architecture for controlling a computer using hand gestures |
US7665041B2 (en) * | 2003-03-25 | 2010-02-16 | Microsoft Corporation | Architecture for controlling a computer using hand gestures |
EP1617374A4 (fr) * | 2003-04-11 | 2008-08-13 | Nat Inst Inf & Comm Tech | Dispositif et programme de reconnaissance d'image |
JP4355341B2 (ja) * | 2003-05-29 | 2009-10-28 | 本田技研工業株式会社 | 深度データを用いたビジュアルトラッキング |
EP1631937B1 (fr) * | 2003-06-12 | 2018-03-28 | Honda Motor Co., Ltd. | Estimation d'orientation de cible au moyen de la detection de profondeur |
JP3752246B2 (ja) * | 2003-08-11 | 2006-03-08 | 学校法人慶應義塾 | ハンドパターンスイッチ装置 |
US7565295B1 (en) * | 2003-08-28 | 2009-07-21 | The George Washington University | Method and apparatus for translating hand gestures |
EP1671219A2 (fr) * | 2003-09-30 | 2006-06-21 | Koninklijke Philips Electronics N.V. | Geste pour definir l'emplacement, la dimension et/ou le contenu d'une fenetre de contenu sur un afficheur |
US7590941B2 (en) * | 2003-10-09 | 2009-09-15 | Hewlett-Packard Development Company, L.P. | Communication and collaboration system using rich media environments |
US7707039B2 (en) | 2004-02-15 | 2010-04-27 | Exbiblio B.V. | Automatic modification of web pages |
US8442331B2 (en) | 2004-02-15 | 2013-05-14 | Google Inc. | Capturing text from rendered documents using supplemental information |
FI117308B (fi) * | 2004-02-06 | 2006-08-31 | Nokia Corp | Eleohjausjärjestelmä |
US7812860B2 (en) | 2004-04-01 | 2010-10-12 | Exbiblio B.V. | Handheld device for capturing text from both a document printed on paper and a document displayed on a dynamic display device |
US10635723B2 (en) | 2004-02-15 | 2020-04-28 | Google Llc | Search engines and systems with handheld document data capture devices |
US8081849B2 (en) | 2004-12-03 | 2011-12-20 | Google Inc. | Portable scanning and memory device |
US9008447B2 (en) | 2004-04-01 | 2015-04-14 | Google Inc. | Method and system for character recognition |
WO2008028674A2 (fr) | 2006-09-08 | 2008-03-13 | Exbiblio B.V. | Scanners optiques, tels que des scanners optiques portables |
US20060081714A1 (en) | 2004-08-23 | 2006-04-20 | King Martin T | Portable scanning device |
US8146156B2 (en) | 2004-04-01 | 2012-03-27 | Google Inc. | Archive of text captures from rendered documents |
US9143638B2 (en) | 2004-04-01 | 2015-09-22 | Google Inc. | Data capture from rendered documents using handheld device |
US7894670B2 (en) | 2004-04-01 | 2011-02-22 | Exbiblio B.V. | Triggering actions in response to optically or acoustically capturing keywords from a rendered document |
US7990556B2 (en) | 2004-12-03 | 2011-08-02 | Google Inc. | Association of a portable scanner with input/output and storage devices |
US9116890B2 (en) | 2004-04-01 | 2015-08-25 | Google Inc. | Triggering actions in response to optically or acoustically capturing keywords from a rendered document |
US20060098900A1 (en) | 2004-09-27 | 2006-05-11 | King Martin T | Secure data gathering from rendered documents |
US8713418B2 (en) | 2004-04-12 | 2014-04-29 | Google Inc. | Adding value to a rendered document |
US8345918B2 (en) * | 2004-04-14 | 2013-01-01 | L-3 Communications Corporation | Active subject privacy imaging |
CN100573548C (zh) * | 2004-04-15 | 2009-12-23 | 格斯图尔泰克股份有限公司 | 跟踪双手运动的方法和设备 |
US8620083B2 (en) | 2004-12-03 | 2013-12-31 | Google Inc. | Method and system for character recognition |
US8489624B2 (en) | 2004-05-17 | 2013-07-16 | Google, Inc. | Processing techniques for text capture from a rendered document |
US8874504B2 (en) | 2004-12-03 | 2014-10-28 | Google Inc. | Processing techniques for visual capture data from a rendered document |
US7308112B2 (en) * | 2004-05-14 | 2007-12-11 | Honda Motor Co., Ltd. | Sign based human-machine interaction |
US8346620B2 (en) | 2004-07-19 | 2013-01-01 | Google Inc. | Automatic modification of web pages |
JP4792824B2 (ja) * | 2004-11-05 | 2011-10-12 | 富士ゼロックス株式会社 | 動作分析装置 |
RU2323475C2 (ru) * | 2004-11-12 | 2008-04-27 | Общество с ограниченной ответственностью "Центр Нейросетевых Технологий - Интеллектуальные Системы Безопасности" (ООО "ИСС") | Способ (варианты) и устройство (варианты) автоматизированного обнаружения умышленных либо случайных нарушений технологической процедуры оператором |
US7386150B2 (en) * | 2004-11-12 | 2008-06-10 | Safeview, Inc. | Active subject imaging with body identification |
JP5160235B2 (ja) | 2005-01-07 | 2013-03-13 | クアルコム,インコーポレイテッド | 画像中の物体の検出及び追跡 |
WO2006078996A2 (fr) * | 2005-01-21 | 2006-07-27 | Gesturetek, Inc. | Poursuite basee sur le mouvement |
CN1304931C (zh) * | 2005-01-27 | 2007-03-14 | 北京理工大学 | 一种头戴式立体视觉手势识别装置 |
CN101536494B (zh) | 2005-02-08 | 2017-04-26 | 奥布隆工业有限公司 | 用于基于姿势的控制系统的系统和方法 |
CN1315024C (zh) * | 2005-03-11 | 2007-05-09 | 西北工业大学 | 一种视频识别输入系统的输入方法 |
US7697827B2 (en) | 2005-10-17 | 2010-04-13 | Konicek Jeffrey C | User-friendlier interfaces for a camera |
GB2431787B (en) * | 2005-10-31 | 2009-07-01 | Hewlett Packard Development Co | A method of tracking an object in a video stream |
US9075441B2 (en) * | 2006-02-08 | 2015-07-07 | Oblong Industries, Inc. | Gesture based control using three-dimensional information extracted over an extended depth of field |
US8537112B2 (en) * | 2006-02-08 | 2013-09-17 | Oblong Industries, Inc. | Control system for navigating a principal dimension of a data space |
US9910497B2 (en) * | 2006-02-08 | 2018-03-06 | Oblong Industries, Inc. | Gestural control of autonomous and semi-autonomous systems |
US9823747B2 (en) | 2006-02-08 | 2017-11-21 | Oblong Industries, Inc. | Spatial, multi-modal control device for use with spatial operating system |
US8370383B2 (en) | 2006-02-08 | 2013-02-05 | Oblong Industries, Inc. | Multi-process interactive systems and methods |
US8537111B2 (en) * | 2006-02-08 | 2013-09-17 | Oblong Industries, Inc. | Control system for navigating a principal dimension of a data space |
US8531396B2 (en) * | 2006-02-08 | 2013-09-10 | Oblong Industries, Inc. | Control system for navigating a principal dimension of a data space |
JP5161435B2 (ja) * | 2006-06-26 | 2013-03-13 | 株式会社ソニー・コンピュータエンタテインメント | 画像処理装置、画像処理システム、コンピュータの制御方法及びプログラム |
US20080036737A1 (en) * | 2006-08-13 | 2008-02-14 | Hernandez-Rebollar Jose L | Arm Skeleton for Capturing Arm Position and Movement |
US20100023314A1 (en) * | 2006-08-13 | 2010-01-28 | Jose Hernandez-Rebollar | ASL Glove with 3-Axis Accelerometers |
CN100426200C (zh) * | 2006-10-13 | 2008-10-15 | 广东威创视讯科技股份有限公司 | 基于交互式输入设备的智能输入编码方法 |
US8588464B2 (en) * | 2007-01-12 | 2013-11-19 | International Business Machines Corporation | Assisting a vision-impaired user with navigation based on a 3D captured image stream |
US8494234B1 (en) * | 2007-03-07 | 2013-07-23 | MotionDSP, Inc. | Video hashing system and method |
US8005238B2 (en) | 2007-03-22 | 2011-08-23 | Microsoft Corporation | Robust adaptive beamforming with enhanced noise suppression |
EP2150893A4 (fr) | 2007-04-24 | 2012-08-22 | Oblong Ind Inc | Protéines, pools et slaws (fiche d'analyse logistique de poste) dans des environnements de traitement |
US8005237B2 (en) | 2007-05-17 | 2011-08-23 | Microsoft Corp. | Sensor array beamformer post-processor |
US9282377B2 (en) * | 2007-05-31 | 2016-03-08 | iCommunicator LLC | Apparatuses, methods and systems to provide translations of information into sign language or other formats |
US20090012788A1 (en) * | 2007-07-03 | 2009-01-08 | Jason Andre Gilbert | Sign language translation system |
EP2191397B1 (fr) * | 2007-08-20 | 2019-01-23 | Qualcomm Incorporated | Rejet amélioré de mots hors vocabulaire |
EP2188737A4 (fr) | 2007-09-14 | 2011-05-18 | Intellectual Ventures Holding 67 Llc | Traitement d'interactions d'utilisateur basées sur des gestes |
JP4929109B2 (ja) * | 2007-09-25 | 2012-05-09 | 株式会社東芝 | ジェスチャ認識装置及びその方法 |
US8629976B2 (en) * | 2007-10-02 | 2014-01-14 | Microsoft Corporation | Methods and systems for hierarchical de-aliasing time-of-flight (TOF) systems |
US8005263B2 (en) * | 2007-10-26 | 2011-08-23 | Honda Motor Co., Ltd. | Hand sign recognition using label assignment |
US8159682B2 (en) | 2007-11-12 | 2012-04-17 | Intellectual Ventures Holding 67 Llc | Lens system |
CN101465961B (zh) * | 2007-12-19 | 2013-10-30 | 神基科技股份有限公司 | 以特征影像辨识控制快门的摄像装置及方法 |
US20090166684A1 (en) * | 2007-12-26 | 2009-07-02 | 3Dv Systems Ltd. | Photogate cmos pixel for 3d cameras having reduced intra-pixel cross talk |
US9035876B2 (en) | 2008-01-14 | 2015-05-19 | Apple Inc. | Three-dimensional user interface session control |
US8933876B2 (en) | 2010-12-13 | 2015-01-13 | Apple Inc. | Three dimensional user interface session control |
US9772689B2 (en) * | 2008-03-04 | 2017-09-26 | Qualcomm Incorporated | Enhanced gesture-based image manipulation |
US8259163B2 (en) | 2008-03-07 | 2012-09-04 | Intellectual Ventures Holding 67 Llc | Display with built in 3D sensing |
US10642364B2 (en) | 2009-04-02 | 2020-05-05 | Oblong Industries, Inc. | Processing tracking and recognition data in gestural recognition systems |
US9684380B2 (en) | 2009-04-02 | 2017-06-20 | Oblong Industries, Inc. | Operating environment with gestural control and multiple client devices, displays, and users |
US9740293B2 (en) | 2009-04-02 | 2017-08-22 | Oblong Industries, Inc. | Operating environment with gestural control and multiple client devices, displays, and users |
US8723795B2 (en) | 2008-04-24 | 2014-05-13 | Oblong Industries, Inc. | Detecting, representing, and interpreting three-space input: gestural continuum subsuming freespace, proximal, and surface-contact modes |
US9495013B2 (en) | 2008-04-24 | 2016-11-15 | Oblong Industries, Inc. | Multi-modal gestural interface |
US9740922B2 (en) | 2008-04-24 | 2017-08-22 | Oblong Industries, Inc. | Adaptive tracking system for spatial input devices |
US9952673B2 (en) | 2009-04-02 | 2018-04-24 | Oblong Industries, Inc. | Operating environment comprising multiple client devices, multiple displays, multiple users, and gestural control |
WO2009155465A1 (fr) * | 2008-06-18 | 2009-12-23 | Oblong Industries, Inc. | Système de commande sur la base de gestes pour des interfaces de véhicule |
US8385557B2 (en) * | 2008-06-19 | 2013-02-26 | Microsoft Corporation | Multichannel acoustic echo reduction |
US8325909B2 (en) | 2008-06-25 | 2012-12-04 | Microsoft Corporation | Acoustic echo suppression |
US8203699B2 (en) | 2008-06-30 | 2012-06-19 | Microsoft Corporation | System architecture design for time-of-flight system having reduced differential pixel size, and time-of-flight systems so designed |
WO2010011923A1 (fr) | 2008-07-24 | 2010-01-28 | Gesturetek, Inc. | Détection améliorée de geste d'engagement circulaire |
JP5432260B2 (ja) | 2008-07-25 | 2014-03-05 | クアルコム,インコーポレイテッド | ウェーブエンゲージメントジェスチャの改良された検出 |
EP2333652B1 (fr) * | 2008-09-29 | 2016-11-02 | Panasonic Intellectual Property Corporation of America | Méthode et appareil pour améliorer la vie privée des utilisateurs d'un dispositif d'affichage pour plusieurs utilisateurs dans lequel chaque utilisateur a un espace qui lui est alloué |
US20100142683A1 (en) * | 2008-12-09 | 2010-06-10 | Stuart Owen Goldman | Method and apparatus for providing video relay service assisted calls with reduced bandwidth |
US8270670B2 (en) * | 2008-12-25 | 2012-09-18 | Topseed Technology Corp. | Method for recognizing and tracing gesture |
US8379987B2 (en) * | 2008-12-30 | 2013-02-19 | Nokia Corporation | Method, apparatus and computer program product for providing hand segmentation for gesture analysis |
US8681321B2 (en) | 2009-01-04 | 2014-03-25 | Microsoft International Holdings B.V. | Gated 3D camera |
US8588465B2 (en) | 2009-01-30 | 2013-11-19 | Microsoft Corporation | Visual target tracking |
US8487938B2 (en) * | 2009-01-30 | 2013-07-16 | Microsoft Corporation | Standard Gestures |
US8577085B2 (en) * | 2009-01-30 | 2013-11-05 | Microsoft Corporation | Visual target tracking |
US20100199231A1 (en) * | 2009-01-30 | 2010-08-05 | Microsoft Corporation | Predictive determination |
US8577084B2 (en) * | 2009-01-30 | 2013-11-05 | Microsoft Corporation | Visual target tracking |
US8448094B2 (en) * | 2009-01-30 | 2013-05-21 | Microsoft Corporation | Mapping a natural input device to a legacy system |
US8295546B2 (en) | 2009-01-30 | 2012-10-23 | Microsoft Corporation | Pose tracking pipeline |
US8682028B2 (en) * | 2009-01-30 | 2014-03-25 | Microsoft Corporation | Visual target tracking |
US8267781B2 (en) | 2009-01-30 | 2012-09-18 | Microsoft Corporation | Visual target tracking |
US8565476B2 (en) | 2009-01-30 | 2013-10-22 | Microsoft Corporation | Visual target tracking |
US7996793B2 (en) | 2009-01-30 | 2011-08-09 | Microsoft Corporation | Gesture recognizer system architecture |
US8565477B2 (en) | 2009-01-30 | 2013-10-22 | Microsoft Corporation | Visual target tracking |
US8294767B2 (en) | 2009-01-30 | 2012-10-23 | Microsoft Corporation | Body scan |
WO2010096191A2 (fr) | 2009-02-18 | 2010-08-26 | Exbiblio B.V. | Informations de capture automatique telles que des informations de capture utilisant un dispositif prenant en charge des documents |
DE202010018551U1 (de) | 2009-03-12 | 2017-08-24 | Google, Inc. | Automatische Bereitstellung von Inhalten, die mit erfassten Informationen, wie etwa in Echtzeit erfassten Informationen, verknüpft sind |
US8447066B2 (en) | 2009-03-12 | 2013-05-21 | Google Inc. | Performing actions based on capturing information from rendered documents, such as documents under copyright |
US20100235786A1 (en) * | 2009-03-13 | 2010-09-16 | Primesense Ltd. | Enhanced 3d interfacing for remote devices |
US8773355B2 (en) | 2009-03-16 | 2014-07-08 | Microsoft Corporation | Adaptive cursor sizing |
US9256282B2 (en) * | 2009-03-20 | 2016-02-09 | Microsoft Technology Licensing, Llc | Virtual object manipulation |
US8988437B2 (en) | 2009-03-20 | 2015-03-24 | Microsoft Technology Licensing, Llc | Chaining animations |
US9313376B1 (en) | 2009-04-01 | 2016-04-12 | Microsoft Technology Licensing, Llc | Dynamic depth power equalization |
US9317128B2 (en) | 2009-04-02 | 2016-04-19 | Oblong Industries, Inc. | Remote devices used in a markerless installation of a spatial operating environment incorporating gestural control |
US10824238B2 (en) | 2009-04-02 | 2020-11-03 | Oblong Industries, Inc. | Operating environment with gestural control and multiple client devices, displays, and users |
US9377857B2 (en) | 2009-05-01 | 2016-06-28 | Microsoft Technology Licensing, Llc | Show body position |
US8340432B2 (en) | 2009-05-01 | 2012-12-25 | Microsoft Corporation | Systems and methods for detecting a tilt angle from a depth image |
US8660303B2 (en) * | 2009-05-01 | 2014-02-25 | Microsoft Corporation | Detection of body and props |
US9015638B2 (en) | 2009-05-01 | 2015-04-21 | Microsoft Technology Licensing, Llc | Binding users to a gesture based system and providing feedback to the users |
US8253746B2 (en) | 2009-05-01 | 2012-08-28 | Microsoft Corporation | Determine intended motions |
US9498718B2 (en) | 2009-05-01 | 2016-11-22 | Microsoft Technology Licensing, Llc | Altering a view perspective within a display environment |
US8181123B2 (en) | 2009-05-01 | 2012-05-15 | Microsoft Corporation | Managing virtual port associations to users in a gesture-based computing environment |
US8942428B2 (en) | 2009-05-01 | 2015-01-27 | Microsoft Corporation | Isolate extraneous motions |
US8649554B2 (en) | 2009-05-01 | 2014-02-11 | Microsoft Corporation | Method to control perspective for a camera-controlled computer |
US8638985B2 (en) | 2009-05-01 | 2014-01-28 | Microsoft Corporation | Human body pose estimation |
US8503720B2 (en) | 2009-05-01 | 2013-08-06 | Microsoft Corporation | Human body pose estimation |
US9898675B2 (en) | 2009-05-01 | 2018-02-20 | Microsoft Technology Licensing, Llc | User movement tracking feedback to improve tracking |
US9417700B2 (en) | 2009-05-21 | 2016-08-16 | Edge3 Technologies | Gesture recognition systems and related methods |
KR101094636B1 (ko) * | 2009-05-21 | 2011-12-20 | 팅크웨어(주) | 제스처 기반 사용자 인터페이스 시스템 및 그 방법 |
US20100302365A1 (en) * | 2009-05-29 | 2010-12-02 | Microsoft Corporation | Depth Image Noise Reduction |
US8856691B2 (en) | 2009-05-29 | 2014-10-07 | Microsoft Corporation | Gesture tool |
US9182814B2 (en) | 2009-05-29 | 2015-11-10 | Microsoft Technology Licensing, Llc | Systems and methods for estimating a non-visible or occluded body part |
US8625837B2 (en) | 2009-05-29 | 2014-01-07 | Microsoft Corporation | Protocol and format for communicating an image from a camera to a computing environment |
US9400559B2 (en) | 2009-05-29 | 2016-07-26 | Microsoft Technology Licensing, Llc | Gesture shortcuts |
US9383823B2 (en) | 2009-05-29 | 2016-07-05 | Microsoft Technology Licensing, Llc | Combining gestures beyond skeletal |
US8320619B2 (en) | 2009-05-29 | 2012-11-27 | Microsoft Corporation | Systems and methods for tracking a model |
US8379101B2 (en) | 2009-05-29 | 2013-02-19 | Microsoft Corporation | Environment and/or target segmentation |
US8693724B2 (en) | 2009-05-29 | 2014-04-08 | Microsoft Corporation | Method and system implementing user-centric gesture control |
US8509479B2 (en) * | 2009-05-29 | 2013-08-13 | Microsoft Corporation | Virtual object |
US8542252B2 (en) * | 2009-05-29 | 2013-09-24 | Microsoft Corporation | Target digitization, extraction, and tracking |
US8418085B2 (en) | 2009-05-29 | 2013-04-09 | Microsoft Corporation | Gesture coach |
US8744121B2 (en) | 2009-05-29 | 2014-06-03 | Microsoft Corporation | Device for identifying and tracking multiple humans over time |
US8487871B2 (en) | 2009-06-01 | 2013-07-16 | Microsoft Corporation | Virtual desktop coordinate transformation |
US8654187B2 (en) * | 2009-06-08 | 2014-02-18 | Panasonic Corporation | Work recognition system, work recognition device, and work recognition method |
WO2010144050A1 (fr) * | 2009-06-08 | 2010-12-16 | Agency For Science, Technology And Research | Procédé et système de manipulation à base de gestes d'une image tridimensionnelle ou d'un objet |
US8390680B2 (en) | 2009-07-09 | 2013-03-05 | Microsoft Corporation | Visual representation expression based on player expression |
US9159151B2 (en) | 2009-07-13 | 2015-10-13 | Microsoft Technology Licensing, Llc | Bringing a visual representation to life via learned input from the user |
US8264536B2 (en) * | 2009-08-25 | 2012-09-11 | Microsoft Corporation | Depth-sensitive imaging via polarization-state mapping |
US9141193B2 (en) | 2009-08-31 | 2015-09-22 | Microsoft Technology Licensing, Llc | Techniques for using human gestures to control gesture unaware programs |
US8330134B2 (en) * | 2009-09-14 | 2012-12-11 | Microsoft Corporation | Optical fault monitoring |
US8508919B2 (en) | 2009-09-14 | 2013-08-13 | Microsoft Corporation | Separation of electrical and optical components |
US8760571B2 (en) * | 2009-09-21 | 2014-06-24 | Microsoft Corporation | Alignment of lens and image sensor |
US8976986B2 (en) * | 2009-09-21 | 2015-03-10 | Microsoft Technology Licensing, Llc | Volume adjustment based on listener position |
US8428340B2 (en) * | 2009-09-21 | 2013-04-23 | Microsoft Corporation | Screen space plane identification |
US9014546B2 (en) | 2009-09-23 | 2015-04-21 | Rovi Guides, Inc. | Systems and methods for automatically detecting users within detection regions of media devices |
US8452087B2 (en) * | 2009-09-30 | 2013-05-28 | Microsoft Corporation | Image selection techniques |
US8723118B2 (en) * | 2009-10-01 | 2014-05-13 | Microsoft Corporation | Imager for constructing color and depth images |
US8867820B2 (en) | 2009-10-07 | 2014-10-21 | Microsoft Corporation | Systems and methods for removing a background of an image |
US8564534B2 (en) | 2009-10-07 | 2013-10-22 | Microsoft Corporation | Human tracking system |
US8963829B2 (en) | 2009-10-07 | 2015-02-24 | Microsoft Corporation | Methods and systems for determining and tracking extremities of a target |
US7961910B2 (en) | 2009-10-07 | 2011-06-14 | Microsoft Corporation | Systems and methods for tracking a model |
US9971807B2 (en) | 2009-10-14 | 2018-05-15 | Oblong Industries, Inc. | Multi-process interactive systems and methods |
US9933852B2 (en) | 2009-10-14 | 2018-04-03 | Oblong Industries, Inc. | Multi-process interactive systems and methods |
US9400548B2 (en) * | 2009-10-19 | 2016-07-26 | Microsoft Technology Licensing, Llc | Gesture personalization and profile roaming |
CN101694692B (zh) * | 2009-10-22 | 2011-09-07 | 浙江大学 | 一种基于加速度传感器的手势识别的方法 |
US20110099476A1 (en) * | 2009-10-23 | 2011-04-28 | Microsoft Corporation | Decorating a display environment |
CN101719015B (zh) * | 2009-11-03 | 2011-08-31 | 上海大学 | 指示手势的手指尖定位方法 |
US8988432B2 (en) * | 2009-11-05 | 2015-03-24 | Microsoft Technology Licensing, Llc | Systems and methods for processing an image for target tracking |
KR20110055062A (ko) * | 2009-11-19 | 2011-05-25 | 삼성전자주식회사 | 로봇 시스템 및 그 제어 방법 |
US8843857B2 (en) | 2009-11-19 | 2014-09-23 | Microsoft Corporation | Distance scalable no touch computing |
US9081799B2 (en) | 2009-12-04 | 2015-07-14 | Google Inc. | Using gestalt information to identify locations in printed information |
TWI476632B (zh) * | 2009-12-08 | 2015-03-11 | Micro Star Int Co Ltd | 運動物體辨識方法及基於運動物體辨識之指令輸入方法 |
US9323784B2 (en) | 2009-12-09 | 2016-04-26 | Google Inc. | Image search using text-based elements within the contents of images |
US9244533B2 (en) | 2009-12-17 | 2016-01-26 | Microsoft Technology Licensing, Llc | Camera navigation for presentations |
US20110150271A1 (en) | 2009-12-18 | 2011-06-23 | Microsoft Corporation | Motion detection using depth images |
US20110151974A1 (en) * | 2009-12-18 | 2011-06-23 | Microsoft Corporation | Gesture style recognition and reward |
US8320621B2 (en) | 2009-12-21 | 2012-11-27 | Microsoft Corporation | Depth projector system with integrated VCSEL array |
EP2521092A1 (fr) * | 2009-12-28 | 2012-11-07 | Cyber Ai Entertainment Inc. | Système de reconnaissance d'image |
US20110164032A1 (en) * | 2010-01-07 | 2011-07-07 | Prime Sense Ltd. | Three-Dimensional User Interface |
US8631355B2 (en) | 2010-01-08 | 2014-01-14 | Microsoft Corporation | Assigning gesture dictionaries |
US9268404B2 (en) * | 2010-01-08 | 2016-02-23 | Microsoft Technology Licensing, Llc | Application gesture interpretation |
US9019201B2 (en) | 2010-01-08 | 2015-04-28 | Microsoft Technology Licensing, Llc | Evolving universal gesture sets |
US8334842B2 (en) | 2010-01-15 | 2012-12-18 | Microsoft Corporation | Recognizing user intent in motion capture system |
US8933884B2 (en) | 2010-01-15 | 2015-01-13 | Microsoft Corporation | Tracking groups of users in motion capture system |
US20120319813A1 (en) * | 2010-01-15 | 2012-12-20 | Electronics And Telecommunications Research Inst. | Apparatus and method for processing a scene |
US8676581B2 (en) | 2010-01-22 | 2014-03-18 | Microsoft Corporation | Speech recognition analysis via identification information |
US8265341B2 (en) | 2010-01-25 | 2012-09-11 | Microsoft Corporation | Voice-body identity correlation |
US8864581B2 (en) * | 2010-01-29 | 2014-10-21 | Microsoft Corporation | Visual based identitiy tracking |
US8891067B2 (en) | 2010-02-01 | 2014-11-18 | Microsoft Corporation | Multiple synchronized optical sources for time-of-flight range finding systems |
US8619122B2 (en) | 2010-02-02 | 2013-12-31 | Microsoft Corporation | Depth camera compatibility |
US8687044B2 (en) | 2010-02-02 | 2014-04-01 | Microsoft Corporation | Depth camera compatibility |
US8717469B2 (en) * | 2010-02-03 | 2014-05-06 | Microsoft Corporation | Fast gating photosurface |
US8659658B2 (en) * | 2010-02-09 | 2014-02-25 | Microsoft Corporation | Physical interaction zone for gesture-based user interfaces |
US8499257B2 (en) * | 2010-02-09 | 2013-07-30 | Microsoft Corporation | Handles interactions for human—computer interface |
US8633890B2 (en) * | 2010-02-16 | 2014-01-21 | Microsoft Corporation | Gesture detection based on joint skipping |
US20110199302A1 (en) * | 2010-02-16 | 2011-08-18 | Microsoft Corporation | Capturing screen objects using a collision volume |
US8928579B2 (en) * | 2010-02-22 | 2015-01-06 | Andrew David Wilson | Interacting with an omni-directionally projected display |
US8655069B2 (en) | 2010-03-05 | 2014-02-18 | Microsoft Corporation | Updating image segmentation following user input |
US8411948B2 (en) | 2010-03-05 | 2013-04-02 | Microsoft Corporation | Up-sampling binary images for segmentation |
US8422769B2 (en) * | 2010-03-05 | 2013-04-16 | Microsoft Corporation | Image segmentation using reduced foreground training data |
US20110221755A1 (en) * | 2010-03-12 | 2011-09-15 | Kevin Geisner | Bionic motion |
US20110223995A1 (en) | 2010-03-12 | 2011-09-15 | Kevin Geisner | Interacting with a computer based application |
US8279418B2 (en) | 2010-03-17 | 2012-10-02 | Microsoft Corporation | Raster scanning for depth detection |
US8213680B2 (en) * | 2010-03-19 | 2012-07-03 | Microsoft Corporation | Proxy training data for human body tracking |
WO2011119154A1 (fr) * | 2010-03-24 | 2011-09-29 | Hewlett-Packard Development Company, L.P. | Mise en correspondance de geste pour dispositif d'affichage |
US20110234481A1 (en) * | 2010-03-26 | 2011-09-29 | Sagi Katz | Enhancing presentations using depth sensing cameras |
US8514269B2 (en) * | 2010-03-26 | 2013-08-20 | Microsoft Corporation | De-aliasing depth images |
US8523667B2 (en) * | 2010-03-29 | 2013-09-03 | Microsoft Corporation | Parental control settings based on body dimensions |
US8605763B2 (en) | 2010-03-31 | 2013-12-10 | Microsoft Corporation | Temperature measurement and control for laser and light-emitting diodes |
US9646340B2 (en) | 2010-04-01 | 2017-05-09 | Microsoft Technology Licensing, Llc | Avatar-based virtual dressing room |
US9098873B2 (en) | 2010-04-01 | 2015-08-04 | Microsoft Technology Licensing, Llc | Motion-based interactive shopping environment |
US8351651B2 (en) | 2010-04-26 | 2013-01-08 | Microsoft Corporation | Hand-location post-process refinement in a tracking system |
US8379919B2 (en) | 2010-04-29 | 2013-02-19 | Microsoft Corporation | Multiple centroid condensation of probability distribution clouds |
CN102238350A (zh) * | 2010-04-30 | 2011-11-09 | 鸿富锦精密工业(深圳)有限公司 | 电视频道切换遥控系统及方法 |
US8284847B2 (en) | 2010-05-03 | 2012-10-09 | Microsoft Corporation | Detecting motion for a multifunction sensor device |
US8498481B2 (en) | 2010-05-07 | 2013-07-30 | Microsoft Corporation | Image segmentation using star-convexity constraints |
US8885890B2 (en) | 2010-05-07 | 2014-11-11 | Microsoft Corporation | Depth map confidence filtering |
US8457353B2 (en) | 2010-05-18 | 2013-06-04 | Microsoft Corporation | Gestures and gesture modifiers for manipulating a user-interface |
US8396252B2 (en) | 2010-05-20 | 2013-03-12 | Edge 3 Technologies | Systems and related methods for three dimensional gesture recognition in vehicles |
US8803888B2 (en) | 2010-06-02 | 2014-08-12 | Microsoft Corporation | Recognition system for sharing information |
US8751215B2 (en) | 2010-06-04 | 2014-06-10 | Microsoft Corporation | Machine based sign language interpreter |
US9008355B2 (en) | 2010-06-04 | 2015-04-14 | Microsoft Technology Licensing, Llc | Automatic depth camera aiming |
US9557574B2 (en) | 2010-06-08 | 2017-01-31 | Microsoft Technology Licensing, Llc | Depth illumination and detection optics |
US8330822B2 (en) | 2010-06-09 | 2012-12-11 | Microsoft Corporation | Thermally-tuned depth camera light source |
US8675981B2 (en) | 2010-06-11 | 2014-03-18 | Microsoft Corporation | Multi-modal gender recognition including depth data |
US9384329B2 (en) | 2010-06-11 | 2016-07-05 | Microsoft Technology Licensing, Llc | Caloric burn determination from body movement |
US8749557B2 (en) | 2010-06-11 | 2014-06-10 | Microsoft Corporation | Interacting with user interface via avatar |
US8982151B2 (en) | 2010-06-14 | 2015-03-17 | Microsoft Technology Licensing, Llc | Independently processing planes of display data |
US8670029B2 (en) | 2010-06-16 | 2014-03-11 | Microsoft Corporation | Depth camera illuminator with superluminescent light-emitting diode |
US8558873B2 (en) | 2010-06-16 | 2013-10-15 | Microsoft Corporation | Use of wavefront coding to create a depth image |
US8296151B2 (en) | 2010-06-18 | 2012-10-23 | Microsoft Corporation | Compound gesture-speech commands |
US8381108B2 (en) | 2010-06-21 | 2013-02-19 | Microsoft Corporation | Natural user input for driving interactive stories |
US8416187B2 (en) | 2010-06-22 | 2013-04-09 | Microsoft Corporation | Item navigation using motion-capture data |
US9075434B2 (en) | 2010-08-20 | 2015-07-07 | Microsoft Technology Licensing, Llc | Translating user motion into multiple object responses |
US8613666B2 (en) | 2010-08-31 | 2013-12-24 | Microsoft Corporation | User selection and navigation based on looped motions |
US8655093B2 (en) | 2010-09-02 | 2014-02-18 | Edge 3 Technologies, Inc. | Method and apparatus for performing segmentation of an image |
US8666144B2 (en) | 2010-09-02 | 2014-03-04 | Edge 3 Technologies, Inc. | Method and apparatus for determining disparity of texture |
WO2012030872A1 (fr) | 2010-09-02 | 2012-03-08 | Edge3 Technologies Inc. | Procédé et dispositif d'apprentissage par l'erreur |
US8582866B2 (en) | 2011-02-10 | 2013-11-12 | Edge 3 Technologies, Inc. | Method and apparatus for disparity computation in stereo images |
US20120058824A1 (en) | 2010-09-07 | 2012-03-08 | Microsoft Corporation | Scalable real-time motion recognition |
US8437506B2 (en) | 2010-09-07 | 2013-05-07 | Microsoft Corporation | System for fast, probabilistic skeletal tracking |
KR101708696B1 (ko) * | 2010-09-15 | 2017-02-21 | 엘지전자 주식회사 | 휴대 단말기 및 그 동작 제어방법 |
US8620024B2 (en) * | 2010-09-17 | 2013-12-31 | Sony Corporation | System and method for dynamic gesture recognition using geometric classification |
US8988508B2 (en) | 2010-09-24 | 2015-03-24 | Microsoft Technology Licensing, Llc. | Wide angle field of view active illumination imaging system |
US8681255B2 (en) | 2010-09-28 | 2014-03-25 | Microsoft Corporation | Integrated low power depth camera and projection device |
WO2012042390A2 (fr) * | 2010-09-30 | 2012-04-05 | France Telecom | Système d'interface utilisateur et son procédé de fonctionnement |
US8548270B2 (en) | 2010-10-04 | 2013-10-01 | Microsoft Corporation | Time-of-flight depth imaging |
CN101951474A (zh) * | 2010-10-12 | 2011-01-19 | 冠捷显示科技(厦门)有限公司 | 基于手势控制的电视技术 |
US9484065B2 (en) | 2010-10-15 | 2016-11-01 | Microsoft Technology Licensing, Llc | Intelligent determination of replays based on event identification |
JP2012094060A (ja) * | 2010-10-28 | 2012-05-17 | Sharp Corp | 電子装置 |
US8592739B2 (en) | 2010-11-02 | 2013-11-26 | Microsoft Corporation | Detection of configuration changes of an optical element in an illumination system |
US8866889B2 (en) | 2010-11-03 | 2014-10-21 | Microsoft Corporation | In-home depth camera calibration |
US8667519B2 (en) | 2010-11-12 | 2014-03-04 | Microsoft Corporation | Automatic passive and anonymous feedback system |
US10726861B2 (en) | 2010-11-15 | 2020-07-28 | Microsoft Technology Licensing, Llc | Semi-private communication in open environments |
US8730157B2 (en) * | 2010-11-15 | 2014-05-20 | Hewlett-Packard Development Company, L.P. | Hand pose recognition |
US9349040B2 (en) | 2010-11-19 | 2016-05-24 | Microsoft Technology Licensing, Llc | Bi-modal depth-image analysis |
US10234545B2 (en) | 2010-12-01 | 2019-03-19 | Microsoft Technology Licensing, Llc | Light source module |
US8872762B2 (en) | 2010-12-08 | 2014-10-28 | Primesense Ltd. | Three dimensional user interface cursor control |
US8553934B2 (en) | 2010-12-08 | 2013-10-08 | Microsoft Corporation | Orienting the position of a sensor |
US8618405B2 (en) | 2010-12-09 | 2013-12-31 | Microsoft Corp. | Free-space gesture musical instrument digital interface (MIDI) controller |
US8408706B2 (en) | 2010-12-13 | 2013-04-02 | Microsoft Corporation | 3D gaze tracker |
US9171264B2 (en) | 2010-12-15 | 2015-10-27 | Microsoft Technology Licensing, Llc | Parallel processing machine learning decision tree training |
US8920241B2 (en) | 2010-12-15 | 2014-12-30 | Microsoft Corporation | Gesture controlled persistent handles for interface guides |
US8884968B2 (en) | 2010-12-15 | 2014-11-11 | Microsoft Corporation | Modeling an object from image data |
US8448056B2 (en) | 2010-12-17 | 2013-05-21 | Microsoft Corporation | Validation analysis of human target |
US8803952B2 (en) | 2010-12-20 | 2014-08-12 | Microsoft Corporation | Plural detector time-of-flight depth mapping |
US9821224B2 (en) | 2010-12-21 | 2017-11-21 | Microsoft Technology Licensing, Llc | Driving simulator control with virtual skeleton |
US8994718B2 (en) | 2010-12-21 | 2015-03-31 | Microsoft Technology Licensing, Llc | Skeletal control of three-dimensional virtual world |
US9848106B2 (en) | 2010-12-21 | 2017-12-19 | Microsoft Technology Licensing, Llc | Intelligent gameplay photo capture |
US9823339B2 (en) | 2010-12-21 | 2017-11-21 | Microsoft Technology Licensing, Llc | Plural anode time-of-flight sensor |
US8385596B2 (en) | 2010-12-21 | 2013-02-26 | Microsoft Corporation | First person shooter control with virtual skeleton |
US9123316B2 (en) | 2010-12-27 | 2015-09-01 | Microsoft Technology Licensing, Llc | Interactive content creation |
US8488888B2 (en) | 2010-12-28 | 2013-07-16 | Microsoft Corporation | Classification of posture states |
US8401242B2 (en) | 2011-01-31 | 2013-03-19 | Microsoft Corporation | Real-time camera tracking using depth maps |
US9247238B2 (en) | 2011-01-31 | 2016-01-26 | Microsoft Technology Licensing, Llc | Reducing interference between multiple infra-red depth cameras |
US8401225B2 (en) | 2011-01-31 | 2013-03-19 | Microsoft Corporation | Moving object segmentation using depth images |
US8587583B2 (en) | 2011-01-31 | 2013-11-19 | Microsoft Corporation | Three-dimensional environment reconstruction |
US8724887B2 (en) | 2011-02-03 | 2014-05-13 | Microsoft Corporation | Environmental modifications to mitigate environmental factors |
US8970589B2 (en) | 2011-02-10 | 2015-03-03 | Edge 3 Technologies, Inc. | Near-touch interaction with a stereo camera grid structured tessellations |
US8942917B2 (en) | 2011-02-14 | 2015-01-27 | Microsoft Corporation | Change invariant scene recognition by an agent |
US8497838B2 (en) | 2011-02-16 | 2013-07-30 | Microsoft Corporation | Push actuation of interface controls |
CN102096471B (zh) * | 2011-02-18 | 2013-04-10 | 广东威创视讯科技股份有限公司 | 一种基于机器视觉的人机交互方法 |
EP2678757B1 (fr) * | 2011-02-21 | 2017-08-16 | Koninklijke Philips N.V. | Système de reconnaissance gestuelle |
US9551914B2 (en) | 2011-03-07 | 2017-01-24 | Microsoft Technology Licensing, Llc | Illuminator with refractive optical element |
US9067136B2 (en) | 2011-03-10 | 2015-06-30 | Microsoft Technology Licensing, Llc | Push personalization of interface controls |
US9079313B2 (en) * | 2011-03-15 | 2015-07-14 | Microsoft Technology Licensing, Llc | Natural human to robot remote control |
US8571263B2 (en) | 2011-03-17 | 2013-10-29 | Microsoft Corporation | Predicting joint positions |
US9470778B2 (en) | 2011-03-29 | 2016-10-18 | Microsoft Technology Licensing, Llc | Learning from high quality depth measurements |
US9298287B2 (en) | 2011-03-31 | 2016-03-29 | Microsoft Technology Licensing, Llc | Combined activation for natural user interface systems |
US10642934B2 (en) | 2011-03-31 | 2020-05-05 | Microsoft Technology Licensing, Llc | Augmented conversational understanding architecture |
US9760566B2 (en) | 2011-03-31 | 2017-09-12 | Microsoft Technology Licensing, Llc | Augmented conversational understanding agent to identify conversation context between two humans and taking an agent action thereof |
US9842168B2 (en) | 2011-03-31 | 2017-12-12 | Microsoft Technology Licensing, Llc | Task driven user intents |
US8824749B2 (en) | 2011-04-05 | 2014-09-02 | Microsoft Corporation | Biometric recognition |
US8503494B2 (en) | 2011-04-05 | 2013-08-06 | Microsoft Corporation | Thermal management system |
US8620113B2 (en) | 2011-04-25 | 2013-12-31 | Microsoft Corporation | Laser diode modes |
US9259643B2 (en) | 2011-04-28 | 2016-02-16 | Microsoft Technology Licensing, Llc | Control of separate computer game elements |
US8702507B2 (en) | 2011-04-28 | 2014-04-22 | Microsoft Corporation | Manual and camera-based avatar control |
US8811719B2 (en) | 2011-04-29 | 2014-08-19 | Microsoft Corporation | Inferring spatial object descriptions from spatial gestures |
US10671841B2 (en) | 2011-05-02 | 2020-06-02 | Microsoft Technology Licensing, Llc | Attribute state classification |
US8888331B2 (en) | 2011-05-09 | 2014-11-18 | Microsoft Corporation | Low inductance light source module |
US9137463B2 (en) | 2011-05-12 | 2015-09-15 | Microsoft Technology Licensing, Llc | Adaptive high dynamic range camera |
US9064006B2 (en) | 2012-08-23 | 2015-06-23 | Microsoft Technology Licensing, Llc | Translating natural language utterances to keyword search queries |
US8788973B2 (en) | 2011-05-23 | 2014-07-22 | Microsoft Corporation | Three-dimensional gesture controlled avatar configuration interface |
US8760395B2 (en) | 2011-05-31 | 2014-06-24 | Microsoft Corporation | Gesture recognition techniques |
US8526734B2 (en) | 2011-06-01 | 2013-09-03 | Microsoft Corporation | Three-dimensional background removal for vision system |
US9594430B2 (en) | 2011-06-01 | 2017-03-14 | Microsoft Technology Licensing, Llc | Three-dimensional foreground selection for vision system |
US9098110B2 (en) | 2011-06-06 | 2015-08-04 | Microsoft Technology Licensing, Llc | Head rotation tracking from depth-based center of mass |
US9013489B2 (en) | 2011-06-06 | 2015-04-21 | Microsoft Technology Licensing, Llc | Generation of avatar reflecting player appearance |
US9208571B2 (en) | 2011-06-06 | 2015-12-08 | Microsoft Technology Licensing, Llc | Object digitization |
US10796494B2 (en) | 2011-06-06 | 2020-10-06 | Microsoft Technology Licensing, Llc | Adding attributes to virtual representations of real-world objects |
US8897491B2 (en) | 2011-06-06 | 2014-11-25 | Microsoft Corporation | System for finger recognition and tracking |
US8929612B2 (en) | 2011-06-06 | 2015-01-06 | Microsoft Corporation | System for recognizing an open or closed hand |
US8597142B2 (en) | 2011-06-06 | 2013-12-03 | Microsoft Corporation | Dynamic camera based practice mode |
US9724600B2 (en) | 2011-06-06 | 2017-08-08 | Microsoft Technology Licensing, Llc | Controlling objects in a virtual environment |
US9597587B2 (en) | 2011-06-08 | 2017-03-21 | Microsoft Technology Licensing, Llc | Locational node device |
US9207767B2 (en) | 2011-06-29 | 2015-12-08 | International Business Machines Corporation | Guide mode for gesture spaces |
US9459758B2 (en) | 2011-07-05 | 2016-10-04 | Apple Inc. | Gesture-based interface with enhanced features |
US8881051B2 (en) | 2011-07-05 | 2014-11-04 | Primesense Ltd | Zoom-based gesture user interface |
US9377865B2 (en) | 2011-07-05 | 2016-06-28 | Apple Inc. | Zoom-based gesture user interface |
US8891868B1 (en) | 2011-08-04 | 2014-11-18 | Amazon Technologies, Inc. | Recognizing gestures captured by video |
US10088924B1 (en) * | 2011-08-04 | 2018-10-02 | Amazon Technologies, Inc. | Overcoming motion effects in gesture recognition |
US9030498B2 (en) | 2011-08-15 | 2015-05-12 | Apple Inc. | Combining explicit select gestures and timeclick in a non-tactile three dimensional user interface |
US8786730B2 (en) | 2011-08-18 | 2014-07-22 | Microsoft Corporation | Image exposure using exclusion regions |
US9218063B2 (en) * | 2011-08-24 | 2015-12-22 | Apple Inc. | Sessionless pointing user interface |
CN102956132B (zh) * | 2011-08-25 | 2015-02-25 | 赛恩倍吉科技顾问(深圳)有限公司 | 手语翻译系统、手语翻译装置及手语翻译方法 |
KR20130047890A (ko) * | 2011-11-01 | 2013-05-09 | 삼성전기주식회사 | 원격 조정 장치 및 원격 조정 장치의 제스처 인식 방법 |
US8793118B2 (en) | 2011-11-01 | 2014-07-29 | PES School of Engineering | Adaptive multimodal communication assist system |
US9557836B2 (en) | 2011-11-01 | 2017-01-31 | Microsoft Technology Licensing, Llc | Depth image compression |
US9117281B2 (en) | 2011-11-02 | 2015-08-25 | Microsoft Corporation | Surface segmentation from RGB and depth images |
US8854426B2 (en) | 2011-11-07 | 2014-10-07 | Microsoft Corporation | Time-of-flight camera with guided light |
US9672609B1 (en) | 2011-11-11 | 2017-06-06 | Edge 3 Technologies, Inc. | Method and apparatus for improved depth-map estimation |
US8724906B2 (en) | 2011-11-18 | 2014-05-13 | Microsoft Corporation | Computing pose and/or shape of modifiable entities |
CN102509088B (zh) * | 2011-11-28 | 2014-01-08 | Tcl集团股份有限公司 | 一种手部运动检测的方法、装置及人机交互系统 |
US8509545B2 (en) | 2011-11-29 | 2013-08-13 | Microsoft Corporation | Foreground subject detection |
US8635637B2 (en) | 2011-12-02 | 2014-01-21 | Microsoft Corporation | User interface presenting an animated avatar performing a media reaction |
US8803800B2 (en) | 2011-12-02 | 2014-08-12 | Microsoft Corporation | User interface control based on head orientation |
EP2788839A4 (fr) * | 2011-12-06 | 2015-12-16 | Thomson Licensing | Procédé et système pour répondre à un geste de sélection, par un utilisateur, d'un objet affiché en trois dimensions |
US9100685B2 (en) | 2011-12-09 | 2015-08-04 | Microsoft Technology Licensing, Llc | Determining audience state or interest using passive sensor data |
US8879831B2 (en) | 2011-12-15 | 2014-11-04 | Microsoft Corporation | Using high-level attributes to guide image processing |
US8630457B2 (en) | 2011-12-15 | 2014-01-14 | Microsoft Corporation | Problem states for pose tracking pipeline |
US8971612B2 (en) | 2011-12-15 | 2015-03-03 | Microsoft Corporation | Learning image processing tasks from scene reconstructions |
US8811938B2 (en) | 2011-12-16 | 2014-08-19 | Microsoft Corporation | Providing a user interface experience based on inferred vehicle state |
US9342139B2 (en) | 2011-12-19 | 2016-05-17 | Microsoft Technology Licensing, Llc | Pairing a computing device to a user |
KR101896473B1 (ko) * | 2012-01-04 | 2018-10-24 | 삼성전자주식회사 | 로봇 핸드의 제어 방법 |
US9720089B2 (en) | 2012-01-23 | 2017-08-01 | Microsoft Technology Licensing, Llc | 3D zoom imager |
US9229534B2 (en) | 2012-02-28 | 2016-01-05 | Apple Inc. | Asymmetric mapping for tactile and non-tactile user interfaces |
US8898687B2 (en) | 2012-04-04 | 2014-11-25 | Microsoft Corporation | Controlling a media program based on a media reaction |
US9210401B2 (en) | 2012-05-03 | 2015-12-08 | Microsoft Technology Licensing, Llc | Projected visual cues for guiding physical movement |
CA2775700C (fr) | 2012-05-04 | 2013-07-23 | Microsoft Corporation | Determination d'une portion future d'une emission multimedia en cours de presentation |
WO2013166513A2 (fr) * | 2012-05-04 | 2013-11-07 | Oblong Industries, Inc. | Interface utilisateur à suivi de la main et reconnaissance de forme inter-utilisateur |
US9349207B2 (en) | 2012-05-31 | 2016-05-24 | Samsung Electronics Co., Ltd. | Apparatus and method for parsing human body image |
TWI450024B (zh) * | 2012-06-05 | 2014-08-21 | Wistron Corp | 立體深度影像建立系統及其方法 |
KR101911133B1 (ko) | 2012-06-21 | 2018-10-23 | 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 | 깊이 카메라를 이용한 아바타 구성 |
US9836590B2 (en) | 2012-06-22 | 2017-12-05 | Microsoft Technology Licensing, Llc | Enhanced accuracy of user presence status determination |
US9696427B2 (en) | 2012-08-14 | 2017-07-04 | Microsoft Technology Licensing, Llc | Wide angle depth detection |
US9423939B2 (en) * | 2012-11-12 | 2016-08-23 | Microsoft Technology Licensing, Llc | Dynamic adjustment of user interface |
KR20140063272A (ko) * | 2012-11-16 | 2014-05-27 | 엘지전자 주식회사 | 영상표시장치, 및 그 동작방법 |
US8882310B2 (en) | 2012-12-10 | 2014-11-11 | Microsoft Corporation | Laser die light source module with low inductance |
US20140173524A1 (en) * | 2012-12-14 | 2014-06-19 | Microsoft Corporation | Target and press natural user input |
KR101956073B1 (ko) * | 2012-12-20 | 2019-03-08 | 삼성전자주식회사 | 시각적 인디케이터를 이용하여 사용자 인터페이스를 제공하는 3차원 입체 영상 표시 장치 및 그 장치를 이용한 방법 |
US9857470B2 (en) | 2012-12-28 | 2018-01-02 | Microsoft Technology Licensing, Llc | Using photometric stereo for 3D environment modeling |
US9251590B2 (en) | 2013-01-24 | 2016-02-02 | Microsoft Technology Licensing, Llc | Camera pose estimation for 3D reconstruction |
US9052746B2 (en) | 2013-02-15 | 2015-06-09 | Microsoft Technology Licensing, Llc | User center-of-mass and mass distribution extraction using depth images |
US9940553B2 (en) | 2013-02-22 | 2018-04-10 | Microsoft Technology Licensing, Llc | Camera/object pose from predicted coordinates |
US9135516B2 (en) | 2013-03-08 | 2015-09-15 | Microsoft Technology Licensing, Llc | User body angle, curvature and average extremity positions extraction using depth images |
US9524028B2 (en) | 2013-03-08 | 2016-12-20 | Fastvdo Llc | Visual language for human computer interfaces |
US9092657B2 (en) | 2013-03-13 | 2015-07-28 | Microsoft Technology Licensing, Llc | Depth image processing |
US9274606B2 (en) | 2013-03-14 | 2016-03-01 | Microsoft Technology Licensing, Llc | NUI video conference controls |
US10721448B2 (en) | 2013-03-15 | 2020-07-21 | Edge 3 Technologies, Inc. | Method and apparatus for adaptive exposure bracketing, segmentation and scene organization |
CN103214824B (zh) * | 2013-03-26 | 2015-10-21 | 安徽瑞之星电缆集团有限公司 | 一种等规聚丙烯绝缘辐照交联电缆料及其制备方法 |
US9953213B2 (en) | 2013-03-27 | 2018-04-24 | Microsoft Technology Licensing, Llc | Self discovery of autonomous NUI devices |
US9442186B2 (en) | 2013-05-13 | 2016-09-13 | Microsoft Technology Licensing, Llc | Interference reduction for TOF systems |
US9829984B2 (en) * | 2013-05-23 | 2017-11-28 | Fastvdo Llc | Motion-assisted visual language for human computer interfaces |
US9462253B2 (en) | 2013-09-23 | 2016-10-04 | Microsoft Technology Licensing, Llc | Optical modules that reduce speckle contrast and diffraction artifacts |
US9443310B2 (en) | 2013-10-09 | 2016-09-13 | Microsoft Technology Licensing, Llc | Illumination modules that emit structured light |
US9674563B2 (en) | 2013-11-04 | 2017-06-06 | Rovi Guides, Inc. | Systems and methods for recommending content |
US9769459B2 (en) | 2013-11-12 | 2017-09-19 | Microsoft Technology Licensing, Llc | Power efficient laser diode driver circuit and method |
US9508385B2 (en) | 2013-11-21 | 2016-11-29 | Microsoft Technology Licensing, Llc | Audio-visual project generator |
EP3090321A4 (fr) | 2014-01-03 | 2017-07-05 | Harman International Industries, Incorporated | Système audio spatial à porter utilisant l'interaction gestuelle |
US9971491B2 (en) | 2014-01-09 | 2018-05-15 | Microsoft Technology Licensing, Llc | Gesture library for natural user input |
US9990046B2 (en) | 2014-03-17 | 2018-06-05 | Oblong Industries, Inc. | Visual collaboration interface |
US10146318B2 (en) | 2014-06-13 | 2018-12-04 | Thomas Malzbender | Techniques for using gesture recognition to effectuate character selection |
CN104408395A (zh) * | 2014-06-26 | 2015-03-11 | 青岛海信电器股份有限公司 | 一种手势识别方法和系统 |
CN104317385A (zh) * | 2014-06-26 | 2015-01-28 | 青岛海信电器股份有限公司 | 一种手势识别方法和系统 |
JP6603024B2 (ja) | 2015-02-10 | 2019-11-06 | 任天堂株式会社 | 情報処理プログラム、情報処理装置、情報処理システム、および、情報処理方法 |
JP6534011B2 (ja) | 2015-02-10 | 2019-06-26 | 任天堂株式会社 | 情報処理装置、情報処理プログラム、情報処理システム、および、情報処理方法 |
JP6519075B2 (ja) * | 2015-02-10 | 2019-05-29 | 任天堂株式会社 | 情報処理装置、情報処理プログラム、情報処理システム、および、情報処理方法 |
JP6561400B2 (ja) * | 2015-02-10 | 2019-08-21 | 任天堂株式会社 | 情報処理装置、情報処理プログラム、情報処理システム、および、情報処理方法 |
US9846808B2 (en) * | 2015-12-31 | 2017-12-19 | Adaptive Computation, Llc | Image integration search based on human visual pathway model |
US10412280B2 (en) | 2016-02-10 | 2019-09-10 | Microsoft Technology Licensing, Llc | Camera with light valve over sensor array |
US10257932B2 (en) | 2016-02-16 | 2019-04-09 | Microsoft Technology Licensing, Llc. | Laser diode chip on printed circuit board |
US10462452B2 (en) | 2016-03-16 | 2019-10-29 | Microsoft Technology Licensing, Llc | Synchronizing active illumination cameras |
WO2017200571A1 (fr) | 2016-05-16 | 2017-11-23 | Google Llc | Commande gestuelle d'une interface utilisateur |
US10529302B2 (en) | 2016-07-07 | 2020-01-07 | Oblong Industries, Inc. | Spatially mediated augmentations of and interactions among distinct devices and applications via extended pixel manifold |
US20180088671A1 (en) * | 2016-09-27 | 2018-03-29 | National Kaohsiung University Of Applied Sciences | 3D Hand Gesture Image Recognition Method and System Thereof |
CN106909938B (zh) * | 2017-02-16 | 2020-02-21 | 青岛科技大学 | 基于深度学习网络的视角无关性行为识别方法 |
CN113924568A (zh) | 2019-06-26 | 2022-01-11 | 谷歌有限责任公司 | 基于雷达的认证状态反馈 |
US11385722B2 (en) | 2019-07-26 | 2022-07-12 | Google Llc | Robust radar-based gesture-recognition by user equipment |
CN118444784A (zh) | 2019-07-26 | 2024-08-06 | 谷歌有限责任公司 | 基于雷达的姿势识别的情境敏感控制 |
JP7316383B2 (ja) | 2019-07-26 | 2023-07-27 | グーグル エルエルシー | Imuおよびレーダーを介した認証管理 |
US11868537B2 (en) | 2019-07-26 | 2024-01-09 | Google Llc | Robust radar-based gesture-recognition by user equipment |
JP7292437B2 (ja) | 2019-07-26 | 2023-06-16 | グーグル エルエルシー | Imuおよびレーダーに基づいて状態を下げること |
US11080383B2 (en) * | 2019-08-09 | 2021-08-03 | BehavioSec Inc | Radar-based behaviometric user authentication |
KR20210151957A (ko) | 2019-08-30 | 2021-12-14 | 구글 엘엘씨 | 모바일 디바이스에 대한 입력 방법 |
CN113892072B (zh) | 2019-08-30 | 2024-08-09 | 谷歌有限责任公司 | 启用用于暂停的雷达姿势的视觉指示器的方法和电子设备 |
US11467672B2 (en) | 2019-08-30 | 2022-10-11 | Google Llc | Context-sensitive control of radar-based gesture-recognition |
KR20220098805A (ko) | 2019-08-30 | 2022-07-12 | 구글 엘엘씨 | 다중 입력 모드에 대한 입력 모드 통지 |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5759044A (en) * | 1990-02-22 | 1998-06-02 | Redmond Productions | Methods and apparatus for generating and processing synthetic and absolute real time environments |
US5887069A (en) * | 1992-03-10 | 1999-03-23 | Hitachi, Ltd. | Sign recognition apparatus and method and sign translation system using same |
US5659764A (en) * | 1993-02-25 | 1997-08-19 | Hitachi, Ltd. | Sign language generation apparatus and sign language translation apparatus |
EP0650301B1 (fr) * | 1993-10-26 | 2000-05-31 | Matsushita Electric Industrial Co., Ltd. | Appareil d'affichage d'images tridimensionnelles |
JP3630712B2 (ja) * | 1994-02-03 | 2005-03-23 | キヤノン株式会社 | ジェスチャー入力方法及びその装置 |
US5732227A (en) * | 1994-07-05 | 1998-03-24 | Hitachi, Ltd. | Interactive information processing system responsive to user manipulation of physical objects and displayed images |
JPH08115408A (ja) | 1994-10-19 | 1996-05-07 | Hitachi Ltd | 手話認識装置 |
JPH08331473A (ja) * | 1995-05-29 | 1996-12-13 | Hitachi Ltd | テレビジョン信号の表示装置 |
US6002808A (en) * | 1996-07-26 | 1999-12-14 | Mitsubishi Electric Information Technology Center America, Inc. | Hand gesture control system |
US6028960A (en) * | 1996-09-20 | 2000-02-22 | Lucent Technologies Inc. | Face feature analysis for automatic lipreading and character animation |
1998
- 1998-09-23 EP EP98118008A patent/EP0905644A3/fr not_active Ceased
- 1998-09-25 CN CNB981207650A patent/CN1139893C/zh not_active Expired - Fee Related
- 1998-09-25 US US09/160,048 patent/US6215890B1/en not_active Expired - Fee Related
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0774730A2 (fr) * | 1995-11-01 | 1997-05-21 | Canon Kabushiki Kaisha | Procédé d'extraction d'objets, et appareil de prise d'images utilisant ce procédé |
Non-Patent Citations (3)
Title |
---|
PAVLOVIC V I ET AL: "Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review" IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, IEEE INC. NEW YORK, US, vol. 19, no. 7, 1 July 1997 (1997-07-01), pages 677-695, XP000698168 ISSN: 0162-8828 * |
TAMURA S ET AL: "RECOGNITION OF SIGN LANGUAGE MOTION IMAGES" PATTERN RECOGNITION, PERGAMON PRESS INC. ELMSFORD, N.Y, US, vol. 21, no. 4, 1988, pages 343-353, XP000195812 ISSN: 0031-3203 * |
UTSUMI A ET AL: "Hand Gesture Recognition System Using Multiple Cameras" PROCEEDINGS ICPR INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VIENNA AU, vol. 1, 25 - 30 August 1996, pages 667-671, XP000675717 * |
Cited By (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001069365A1 (fr) * | 2000-03-13 | 2001-09-20 | Ab In Credoble | Systeme de reconnaissance de gestuelle |
US7129927B2 (en) | 2000-03-13 | 2006-10-31 | Hans Arvid Mattson | Gesture recognition system |
US7227526B2 (en) | 2000-07-24 | 2007-06-05 | Gesturetek, Inc. | Video-based image control system |
WO2002007839A2 (fr) * | 2000-07-24 | 2002-01-31 | Jestertek, Inc. | Systeme de controle d'images video |
US7898522B2 (en) | 2000-07-24 | 2011-03-01 | Gesturetek, Inc. | Video-based image control system |
WO2002007839A3 (fr) * | 2000-07-24 | 2003-07-10 | Jestertek Inc | Systeme de controle d'images video |
US8274535B2 (en) | 2000-07-24 | 2012-09-25 | Qualcomm Incorporated | Video-based image control system |
US8624932B2 (en) | 2000-07-24 | 2014-01-07 | Qualcomm Incorporated | Video-based image control system |
EP1967941A3 (fr) * | 2000-07-24 | 2008-11-19 | GestureTek, Inc. | Système de commande d'image vidéo |
US8963963B2 (en) | 2000-07-24 | 2015-02-24 | Qualcomm Incorporated | Video-based image control system |
WO2002029722A3 (fr) * | 2000-10-03 | 2003-09-25 | Jestertek Inc | Systeme de controle a cameras multiples |
AU2001294970C1 (en) * | 2000-10-03 | 2008-07-24 | Gesturetek, Inc. | Object tracking system using multiple cameras |
US7421093B2 (en) | 2000-10-03 | 2008-09-02 | Gesturetek, Inc. | Multiple camera control system |
US7555142B2 (en) | 2000-10-03 | 2009-06-30 | Gesturetek, Inc. | Multiple camera control system |
US8131015B2 (en) | 2000-10-03 | 2012-03-06 | Qualcomm Incorporated | Multiple camera control system |
US8625849B2 (en) | 2000-10-03 | 2014-01-07 | Qualcomm Incorporated | Multiple camera control system |
WO2002029722A2 (fr) * | 2000-10-03 | 2002-04-11 | Jestertek, Inc. | Systeme de controle a cameras multiples |
WO2003071410A3 (fr) * | 2002-02-15 | 2004-03-18 | Canesta Inc | Systeme de reconnaissance de geste utilisant des capteurs de perception de profondeur |
WO2003071410A2 (fr) * | 2002-02-15 | 2003-08-28 | Canesta, Inc. | Systeme de reconnaissance de geste utilisant des capteurs de perception de profondeur |
FR2847357A1 (fr) * | 2002-11-19 | 2004-05-21 | Simag Dev | Methode de commande d'une machine au moyen de la position d'un objet mobile |
WO2005078558A1 (fr) * | 2004-02-16 | 2005-08-25 | Simone Soria | Procede permettant de generer des signaux de commande, notamment pour des utilisateurs handicapes |
US7725547B2 (en) | 2006-09-06 | 2010-05-25 | International Business Machines Corporation | Informing a user of gestures made by others out of the user's line of sight |
US7801332B2 (en) | 2007-01-12 | 2010-09-21 | International Business Machines Corporation | Controlling a system based on user behavioral signals detected from a 3D captured image stream |
WO2008084053A1 (fr) * | 2007-01-12 | 2008-07-17 | International Business Machines Corporation | Adaptation d'une expérience de consommateur basée sur un flux d'images capturées 3d d'une réponse de comsommateur |
US8269834B2 (en) | 2007-01-12 | 2012-09-18 | International Business Machines Corporation | Warning a user about adverse behaviors of others within an environment based on a 3D captured image stream |
US7877706B2 (en) | 2007-01-12 | 2011-01-25 | International Business Machines Corporation | Controlling a document based on user behavioral signals detected from a 3D captured image stream |
US7971156B2 (en) | 2007-01-12 | 2011-06-28 | International Business Machines Corporation | Controlling resource access based on user gesturing in a 3D captured image stream of the user |
US10354127B2 (en) | 2007-01-12 | 2019-07-16 | Sinoeast Concept Limited | System, method, and computer program product for alerting a supervising user of adverse behavior of others within an environment by providing warning signals to alert the supervising user that a predicted behavior of a monitored user represents an adverse behavior |
US7840031B2 (en) | 2007-01-12 | 2010-11-23 | International Business Machines Corporation | Tracking a range of body movement based on 3D captured image streams of a user |
US7792328B2 (en) | 2007-01-12 | 2010-09-07 | International Business Machines Corporation | Warning a vehicle operator of unsafe operation behavior based on a 3D captured image stream |
TWI469101B (zh) * | 2009-12-23 | 2015-01-11 | Chi Mei Comm Systems Inc | 手語識別系統及方法 |
WO2012128399A1 (fr) * | 2011-03-21 | 2012-09-27 | Lg Electronics Inc. | Dispositif d'affichage et procédé de commande associé |
EP3043238A1 (fr) * | 2011-09-15 | 2016-07-13 | Koninklijke Philips N.V. | Interface utilisateur à base de gestes avec rétroaction |
US9910502B2 (en) | 2011-09-15 | 2018-03-06 | Koninklijke Philips N.V. | Gesture-based user-interface with user-feedback |
CN103226692A (zh) * | 2012-11-22 | 2013-07-31 | 广东科学中心 | 一种视频流图像帧的识别系统及其方法 |
US9704350B1 (en) | 2013-03-14 | 2017-07-11 | Harmonix Music Systems, Inc. | Musical combat game |
TWI501205B (zh) * | 2014-07-04 | 2015-09-21 | Sabuz Tech Co Ltd | 手語圖像輸入方法及裝置 |
US9524656B2 (en) | 2014-07-04 | 2016-12-20 | Sabuz Tech. Co., Ltd. | Sign language image input method and device |
Also Published As
Publication number | Publication date |
---|---|
CN1139893C (zh) | 2004-02-25 |
EP0905644A3 (fr) | 2004-02-25 |
CN1218936A (zh) | 1999-06-09 |
US6215890B1 (en) | 2001-04-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0905644A2 (fr) | Dispositif de reconnaissance de gestes de la main | |
US7720285B2 (en) | Head detecting apparatus, head detecting method, and head detecting program | |
US8971585B2 (en) | Image processing apparatus for retrieving object from moving image and method thereof | |
US9542003B2 (en) | Image processing device, image processing method, and a computer-readable non-transitory medium | |
EP0991011A2 (fr) | Méthode et dispositif pour la segmentation de gestes de la main | |
MX2011012725A (es) | Aparato de busqueda de imagenes y metodo de busqueda de imagenes. | |
CN106648078B (zh) | 应用于智能机器人的多模态交互方法及系统 | |
KR101612605B1 (ko) | 얼굴 특징점 추출 방법 및 이를 수행하는 장치 | |
JP2001216515A (ja) | 人物の顔の検出方法およびその装置 | |
CN104636725A (zh) | 一种基于深度图像的手势识别方法与系统 | |
KR20140002007A (ko) | 정보 처리 장치, 정보 처리 방법 및 기록 매체 | |
CN104364733A (zh) | 注视位置检测装置、注视位置检测方法和注视位置检测程序 | |
US11798177B2 (en) | Hand tracking method, device and system | |
CN105912126B (zh) | 一种手势运动映射到界面的增益自适应调整方法 | |
CN112633084A (zh) | 人脸框确定方法、装置、终端设备及存储介质 | |
WO2013051681A1 (fr) | Dispositif d'estimation de forme de doigt, procédé d'estimation de forme de doigt et programme d'estimation de forme de doigt | |
JP2010104754A (ja) | 情動分析装置 | |
Plouffe et al. | Natural human-computer interaction using static and dynamic hand gestures | |
JPH11174948A (ja) | 手動作認識装置 | |
KR101089847B1 (ko) | 얼굴 인식을 위한 sift 알고리즘을 이용한 키포인트 매칭 시스템 및 방법 | |
US20170336874A1 (en) | Method and apparatus for processing hand gesture command for media-centric wearable electronic device | |
JP2016099643A (ja) | 画像処理装置、画像処理方法および画像処理プログラム | |
KR101541421B1 (ko) | 손 자세 인식 기반 사용자 인터페이스 방법 및 시스템 | |
JP2000172163A (ja) | 手動作分節方法および装置 | |
JP2000149025A (ja) | ジェスチャ認識装置及び方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A2
Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
RAP3 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: COMMUNICATION RESEARCH LABORATORY, MINISTRY OF PO
Owner name: MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
AK | Designated contracting states |
Kind code of ref document: A3
Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
AX | Request for extension of the european patent |
Extension state: AL LT LV MK RO SI |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: 7G 06K 9/00 B
Ipc: 7H 04N 13/00 B
Ipc: 7G 06F 3/00 B
Ipc: 7G 06F 3/033 A |
|
17P | Request for examination filed |
Effective date: 20040415 |
|
AKX | Designation fees paid |
Designated state(s): DE FR GB |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: NATIONAL INSTITUTE OF INFORMATION AND COMMUNICATIO
Owner name: MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
|
18R | Application refused |
Effective date: 20070722 |