DE19609052A1 - Speech generation device for character recognition - Google Patents
Speech generation device for character recognitionInfo
- Publication number
- DE19609052A1 DE19609052A1 DE19609052A DE19609052A DE19609052A1 DE 19609052 A1 DE19609052 A1 DE 19609052A1 DE 19609052 A DE19609052 A DE 19609052A DE 19609052 A DE19609052 A DE 19609052A DE 19609052 A1 DE19609052 A1 DE 19609052A1
- Authority
- DE
- Germany
- Prior art keywords
- signals
- camera
- arrangement according
- characters
- speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 230000005236 sound signal Effects 0.000 claims description 2
- 230000001771 impaired effect Effects 0.000 description 2
- 210000000887 face Anatomy 0.000 description 1
- 208000002780 macular degeneration Diseases 0.000 description 1
- 210000001525 retina Anatomy 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61F—FILTERS IMPLANTABLE INTO BLOOD VESSELS; PROSTHESES; DEVICES PROVIDING PATENCY TO, OR PREVENTING COLLAPSING OF, TUBULAR STRUCTURES OF THE BODY, e.g. STENTS; ORTHOPAEDIC, NURSING OR CONTRACEPTIVE DEVICES; FOMENTATION; TREATMENT OR PROTECTION OF EYES OR EARS; BANDAGES, DRESSINGS OR ABSORBENT PADS; FIRST-AID KITS
- A61F9/00—Methods or devices for treatment of the eyes; Devices for putting in contact-lenses; Devices to correct squinting; Apparatus to guide the blind; Protective devices for the eyes, carried on the body or in the hand
- A61F9/08—Devices or methods enabling eye-patients to replace direct visual perception by another kind of perception
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/142—Image acquisition using hand-held instruments; Constructional details of the instruments
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B21/00—Teaching, or communicating with, the blind, deaf or mute
- G09B21/04—Devices for conversing with the deaf-blind
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Business, Economics & Management (AREA)
- Ophthalmology & Optometry (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Educational Administration (AREA)
- Educational Technology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Biomedical Technology (AREA)
- Heart & Thoracic Surgery (AREA)
- Vascular Medicine (AREA)
- Life Sciences & Earth Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Image Processing (AREA)
Abstract
Description
Die Erfindung betrifft eine Anordnung von Hardware zum Erzeugen von Tonsignalen (Sprache) aus beliebig großen und verschieden weit entfernten Schriftzeichen und Formen.The invention relates to an arrangement of hardware for generating Sound signals (speech) from any size and at different distances Characters and shapes.
Dabei wird mit einer CCD-Kamera ein beliebiges Bild aufgenommen, die Daten werden ausgewertet und akustisch wiedergegeben. Das Gerät ist in erster Linie für sehbehinderte Personen gedacht, die zwar in der Lage sind, sich zu orientieren, aber nicht die Möglichkeit haben, Texte und Formen zu erkennnen, bzw. zu entziffern. Dies trifft besonders auf die Gruppe der sogenannten Makulopathien zu, bei denen nur die Netzhautmitte geschädigt ist, und die ca. 80% der sogenannten Zivilblinden stellen. Ein weiterer Anwendungskreis sind die Analphabeten.Any image is captured with a CCD camera Data are evaluated and reproduced acoustically. The device is in primarily intended for visually impaired people who are able to are to orient themselves, but do not have the opportunity to read texts and Recognize or decipher forms. This is particularly true of the Group of so-called maculopathies, in which only the middle of the retina is damaged, and about 80% of the so-called civil blind put. Another area of application is illiteracy.
Der Anwender kann mit Hilfe der CCD-Kamera ein Objekt seiner Wahl digitalisieren. Dieses Bild wird softwaregesteuert ausgewertet. Der Microcomputer wandelt die digitalen Daten in akustische Signale um, die dann über Lautsprecher wiedergegeben werden.The user can use the CCD camera to select an object of his choice digitize. This image is evaluated under software control. Of the Microcomputer converts the digital data into acoustic signals that then be played through speakers.
Zur Zeit werden zur Unterstützung von sehschwachen Personen Geräte verwendet, die entweder mittels Scanner Texte einscannen und mit OCR- Software auswerten und als Sprache ausgeben. Als zweite Möglichkeit wird ein Vergrößerungssystem verwendet, bei dem eine Textvorlage ausschnittsweise vergrößert wird. Die beiden Techniken benötigen eine Textvorlage, die eine bestimmte Größe nicht überschreiten darf. Weiterhin muß bei der Textvorlage eine bestimmte Höhe eingehalten werden.Devices are currently being used to support the visually impaired used, which either scan text using a scanner and use OCR Evaluate software and output it as language. As a second option a magnification system is used in which a text template is cut out is enlarged. The two techniques require a text template, which must not exceed a certain size. Farther a certain height must be observed for the text template.
Bei dem vorliegenden Gerät ist eine Einhaltung bestimmter Maße nicht erforderlich, weiterhin können auch weiter entfernte Schriftzeichen und bekannte Formen erkannt und ausgewertet werden. Zu erkennende Formen sind z. B. Verkehrszeichen, Bus-Aufschriften, Fahrpläne an Haltestellen, Gesichter usw. Auch der Abstand von der Kamera zum Objekt kann mittels Autofocussystem ermittelt, und dem Träger des erfindungsgemäßen Gerätes akustisch mitgeteilt werden. Der Anwender ist somit in der Lage, sich in seiner Umwelt besser zu orientieren und informieren.In the present device, compliance with certain dimensions is not required, further removed characters and known forms are recognized and evaluated. Shapes to be recognized are z. B. Traffic signs, bus inscriptions, timetables at stops, Faces etc. Also the distance from the camera to the object can be determined using an autofocus system, and the wearer of the invention Device be communicated acoustically. The user is therefore in the Able to better orientate and inform in its environment.
Claims (5)
- a) ersten für die Handhabung geeigneten Mitteln zum fotografieren vorhandener Schriftzeichen und Formen und deren digitaler Speicherung;
- b) zweiten Mitteln zum Empfang des mit den ersten Mitteln gespeicherten digitalisierten Bildes und zum Erkennen von Buchstaben, Ziffern und bekannten Formen aus der Bildinformation;
- c) dritten Mitteln zum Umwandeln der aus den zweiten Mitteln herrührenden Buchstaben, Ziffern und bekannten Formen in akustische Signale (Sprache),
- a) first means suitable for handling to photograph existing characters and shapes and their digital storage;
- b) second means for receiving the digitized image stored with the first means and for recognizing letters, numbers and known shapes from the image information;
- c) third means for converting the letters, numbers and known forms resulting from the second means into acoustic signals (speech),
- a1. eine Kamera mit hochauflösendem CCD-Sensor und gegebenenfalls mit Autofocussystem verwendet wird, die mit den zweiten Mitteln verbunden ist und
- a2. ein einziger Auslöseknopf für die komplette Durchführung der Operation von der Aufnahme bis zur Sprachausgabe ausreichend ist.
- a1. a camera with a high-resolution CCD sensor and possibly with an auto focus system is used, which is connected to the second means and
- a2. a single release button is sufficient for the complete execution of the operation from the recording to the voice output.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE19609052A DE19609052A1 (en) | 1996-03-08 | 1996-03-08 | Speech generation device for character recognition |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE19609052A DE19609052A1 (en) | 1996-03-08 | 1996-03-08 | Speech generation device for character recognition |
Publications (1)
Publication Number | Publication Date |
---|---|
DE19609052A1 true DE19609052A1 (en) | 1997-09-18 |
Family
ID=7787661
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE19609052A Withdrawn DE19609052A1 (en) | 1996-03-08 | 1996-03-08 | Speech generation device for character recognition |
Country Status (1)
Country | Link |
---|---|
DE (1) | DE19609052A1 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE19815073A1 (en) * | 1998-04-05 | 1999-10-14 | Gfal Sachsen Ev | PC audio tactile information system for the blind and those with vision problems |
DE10157921B4 (en) * | 2001-11-26 | 2004-06-24 | Hub, Andreas, Dr. | Portable video camera |
GB2405018A (en) * | 2004-07-24 | 2005-02-16 | Photolink | Text to speech for electronic programme guide |
GB2415079A (en) * | 2004-06-09 | 2005-12-14 | Darren Raymond Taylor | Portable OCR reader which produces synthesised speech output |
EP2772829A1 (en) * | 2013-02-28 | 2014-09-03 | King Saud University | System for enabling a visually impaired or blind person to use an input device having at least one key |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4811400A (en) * | 1984-12-27 | 1989-03-07 | Texas Instruments Incorporated | Method for transforming symbolic data |
US4841575A (en) * | 1985-11-14 | 1989-06-20 | British Telecommunications Public Limited Company | Image encoding and synthesis |
DE3901023A1 (en) * | 1989-01-14 | 1990-07-19 | Ulrich Dipl Ing Ritter | Reading device for blind or visually impaired persons with a scanner reading normal script |
DE8916023U1 (en) * | 1989-09-19 | 1992-12-03 | Medvey, Bela, 8255 Schwindegg | dictionary |
DE4123465A1 (en) * | 1991-07-16 | 1993-01-21 | Bernd Kamppeter | Text-to-speech converter using optical character recognition - reads scanned text into memory for reproduction by loudspeaker or on video screen at discretion of user |
DE9217643U1 (en) * | 1992-04-29 | 1993-06-03 | Siwoff, Ronald, Bridgewater, N.J. | Video glasses |
DE4339997A1 (en) * | 1993-11-24 | 1995-06-01 | Hossein Zahedi | Combined radio cassette and obstacle locator for blind people |
DE4400021A1 (en) * | 1994-01-03 | 1995-07-13 | Andreas Dante | Identifying colours of document original for colour blind persons |
US5444486A (en) * | 1992-03-30 | 1995-08-22 | Elmo Co., Ltd. | Portable image input equipment |
-
1996
- 1996-03-08 DE DE19609052A patent/DE19609052A1/en not_active Withdrawn
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4811400A (en) * | 1984-12-27 | 1989-03-07 | Texas Instruments Incorporated | Method for transforming symbolic data |
US4841575A (en) * | 1985-11-14 | 1989-06-20 | British Telecommunications Public Limited Company | Image encoding and synthesis |
DE3901023A1 (en) * | 1989-01-14 | 1990-07-19 | Ulrich Dipl Ing Ritter | Reading device for blind or visually impaired persons with a scanner reading normal script |
DE8916023U1 (en) * | 1989-09-19 | 1992-12-03 | Medvey, Bela, 8255 Schwindegg | dictionary |
DE4123465A1 (en) * | 1991-07-16 | 1993-01-21 | Bernd Kamppeter | Text-to-speech converter using optical character recognition - reads scanned text into memory for reproduction by loudspeaker or on video screen at discretion of user |
US5444486A (en) * | 1992-03-30 | 1995-08-22 | Elmo Co., Ltd. | Portable image input equipment |
DE9217643U1 (en) * | 1992-04-29 | 1993-06-03 | Siwoff, Ronald, Bridgewater, N.J. | Video glasses |
DE4339997A1 (en) * | 1993-11-24 | 1995-06-01 | Hossein Zahedi | Combined radio cassette and obstacle locator for blind people |
DE4400021A1 (en) * | 1994-01-03 | 1995-07-13 | Andreas Dante | Identifying colours of document original for colour blind persons |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE19815073A1 (en) * | 1998-04-05 | 1999-10-14 | Gfal Sachsen Ev | PC audio tactile information system for the blind and those with vision problems |
DE19815073C2 (en) * | 1998-04-05 | 2000-04-27 | Gfal Sachsen Ev Innovative Tec | Information system, especially for the blind |
DE10157921B4 (en) * | 2001-11-26 | 2004-06-24 | Hub, Andreas, Dr. | Portable video camera |
GB2415079A (en) * | 2004-06-09 | 2005-12-14 | Darren Raymond Taylor | Portable OCR reader which produces synthesised speech output |
GB2405018A (en) * | 2004-07-24 | 2005-02-16 | Photolink | Text to speech for electronic programme guide |
GB2405018B (en) * | 2004-07-24 | 2005-06-29 | Photolink | Electronic programme guide comprising speech synthesiser |
EP2772829A1 (en) * | 2013-02-28 | 2014-09-03 | King Saud University | System for enabling a visually impaired or blind person to use an input device having at least one key |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69634740T2 (en) | System for speech recognition and translation | |
CN106657865B (en) | Conference summary generation method and device and video conference system | |
DE69526871T2 (en) | SIGNALING TELEPHONE SYSTEM FOR COMMUNICATION BETWEEN HEARING AND NON-HEARING | |
US4757541A (en) | Audio visual speech recognition | |
DE69429235T2 (en) | Method and apparatus for displaying sign language images corresponding to text or language | |
Elrefaei et al. | An Arabic visual dataset for visual speech recognition | |
DE112019002205T5 (en) | REAL-TIME NOTIFICATION OF SYMPTOMS IN TELEMEDICINE | |
DE19609052A1 (en) | Speech generation device for character recognition | |
EP1187095A3 (en) | Grapheme-phoneme assignment | |
WO2002049003A1 (en) | Method and system for converting text to speech | |
DE4123465A1 (en) | Text-to-speech converter using optical character recognition - reads scanned text into memory for reproduction by loudspeaker or on video screen at discretion of user | |
DE102016003401B4 (en) | Acquisition device and method for acquiring a speech utterance by a speaking person in a motor vehicle | |
CN109919127B (en) | Mute language conversion system | |
Palo et al. | Pre-speech tongue movements recorded with ultrasound | |
JP3254542B2 (en) | News transmission device for the hearing impaired | |
DE112018006597B4 (en) | Speech processing device and speech processing method | |
JPH0139147B2 (en) | ||
Yeung et al. | Pitch range, intensity, and vocal fry in non-native and native English focus intonation | |
Suganuma et al. | Effect of Japanese utterance training using lip movement | |
DE102021116285A1 (en) | Method and arrangement for converting and transmitting instructional content and presentations | |
KR100235194B1 (en) | Sign language recognition system | |
DE10157921B4 (en) | Portable video camera | |
DE10056762B4 (en) | Method for creating electronic messages | |
Nishizaka | The neglected situation of vision in experimental psychology | |
DE10003898A1 (en) | Mobile telephone has a facility to scan documents and transmit data to a remote computer |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
OM8 | Search report available as to paragraph 43 lit. 1 sentence 1 patent law | ||
8139 | Disposal/non-payment of the annual fee |