DE19609052A1 - Speech generation device for character recognition - Google Patents

Speech generation device for character recognition

Info

Publication number
DE19609052A1
Authority
DE
Germany
Prior art keywords
signals
camera
arrangement according
characters
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
DE19609052A
Other languages
German (de)
Inventor
Bernd Dr Med Kamppeter
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1996-03-08
Filing date
1996-03-08
Publication date
1997-09-18
Application filed by Individual
Priority to DE19609052A
Publication of DE19609052A1
Current legal status: Withdrawn

Classifications

    • A: HUMAN NECESSITIES
    • A61: MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61F: FILTERS IMPLANTABLE INTO BLOOD VESSELS; PROSTHESES; DEVICES PROVIDING PATENCY TO, OR PREVENTING COLLAPSING OF, TUBULAR STRUCTURES OF THE BODY, e.g. STENTS; ORTHOPAEDIC, NURSING OR CONTRACEPTIVE DEVICES; FOMENTATION; TREATMENT OR PROTECTION OF EYES OR EARS; BANDAGES, DRESSINGS OR ABSORBENT PADS; FIRST-AID KITS
    • A61F9/00: Methods or devices for treatment of the eyes; Devices for putting in contact-lenses; Devices to correct squinting; Apparatus to guide the blind; Protective devices for the eyes, carried on the body or in the hand
    • A61F9/08: Devices or methods enabling eye-patients to replace direct visual perception by another kind of perception
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00: Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10: Character recognition
    • G06V30/14: Image acquisition
    • G06V30/142: Image acquisition using hand-held instruments; Constructional details of the instruments
    • G: PHYSICS
    • G09: EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B: EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B21/00: Teaching, or communicating with, the blind, deaf or mute
    • G09B21/04: Devices for conversing with the deaf-blind
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00: Speech synthesis; Text to speech systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Business, Economics & Management (AREA)
  • Ophthalmology & Optometry (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Biomedical Technology (AREA)
  • Heart & Thoracic Surgery (AREA)
  • Vascular Medicine (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • Image Processing (AREA)

Abstract

The device includes a CCD camera for taking pictures of characters of various sizes and forms. The camera includes an automatic focussing facility. The signals obtained are digitised and fed to a microcomputer which identifies the characters. The digital signals are converted into acoustic signals which are provided to either a loudspeaker or headphones.

Description

The invention relates to an arrangement of hardware for generating sound signals (speech) from characters and shapes of any size and at varying distances.

An arbitrary image is captured with a CCD camera, and the data are evaluated and reproduced acoustically. The device is intended primarily for visually impaired people who are able to orient themselves but cannot recognize or decipher text and shapes. This applies in particular to the group of so-called maculopathies, in which only the center of the retina is damaged and which account for about 80% of the so-called civilian blind. Illiterate people are a further group of users.

The user can digitize an object of their choice with the CCD camera. This image is evaluated under software control. The microcomputer converts the digital data into acoustic signals, which are then played back through loudspeakers.
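The three steps described above (capture, software evaluation, acoustic output) can be illustrated with a minimal Python sketch. It is not the patent's implementation: the 3x3 glyph templates, the function names and the spoken token format are all hypothetical stand-ins for a real camera driver, character recognizer and speech synthesizer.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Glyph = Tuple[str, ...]  # one character as rows of '#' (ink) and '.' (blank)

# Hypothetical 3x3 templates standing in for a real character recognizer.
TEMPLATES: Dict[Glyph, str] = {
    ("#.#", "###", "#.#"): "H",
    ("###", ".#.", "###"): "I",
}


def recognize(glyphs: Sequence[Glyph]) -> str:
    """Map each captured glyph to a character; '?' marks an unknown shape."""
    return "".join(TEMPLATES.get(tuple(g), "?") for g in glyphs)


def synthesize(text: str) -> List[str]:
    """Stand-in for speech synthesis: one spoken token per character."""
    return ["unknown shape" if c == "?" else f"letter {c}" for c in text]


def run_pipeline(
    capture: Callable[[], Sequence[Glyph]],
    play: Callable[[List[str]], None],
) -> str:
    """Capture an image, evaluate it, and hand the result to the audio output."""
    image = capture()          # step 1: digitized image from the camera
    text = recognize(image)    # step 2: software-controlled evaluation
    play(synthesize(text))     # step 3: acoustic reproduction
    return text


def fake_capture() -> List[Glyph]:
    """Assumed test input: two glyphs as a camera stub would deliver them."""
    return [("#.#", "###", "#.#"), ("###", ".#.", "###")]
```

Passing `fake_capture` and a list's `extend` method as `play` lets the whole chain be exercised without any hardware.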

At present, two kinds of devices are used to support people with low vision. The first scans text with a scanner, evaluates it with OCR software and outputs it as speech. The second is a magnification system that enlarges sections of a text original. Both techniques require a text original that must not exceed a certain size. In addition, a certain height must be observed for the text original.

With the present device, compliance with particular dimensions is not required; moreover, more distant characters and known shapes can also be recognized and evaluated. Shapes to be recognized include, for example, traffic signs, bus destination signs, timetables at stops and faces. The distance from the camera to the object can also be determined by means of the autofocus system and communicated acoustically to the wearer of the device according to the invention. The user is thus able to orient and inform themselves better in their environment.
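As a rough illustration of the acoustic distance report, the following sketch turns a distance value into a phrase that a speech synthesizer could read out. The source does not say how the autofocus system reports distance or how the message is worded, so the input unit (meters), the function name `describe_distance` and the phrasing are assumptions.

```python
def describe_distance(meters: float) -> str:
    """Format an autofocus distance (assumed to be in meters) as a spoken phrase."""
    if meters < 0:
        raise ValueError("distance must not be negative")
    centimeters = round(meters * 100)
    if centimeters < 100:
        # Short range: whole centimeters are easier to follow by ear.
        return f"Object at {centimeters} centimeters"
    # Longer range: meters with one decimal place.
    return f"Object at {meters:.1f} meters"
```

For example, `describe_distance(0.45)` yields a phrase in centimeters, while `describe_distance(3.26)` yields one in meters.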

Claims (5)

1. Arrangement of hardware for generating sound signals (speech) from arbitrary characters and shapes at varying distances, with
  • a) first means, suitable for handling, for photographing existing characters and shapes and storing them digitally;
  • b) second means for receiving the digitized image stored by the first means and for recognizing letters, digits and known shapes from the image information;
  • c) third means for converting the letters, digits and known shapes obtained from the second means into acoustic signals (speech),
characterized in that, as the first means,
  • a1. a camera with a high-resolution CCD sensor and, where appropriate, an autofocus system is used, which is connected to the second means, and
  • a2. a single release button is sufficient to carry out the complete operation from capture to speech output.
2. Arrangement according to claim 1, characterized in that, for image digitization, either the digitizing device integrated in the CCD camera or a separate digitizing device in the downstream computer is used.
3. Arrangement according to claim 1, characterized in that, in the second means, a microcomputer evaluates the digital signals from the camera and processes them through to speech output via loudspeaker or headphones.
4. Arrangement according to claim 1, characterized in that software is integrated in the second means which is able to convert the signals supplied by the CCD sensor into acoustic signals and operates without separate control by the user, the user being able to influence the program sequence where necessary.
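The single release button of claim 1 and the two digitization options of claim 2 can be sketched as follows. This is only an illustrative model under stated assumptions: the class `ReadingDevice`, its fields, the 256-level quantization and the stubbed recognizer are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Sequence


def digitize(analog: Sequence[float], levels: int = 256) -> List[int]:
    """Quantize analog sensor values in [0.0, 1.0] to integer levels (assumed 8-bit)."""
    top = levels - 1
    return [min(top, max(0, round(v * top))) for v in analog]


@dataclass
class ReadingDevice:
    read_sensor: Callable[[], Sequence[float]]  # camera stub
    camera_digitizes: bool                      # True: camera already delivers integers
    recognize: Callable[[List[int]], str]       # stand-in for character recognition
    speak: Callable[[str], None]                # stand-in for loudspeaker/headphones
    log: List[str] = field(default_factory=list)

    def on_button_press(self) -> str:
        """One press runs the whole chain from capture to speech output."""
        raw = self.read_sensor()
        if self.camera_digitizes:
            pixels = [int(v) for v in raw]
            self.log.append("digitized in camera")
        else:
            pixels = digitize(raw)
            self.log.append("digitized in computer")
        text = self.recognize(pixels)
        self.speak(text)
        return text
```

Keeping the digitization choice behind a single flag means the button handler stays the same for both hardware variants.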
DE19609052A 1996-03-08 1996-03-08 Speech generation device for character recognition Withdrawn DE19609052A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
DE19609052A DE19609052A1 (en) 1996-03-08 1996-03-08 Speech generation device for character recognition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
DE19609052A DE19609052A1 (en) 1996-03-08 1996-03-08 Speech generation device for character recognition

Publications (1)

Publication Number Publication Date
DE19609052A1 (en) 1997-09-18

Family

ID=7787661

Family Applications (1)

Application Number Title Priority Date Filing Date
DE19609052A Withdrawn DE19609052A1 (en) 1996-03-08 1996-03-08 Speech generation device for character recognition

Country Status (1)

Country Link
DE (1) DE19609052A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19815073A1 (en) * 1998-04-05 1999-10-14 Gfal Sachsen Ev PC audio tactile information system for the blind and those with vision problems
DE10157921B4 (en) * 2001-11-26 2004-06-24 Hub, Andreas, Dr. Portable video camera
GB2405018A (en) * 2004-07-24 2005-02-16 Photolink Text to speech for electronic programme guide
GB2415079A (en) * 2004-06-09 2005-12-14 Darren Raymond Taylor Portable OCR reader which produces synthesised speech output
EP2772829A1 (en) * 2013-02-28 2014-09-03 King Saud University System for enabling a visually impaired or blind person to use an input device having at least one key

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4811400A (en) * 1984-12-27 1989-03-07 Texas Instruments Incorporated Method for transforming symbolic data
US4841575A (en) * 1985-11-14 1989-06-20 British Telecommunications Public Limited Company Image encoding and synthesis
DE3901023A1 (en) * 1989-01-14 1990-07-19 Ulrich Dipl Ing Ritter Reading device for blind or visually impaired persons with a scanner reading normal script
DE8916023U1 (en) * 1989-09-19 1992-12-03 Medvey, Bela, 8255 Schwindegg dictionary
DE4123465A1 (en) * 1991-07-16 1993-01-21 Bernd Kamppeter Text-to-speech converter using optical character recognition - reads scanned text into memory for reproduction by loudspeaker or on video screen at discretion of user
DE9217643U1 (en) * 1992-04-29 1993-06-03 Siwoff, Ronald, Bridgewater, N.J. Video glasses
DE4339997A1 (en) * 1993-11-24 1995-06-01 Hossein Zahedi Combined radio cassette and obstacle locator for blind people
DE4400021A1 (en) * 1994-01-03 1995-07-13 Andreas Dante Identifying colours of document original for colour blind persons
US5444486A (en) * 1992-03-30 1995-08-22 Elmo Co., Ltd. Portable image input equipment

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19815073A1 (en) * 1998-04-05 1999-10-14 Gfal Sachsen Ev PC audio tactile information system for the blind and those with vision problems
DE19815073C2 (en) * 1998-04-05 2000-04-27 Gfal Sachsen Ev Innovative Tec Information system, especially for the blind
DE10157921B4 (en) * 2001-11-26 2004-06-24 Hub, Andreas, Dr. Portable video camera
GB2415079A (en) * 2004-06-09 2005-12-14 Darren Raymond Taylor Portable OCR reader which produces synthesised speech output
GB2405018A (en) * 2004-07-24 2005-02-16 Photolink Text to speech for electronic programme guide
GB2405018B (en) * 2004-07-24 2005-06-29 Photolink Electronic programme guide comprising speech synthesiser
EP2772829A1 (en) * 2013-02-28 2014-09-03 King Saud University System for enabling a visually impaired or blind person to use an input device having at least one key

Legal Events

Date Code Title Description
OM8 Search report available as to paragraph 43 lit. 1 sentence 1 patent law
8139 Disposal/non-payment of the annual fee