
CN101996160B - Method and system for processing script data - Google Patents


Info

Publication number
CN101996160B
Authority
CN
China
Prior art keywords
font
index
data
character
encoding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 200910090817
Other languages
Chinese (zh)
Other versions
CN101996160A (en)
Inventor
丁力
张磊
仇睿恒
王毅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
New Founder Holdings Development Co ltd
Peking University
Peking University Founder Research and Development Center
Original Assignee
BEIDA FANGZHENG TECHN INST Co Ltd BEIJING
Peking University
Peking University Founder Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIDA FANGZHENG TECHN INST Co Ltd BEIJING, Peking University, Peking University Founder Group Co Ltd
Priority to CN 200910090817
Publication of CN101996160A
Application granted
Publication of CN101996160B
Expired - Fee Related
Anticipated expiration

Landscapes

  • Controls And Circuits For Display Device (AREA)

Abstract

The invention discloses a method and system for processing font data, belonging to the technical field of fonts. Existing font data is large, and the logic for using it is complex, slow and inefficient. The method and system of the invention first establish a direct mapping between character encodings and glyph indexes, record it in a character encoding and glyph index mapping table, and then delete the redundant data in the font data. When the font is used, the character encoding and glyph index mapping table is queried to obtain the glyph index of a character, and the glyph description data of the character is then obtained from the font data according to the glyph index. After font data has been processed by the method and system of the invention, its data volume is reduced and it can be used more efficiently. The invention is particularly suitable for documents in which text information and font data are integrated, or for font data attached to document reading software.

Description

A method and system for processing font data
Technical field
The invention belongs to the technical field of fonts and specifically relates to a method and system for processing font data. It is particularly suitable for documents in which text information and font data are integrated, or for font data attached to document reading software.
Background art
An electronic document contains elements such as graphics, pictures, tables, formulas and multilingual text. Text is the principal element expressing document content when the document is stored and displayed, and it accounts for the largest share of the document. Font data is a resource that stores the glyph description data of a set of characters. When a document is displayed, the glyph data in the corresponding font data is rendered, according to the text information in the document, into an image or a set of paths and shown on the computer screen or output to a printer.
Type1, in full PostScript Type1, is a vector font standard proposed by Adobe in 1985. Because the standard is based on the PostScript page description language (PDL), and PDL is the preferred print description language of high-end printers, Type1 quickly became popular. However, Type1 is not an open font format, and Adobe charged high licensing fees to companies that used it.
TrueType is a mathematical font description technology. It describes character outlines with mathematical functions and contains instructions for glyph construction, color filling, numerical description functions, flow and condition control, grid-fitting control, and additional hinting control. TrueType describes the outline of a glyph with quadratic B-spline curves and straight lines. Its characteristics are as follows: a TrueType font can be used both for printing and for screen display; because glyphs are described by instructions, they are independent of resolution and are always output at the resolution of the printer. Whether enlarged or reduced, characters remain smooth and free of jagged edges. Compared with PostScript fonts, however, the quality is slightly lower; in particular, very small text is not rendered very clearly.
OpenType, also called the Type 2 font, is likewise an outline font and is more powerful than TrueType. Its most obvious benefit is that PostScript fonts can be embedded in TrueType software. It also supports multiple platforms and very large character sets, and it provides copyright protection. It can be regarded as a superset of Type1 and TrueType.
The main advantages of OpenType are as follows:
1) enhanced cross-platform capability;
2) better support for the international character sets defined by the Unicode standard;
3) support for advanced typographic control;
4) smaller generated documents;
5) support for adding a digital signature to the character set, which guarantees the integrity of the file.
The OpenType standard also defines the file name suffixes of OpenType files. An OpenType file containing a TrueType font has the suffix .ttf, and a file containing a PostScript font has the suffix .OTF. A font collection file containing a series of TrueType fonts has the suffix .TTC.
Unicode (also known as the universal code or single code) is a character encoding used on computers. It assigns a unified and unique binary code to every character of every language, in order to meet the requirements of cross-language, cross-platform text conversion and processing.
To present the text content of a document faithfully in its original form, attribute information such as the color, font and size that the user set for the text must be preserved. To guarantee the same output on any system, the font data and the text information must be integrated into a whole. The font data then needs to be processed. In the prior art, part of the redundant information is removed from the font data, mainly glyph description data, for example the glyph descriptions in the glyf table of an OpenType font. Because the other data is left unprocessed, the processed font is used in exactly the same way as a complete font.
The prior art has the following shortcomings:
1. The logic for using the font is complex. Because font data processed by the prior art is used in the same way as an ordinary font, the mapping table to use must be determined at use time according to the type of the current character encoding. For example, OpenType has multiple cmap tables; the position of the glyph description data can be determined only after one or more lookups, and some mapping tables additionally require the code to be converted one or more times before use.
2. Some redundant data remains in the font data. Because only part of the glyph description data is removed, the font still contains information unrelated to the environment in which it is used, such as some of the information in the name table and the cmap table of OpenType.
Summary of the invention
In view of the defects in the prior art, the object of the present invention is to provide a method and system for processing font data. After font data has been processed by the method and system, it can be used more efficiently.
To achieve the above object, the technical solution adopted by the present invention is as follows:
A method for processing font data, in which the correspondence between character encodings and glyph indexes is first established and recorded in a character encoding and glyph index mapping table;
when the font data is used, the glyph index corresponding to a character encoding is obtained from the character encoding and glyph index mapping table, and the glyph description data of the character is then obtained from the font data according to that glyph index.
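The two-step lookup described above can be sketched as follows. This is only an illustration under stated assumptions: the patent does not prescribe a storage format, so the dictionaries `table` and `glyphs` and the function `lookup_glyph_data` are hypothetical stand-ins for the mapping table and the font data.

```python
from typing import Dict, Optional

# Hypothetical in-memory stand-ins; the patent does not fix a storage format.
CodeToGlyph = Dict[int, int]    # character code (Unicode code point) -> glyph index
GlyphStore = Dict[int, bytes]   # glyph index -> glyph description data


def lookup_glyph_data(code: int, table: CodeToGlyph, glyphs: GlyphStore) -> Optional[bytes]:
    """Two-step lookup: character code -> glyph index -> glyph description data."""
    glyph_index = table.get(code)
    if glyph_index is None:
        return None  # the character is not covered by the processed font
    return glyphs.get(glyph_index)


# Toy data for illustration only.
table = {0x41: 1, 0x42: 2}
glyphs = {1: b"outline-A", 2: b"outline-B"}
print(lookup_glyph_data(0x41, table, glyphs))  # b'outline-A'
```

Because the table is keyed directly by character code, no encoding-type decision or cmap parsing is needed at lookup time.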
In the method for processing font data described above, the character encoding refers to a standard encoding of characters, including Unicode encoding and GBK encoding. If the character encoding is not Unicode, it is converted into Unicode.
In the method for processing font data described above, the process of establishing the correspondence between character encodings and glyph indexes comprises the following steps:
(1) obtaining and identifying the character encoding;
(2) parsing the font data and obtaining from it the correspondence between character encodings and glyph indexes;
(3) generating the character encoding and glyph index mapping table.
In step (2) of the method for processing font data described above, the font data is parsed and the correspondence between character encodings and glyph indexes is obtained as follows: according to the platform on which the text is used and the type of the character encoding, one or more mapping tables from which the correspondence between character encodings and glyph indexes can ultimately be obtained are located in the font data, and the glyph index corresponding to each character encoding is then obtained from the mapping table or tables found.
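As an illustration of step (2), the sketch below picks a mapping table by platform and encoding type and then reads a glyph index from it. The `CmapSubtable` class, the `find_subtable` helper and the sample data are hypothetical; only the ID values (platform 3 for Windows, encoding 1 for Unicode BMP, encoding 0 for Symbol) follow the OpenType cmap conventions.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Sequence, Tuple


@dataclass
class CmapSubtable:
    platform_id: int         # e.g. 3 = Windows
    encoding_id: int         # on platform 3: 1 = Unicode BMP, 0 = Symbol
    mapping: Dict[int, int]  # character code -> glyph index


def find_subtable(
    subtables: Sequence[CmapSubtable],
    preferences: Sequence[Tuple[int, int]],
) -> Optional[CmapSubtable]:
    """Return the first subtable that matches the preference list, in preference order."""
    for wanted in preferences:
        for subtable in subtables:
            if (subtable.platform_id, subtable.encoding_id) == wanted:
                return subtable
    return None


# Hypothetical font with a Macintosh subtable and a Windows Unicode subtable.
subtables = [
    CmapSubtable(1, 0, {0x41: 5}),
    CmapSubtable(3, 1, {0x41: 5, 0x89C4: 1}),
]
chosen = find_subtable(subtables, [(3, 1), (3, 0)])
glyph_index = chosen.mapping[0x89C4] if chosen else None
```

This selection is done once while the mapping table is being generated, which is what allows it to be skipped each time the font is used.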
Alternatively, in the method for processing font data described above, the process of establishing the correspondence between character encodings and glyph indexes comprises the following steps:
(1) parsing the font data and finding one or more mapping tables from which the correspondence between character encodings and glyph indexes can ultimately be obtained;
(2) determining whether the character encoding used by the font is Unicode and, if it is not, converting it into Unicode;
(3) extracting, from the mapping table or tables found in step (1), the glyph index corresponding to each character encoding and generating the character encoding and glyph index mapping table, which records the correspondence between the Unicode code of each character and its glyph index.
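Steps (2) and (3) can be sketched as re-keying a native mapping by Unicode code point. The functions `gbk_to_unicode` and `build_unicode_table` are hypothetical names, and the conversion relies on Python's built-in `gbk` codec as a stand-in for whatever conversion an implementation would actually use.

```python
from typing import Callable, Dict


def gbk_to_unicode(code: int) -> int:
    """Convert one GBK code (one or two bytes, big-endian) to a Unicode code point."""
    raw = code.to_bytes(2, "big") if code > 0xFF else bytes([code])
    return ord(raw.decode("gbk"))


def build_unicode_table(
    source_map: Dict[int, int],
    to_unicode: Callable[[int], int] = lambda code: code,
) -> Dict[int, int]:
    """Re-key a (native code -> glyph index) map by Unicode code point."""
    return {to_unicode(code): glyph for code, glyph in source_map.items()}


# Hypothetical font whose mapping table is keyed by GBK codes.
gbk_code = int.from_bytes("规".encode("gbk"), "big")
table = build_unicode_table({gbk_code: 1}, to_unicode=gbk_to_unicode)
```

When the font already uses Unicode, the default identity converter leaves the keys unchanged.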
In the method for processing font data described above, after the correspondence between character encodings and glyph indexes has been established, the redundant data in the font data is deleted.
A system for processing font data comprises a font processing device (2) and a font application device (3).
The font processing device (2) comprises a glyph index acquisition module (22) for parsing font data and obtaining the correspondence between character encodings and glyph indexes, and a mapping table generation module (23) for generating the character encoding and glyph index mapping table, which records the correspondence between character encodings and glyph indexes.
The font application device (3) comprises a mapping table parsing module (31) for parsing the character encoding and glyph index mapping table and obtaining the glyph index corresponding to a character encoding or the character encoding corresponding to a glyph index, and a glyph description data acquisition module (32) for obtaining the glyph description data of a character from the font data according to the glyph index.
In the system for processing font data described above, the font processing device (2) further comprises a character encoding acquisition and identification module (21) for obtaining and identifying the character encoding; when the character encoding is a non-standard encoding, this module is also used to convert it into a standard encoding.
In the system for processing font data described above, the font processing device (2) further comprises a redundant data deletion module (24) for deleting the redundant data in the font data.
Compared with the prior art, the method and system of the present invention have the following advantages:
(1) The logic for using the font is simple and fast. A mapping table recording the correspondence between character encodings and glyph indexes is generated in advance, which removes the need to determine the character encoding type and parse the cmap table every time the font is used. The character encoding and glyph index mapping table is simple to use and avoids the complex logic of repeatedly searching the cmap table, so glyph indexes are located faster and the font data is used more quickly.
(2) More redundant information is removed, making the font data smaller. With the character encoding and glyph index mapping table in place, redundant data such as descriptions of the usage environment can be removed from the font data, for example the descriptions for different platforms and different languages in the name table of OpenType, and some redundant mapping tables in the cmap table.
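The kind of pruning named in advantage (2) might look like the sketch below. The simplified font model (name records as tuples, cmap subtables keyed by platform and encoding IDs) and the function `strip_redundant` are assumptions made for illustration; real name and cmap tables are binary structures that a font library would parse. The language IDs 0x0409 and 0x0804 are the Windows values for US English and PRC Chinese.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical simplified font model; real table layouts are not reproduced here.
NameRecord = Tuple[int, int, str]             # (platform_id, language_id, text)
CmapTables = Dict[Tuple[int, int], Dict[int, int]]  # (platform_id, encoding_id) -> mapping


def strip_redundant(
    name_records: List[NameRecord],
    cmap_subtables: CmapTables,
    keep_platform: int,
    keep_language: int,
    keep_cmaps: Set[Tuple[int, int]],
) -> Tuple[List[NameRecord], CmapTables]:
    """Keep name records for one platform/language and only the listed cmap subtables."""
    kept_names = [
        record for record in name_records
        if record[0] == keep_platform and record[1] == keep_language
    ]
    kept_cmaps = {key: sub for key, sub in cmap_subtables.items() if key in keep_cmaps}
    return kept_names, kept_cmaps


# Toy data: three name records and two cmap subtables.
names = [(3, 0x0409, "Sample"), (3, 0x0804, "Sample"), (1, 0, "Sample")]
cmaps = {(3, 1): {0x41: 5}, (1, 0): {0x41: 5}}
kept_names, kept_cmaps = strip_redundant(names, cmaps, 3, 0x0409, keep_cmaps=set())
```

Passing an empty `keep_cmaps` set models the case where the separately stored mapping table fully replaces the cmap subtables; which subtables are actually redundant is a decision the source leaves open.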
Description of drawings
Fig. 1 is a structural block diagram of the system described in Embodiment 1;
Fig. 2 is a structural block diagram of the system described in Embodiment 2;
Fig. 3 is a flow chart of processing and using font data with the system shown in Fig. 1;
Fig. 4 is a flow chart of processing font data in Embodiment 1;
Fig. 5 is a flow chart of using font data in Embodiment 1.
Embodiments
The core idea of the present invention is as follows: the font data in an existing document in which text information and font data are integrated, or an existing complete set of font data, is processed so that a direct mapping between character encodings and glyph indexes is established and recorded in a character encoding and glyph index mapping table, after which the redundant data in the font data is deleted. Here, font data refers to the data in a font file and includes all data needed to describe the font, such as the name, the copyright, the glyph description data, and one or more mapping tables recording the correspondence between glyph description data and character encodings. A glyph index indicates the position of glyph description data within the font data. When the font is used, the character encoding and glyph index mapping table is queried to obtain the glyph index of a character, and the glyph description data of the character is then obtained from the font data according to the glyph index.
The present invention is described below with reference to the embodiments and the accompanying drawings.
Embodiment 1
This embodiment takes as an example the processing of the font data in a document in which character encodings and font data are integrated.
Fig. 1 shows the structure of the system described in this embodiment. The system comprises a font processing device 2 and a font application device 3.
The font processing device 2 comprises a character encoding acquisition and identification module 21, a glyph index acquisition module 22 and a mapping table generation module 23. The character encoding acquisition and identification module 21 obtains and identifies the character encoding in the document 1; when the character encoding is a non-standard encoding, the module also converts it into a standard encoding. The glyph index acquisition module 22 parses the font data in the document 1 and obtains the correspondence between character encodings and glyph indexes, where a glyph index indicates the position of glyph description data within the font data. The mapping table generation module 23 generates the character encoding and glyph index mapping table, which records the correspondence between character encodings and glyph indexes.
The font application device 3 comprises a mapping table parsing module 31 and a glyph description data acquisition module 32. The mapping table parsing module 31 parses the character encoding and glyph index mapping table and obtains the glyph index corresponding to a character encoding or the character encoding corresponding to a glyph index. The glyph description data acquisition module 32 obtains glyph description data from the font data according to the glyph index.
In addition, to delete the redundant data in the font data and reduce its data volume, the font processing device 2 further comprises a redundant data deletion module 24. After the character encoding and glyph index mapping table has been generated, the redundant data deletion module 24 can delete the redundant data in the font data.
Fig. 3 shows the flow in which the system shown in Fig. 1 processes font data and then uses the processed font data. It comprises the font processing device 2 processing the font data and the font application device 3 using the font data.
The process by which the font processing device 2 processes the font data comprises the following steps:
(1) The character encoding acquisition and identification module 21 obtains and identifies the character encoding in the document 1.
The character encoding may be a standard encoding, such as Unicode or GBK, or a non-standard encoding. If it is a non-standard encoding, it must be converted into a standard encoding.
(2) The glyph index acquisition module 22 parses the font data and obtains from it the correspondence between character encodings and glyph indexes.
First, according to the platform on which the text is used and the type of the character encoding, one or more mapping tables from which the correspondence between character encodings and glyph indexes can ultimately be obtained are located in the font data; the glyph index corresponding to each character encoding is then obtained from the mapping table or tables found.
(3) The mapping table generation module 23 generates the character encoding and glyph index mapping table, which records the correspondence between character encodings and glyph indexes.
After the character encoding and glyph index mapping table has been generated, the redundant data deletion module 24 deletes the redundant data in the font data, such as the descriptions for different platforms and different languages in the name table of OpenType, and some redundant mapping tables in the cmap table.
The process by which the font application device 3 uses the font data processed by the font processing device 2 comprises the following steps:
(4) The mapping table parsing module 31 parses the character encoding and glyph index mapping table and obtains a character encoding or a glyph index. The character encoding and glyph index mapping table can be used in the following two ways:
1. given a character encoding, parse the character encoding and glyph index mapping table and look up the glyph index corresponding to that character encoding;
2. given a glyph index, parse the character encoding and glyph index mapping table and look up the character encoding corresponding to that glyph index.
(5) The glyph description data acquisition module 32 obtains the glyph description data of the character from the font data according to the glyph index.
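The two ways of using the mapping table in step (4) can be sketched as a forward and a reverse lookup over the same table. The function names are hypothetical, and returning a list from the reverse lookup is a design assumption that allows for several codes sharing one glyph.

```python
from typing import Dict, List, Optional


def code_to_glyph(table: Dict[int, int], code: int) -> Optional[int]:
    """Usage 1: character code -> glyph index (None if the code is not in the table)."""
    return table.get(code)


def glyph_to_codes(table: Dict[int, int], glyph_index: int) -> List[int]:
    """Usage 2: glyph index -> all character codes that map to it, in ascending order."""
    return sorted(code for code, glyph in table.items() if glyph == glyph_index)


# Toy table: two codes share glyph 1.
table = {0x89C4: 1, 0x683C: 2, 0x0041: 1}
forward = code_to_glyph(table, 0x683C)
reverse = glyph_to_codes(table, 1)
```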
The processing and use of font data described above are illustrated below, taking as an example the two characters "规格" ("specification") in a font embedded in a PDF document. As shown in Fig. 4, the character encoding acquisition and identification module 21 first obtains and identifies the character codes of "规格", which are 0x21 and 0x22 respectively. Because this is an unknown, that is, non-standard, encoding, it must first be converted into a standard encoding. The Unicode codes of "规格" can be obtained from the ToUnicode table in the PDF document and are 0x89C4 and 0x683C respectively.
Because the font is used on the Windows platform and the encoding type is Unicode, the font data is searched for the cmap table (character mapping table) with Platform ID=3 (Windows) and Encoding ID=1 (Unicode BMP (UCS-2)). The glyph index acquisition module 22 parses the font data and finds that it contains a cmap table, but not one with Platform ID=3 and Encoding ID=1. In this case, the cmap table in segment mapping format (format 4) is used, and the codes 0x21 and 0x22 of "规格" are mapped directly into the range 0xF000 to 0xF08F: code 0x21 is mapped to 0xF021 and code 0x22 to 0xF022. The segment-mapping cmap table is then searched, and the glyph indexes corresponding to 0xF021 and 0xF022 are obtained as 1 and 2 respectively.
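The remapping and segment lookup in this example can be sketched as follows. The segment values are an assumption chosen so that 0xF021 and 0xF022 resolve to glyph indexes 1 and 2 as stated above; the source does not give the actual segment data. The lookup covers only the simple case of a format 4 segment in which the glyph index is the code plus a delta, modulo 65536.

```python
from typing import List, Optional, Tuple

SYMBOL_BASE = 0xF000  # start of the target range used in the example above

Segment = Tuple[int, int, int]  # (start_code, end_code, id_delta)


def to_symbol_code(code: int) -> int:
    """Map a raw code such as 0x21 into the 0xF000-based range (0x21 -> 0xF021)."""
    return SYMBOL_BASE + code


def segment_lookup(segments: List[Segment], code: int) -> Optional[int]:
    """Simplified segment lookup: glyph index = (code + id_delta) mod 65536."""
    for start, end, id_delta in segments:
        if start <= code <= end:
            return (code + id_delta) & 0xFFFF
    return None  # simplification: a real font would fall back to the missing glyph


# Hypothetical single segment covering 0xF021..0xF022.
segments = [(0xF021, 0xF022, 0x0FE0)]
indexes = [segment_lookup(segments, to_symbol_code(c)) for c in (0x21, 0x22)]
```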
The mapping table generation module 23 generates the character encoding and glyph index mapping table from the Unicode codes and glyph indexes of "规格" ("specification"), as follows:
Unicode code    Glyph index
0x89C4          1
0x683C          2
After the character encoding and glyph index mapping table has been generated, the redundant data deletion module 24 deletes the redundant data in the font data.
As shown in Fig. 5, at use time the Unicode codes 0x89C4 and 0x683C of "规格" ("specification") are obtained first. The mapping table parsing module 31 then parses the character encoding and glyph index mapping table described above and obtains the glyph indexes 1 and 2 corresponding to these Unicode codes. Finally, the glyph description data acquisition module 32 uses glyph indexes 1 and 2 to obtain the corresponding glyph description data through the loca table (glyph location table) of the font data.
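The use-time flow of Fig. 5 can be sketched with the mapping table from the example and a toy loca/glyf pair. In a TrueType font, loca holds byte offsets into glyf, and glyph i occupies the bytes from `loca[i]` up to `loca[i + 1]`. The glyf contents below are placeholder bytes, not real outline data, and the helper names are hypothetical.

```python
from typing import Dict, List, Optional


def glyph_bytes(glyph_index: int, loca: List[int], glyf: bytes) -> bytes:
    """Slice one glyph's description data out of glyf using the loca offsets."""
    start, end = loca[glyph_index], loca[glyph_index + 1]
    return glyf[start:end]


def describe(code: int, table: Dict[int, int], loca: List[int], glyf: bytes) -> Optional[bytes]:
    """Unicode code -> glyph index (mapping table) -> glyph description data (loca/glyf)."""
    glyph_index = table.get(code)
    if glyph_index is None:
        return None
    return glyph_bytes(glyph_index, loca, glyf)


# Mapping table from the example; placeholder glyph data of four bytes per glyph.
table = {0x89C4: 1, 0x683C: 2}
glyf = b"\x00\x00\x00\x00" + b"GUI1" + b"GE22"
loca = [0, 4, 8, 12]
data = [describe(code, table, loca, glyf) for code in (0x89C4, 0x683C)]
```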
Embodiment 2
In this embodiment the font data is processed as a whole and attached to document reading software, so that the font resource can be used even when the font itself is not available, and multiple documents can share the same font resource.
Fig. 2 shows the structure of the system described in this embodiment. Compared with Embodiment 1, the object processed is not font data integrated in a document but the complete font data; therefore the font processing device 2 does not include a character encoding acquisition and identification module.
Taking as an example the attachment of the Eu-bx font to the Apabi Reader software as a resource, the font data is processed as follows:
(1) The glyph index acquisition module 22 parses the Eu-bx font data and obtains the cmap tables in it. The Eu-bx font has two cmap tables, one of which is the cmap table with Platform ID=3 and Specific ID=1.
(2) Because the Eu-bx font does not use standard Unicode encoding, the Eu codes must first be converted into GBK codes according to the "EUtoGBK.dat" resource and then converted into Unicode codes.
(4) The glyph indexes corresponding to the Unicode codes are extracted from the cmap table.
(5) The mapping table generation module 23 generates the character encoding and glyph index mapping table.
(6) The redundant data deletion module 24 removes the redundant data from the font.
(7) The font data processed as above is attached to the Apabi Reader software.
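The two-stage code conversion in this procedure (Eu to GBK, then GBK to Unicode) can be sketched as below. The format of "EUtoGBK.dat" is not given in the source, so the dictionary `eu_to_gbk` is a hypothetical stand-in for it, the sample Eu code 0x21 is invented, and the sketch assumes the font's mapping is keyed by Eu codes. The GBK-to-Unicode stage uses Python's built-in `gbk` codec.

```python
from typing import Dict


def gbk_to_unicode(gbk_code: int) -> int:
    """Convert one GBK code (one or two bytes, big-endian) to a Unicode code point."""
    raw = gbk_code.to_bytes(2, "big") if gbk_code > 0xFF else bytes([gbk_code])
    return ord(raw.decode("gbk"))


def build_table(eu_cmap: Dict[int, int], eu_to_gbk: Dict[int, int]) -> Dict[int, int]:
    """Re-key an (Eu code -> glyph index) map by Unicode via the Eu -> GBK table."""
    return {
        gbk_to_unicode(eu_to_gbk[eu_code]): glyph_index
        for eu_code, glyph_index in eu_cmap.items()
        if eu_code in eu_to_gbk  # codes without a GBK equivalent are skipped
    }


# Hypothetical data: Eu code 0x21 stands for the character 格, drawn by glyph 9.
eu_to_gbk = {0x21: int.from_bytes("格".encode("gbk"), "big")}
eu_cmap = {0x21: 9, 0x7F: 3}
table = build_table(eu_cmap, eu_to_gbk)
```

Skipping codes that have no GBK equivalent is one possible policy; an implementation might instead report them.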
After the character encoding and glyph index mapping table has been generated, the table can be used with a known character encoding to query the corresponding glyph index and then obtain the glyph description data; it can also be used with a known glyph index to query the corresponding character encoding.
As can be seen from the above embodiments, the present invention analyzes the character encoding and the font data to obtain glyph indexes, generates the character encoding and glyph index mapping table, and then removes the redundant data in the font data on the basis of that mapping table, which greatly reduces the data volume of the font data. In use, the glyph index can be obtained directly from the character encoding and the character encoding and glyph index mapping table, and the glyph description data can then be obtained. This simplifies the otherwise complex process of obtaining glyph description data from a character encoding and thereby speeds up the use of the text.
Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from its spirit and scope. Thus, if these changes and modifications fall within the scope of the claims of the present invention and their technical equivalents, the present invention is also intended to include them.

Claims (8)

1. A method for processing font data, characterized in that: the method first establishes the correspondence between character encodings and glyph indexes and records it in a character encoding and glyph index mapping table; the process of establishing the correspondence between character encodings and glyph indexes comprises the following steps:
(1) obtaining and identifying the character encoding;
(2) parsing the font data and obtaining from it the correspondence between character encodings and glyph indexes;
(3) generating the character encoding and glyph index mapping table;
or comprises the following steps:
1) parsing the font data and finding one or more mapping tables from which the correspondence between character encodings and glyph indexes can ultimately be obtained;
2) determining whether the character encoding used by the font is Unicode and, if it is not, converting it into Unicode;
3) extracting, from the mapping table or tables found in step 1), the glyph index corresponding to each character encoding and generating the character encoding and glyph index mapping table, which records the correspondence between the Unicode code of each character and its glyph index;
when the font data is used, the glyph index corresponding to a character encoding is obtained from the character encoding and glyph index mapping table, and the glyph description data of the character is then obtained from the font data according to that glyph index.

2. The method for processing font data according to claim 1, characterized in that: the character encoding refers to a standard encoding of the font, including Unicode encoding and GBK encoding.

3. The method for processing font data according to claim 2, characterized in that: if the character encoding is not Unicode, it is converted into Unicode.

4. The method for processing font data according to any one of claims 1 to 3, characterized in that, in step (2), the font data is parsed and the correspondence between character encodings and glyph indexes is obtained as follows: according to the platform on which the text is used and the type of the character encoding, one or more mapping tables from which the correspondence between character encodings and glyph indexes can ultimately be obtained are located in the font data, and the glyph index corresponding to each character encoding is then obtained from the mapping table or tables found.

5. The method for processing font data according to claim 1, characterized in that: after the correspondence between character encodings and glyph indexes has been established, the redundant data in the font data is deleted.

6. A system for processing font data, characterized in that the system comprises: a font processing device for establishing the correspondence between character encodings and glyph indexes and recording it in a character encoding and glyph index mapping table; and a font application device for obtaining, when the font data is used, the glyph index corresponding to a character encoding from the character encoding and glyph index mapping table and then obtaining the glyph description data of the character from the font data according to that glyph index;
wherein the font processing device comprises a character encoding acquisition and identification module for obtaining and identifying the character encoding; a glyph index acquisition module for parsing the font data and obtaining the correspondence between character encodings and glyph indexes; and a mapping table generation module for generating the character encoding and glyph index mapping table; or
the font processing device comprises a correspondence mapping table acquisition module for parsing the font data and finding one or more mapping tables from which the correspondence between character encodings and glyph indexes can ultimately be obtained; a character encoding conversion module for determining whether the character encoding used by the font is Unicode and, if it is not, converting it into Unicode; and a mapping table generation module for extracting, from the mapping table or tables found by the correspondence mapping table acquisition module, the glyph index corresponding to each character encoding and generating the character encoding and glyph index mapping table;
the character encoding and glyph index mapping table records the correspondence between the Unicode code of each character and its glyph index.

7. The system for processing font data according to claim 6, characterized in that: when the character encoding is a non-standard encoding, the character encoding acquisition and identification module is also used to convert the non-standard encoding into a standard encoding.

8. The system for processing font data according to claim 6 or 7, characterized in that: the font processing device further comprises a redundant data deletion module for deleting the redundant data in the font data.
CN 200910090817 2009-08-10 2009-08-10 Method and system for processing script data Expired - Fee Related CN101996160B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200910090817 CN101996160B (en) 2009-08-10 2009-08-10 Method and system for processing script data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200910090817 CN101996160B (en) 2009-08-10 2009-08-10 Method and system for processing script data

Publications (2)

Publication Number Publication Date
CN101996160A CN101996160A (en) 2011-03-30
CN101996160B true CN101996160B (en) 2013-01-02

Family

ID=43786330

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200910090817 Expired - Fee Related CN101996160B (en) 2009-08-10 2009-08-10 Method and system for processing script data

Country Status (1)

Country Link
CN (1) CN101996160B (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102662926B (en) * 2012-03-29 2015-09-30 常州华文文字技术有限公司 The storage and inquire method of character library
CN104424163B (en) * 2013-08-29 2017-09-22 北大方正集团有限公司 Literal processing method and system
CN105224509A (en) * 2014-05-30 2016-01-06 北大方正集团有限公司 A kind of method and device generating font format
CN104965834B (en) * 2014-06-25 2018-09-28 腾讯科技(深圳)有限公司 Font display methods and device
CN105488471B (en) * 2015-11-30 2019-03-29 北大方正集团有限公司 A kind of font recognition methods and device
CN106502968A (en) * 2016-10-12 2017-03-15 北京奇虎科技有限公司 The method and device of data processing
CN107908377A (en) * 2017-11-16 2018-04-13 理光图像技术(上海)有限公司 The analytic method of font coding information and its device and application in print language
CN110765072A (en) * 2018-07-27 2020-02-07 珠海金山办公软件有限公司 Method and device for saving file
CN111353114A (en) * 2018-12-20 2020-06-30 深圳市茁壮网络股份有限公司 Font display processing method and device
CN109887197A (en) * 2018-12-21 2019-06-14 福建新大陆支付技术有限公司 A kind of vector fonts Method of printing and POS machine based on POS machine
CN110222617A (en) * 2019-05-29 2019-09-10 四川译讯信息科技有限公司 A kind of pdf document restorative procedure and system
CN110196964B (en) * 2019-06-05 2020-12-29 北大方正集团有限公司 Font format conversion method and device
CN110825925B (en) * 2019-11-04 2023-05-26 沈华伟 A Fast Way to Do Multiple String Matching
CN110955747B (en) * 2019-11-29 2023-03-14 北大方正集团有限公司 Method and device for modifying complex text font
CN113573125B (en) * 2020-04-29 2023-07-25 深圳Tcl新技术有限公司 Subtitle rendering method and system based on SVG word stock
CN119294353A (en) * 2024-10-10 2025-01-10 深圳昆仑技创科技开发有限责任公司 A font non-caching display method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1196534A (en) * 1997-04-15 1998-10-21 英业达股份有限公司 Text Display Method in Windows Environment
US7228501B2 (en) * 2002-11-01 2007-06-05 Microsoft Corporation Method for selecting a font

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Wang Yu et al. Techniques for accessing TrueType glyph data in Windows. 《小型微型计算机系统》. 1997, Vol. 18 (No. 11), 75-81. *

Also Published As

Publication number Publication date
CN101996160A (en) 2011-03-30

Similar Documents

Publication Publication Date Title
CN101996160B (en) Method and system for processing script data
US8433997B1 (en) Efficient portable document
US7636885B2 (en) Method of determining Unicode values corresponding to the text in digital documents
CN101441713B (en) Optical character recognition method and apparatus of PDF document
US7623710B2 (en) Document content and structure conversion
CN109492199B (en) PDF file conversion method based on OCR pre-judgment
CN101782896B (en) PDF character extraction method combined with OCR technology
US9158742B2 (en) Automatically detecting layout of bidirectional (BIDI) text
CN101364216B (en) Method and device for displaying electronic book documentary on mobile terminal
CN104462068B (en) Character conversion system and character conversion method
CN102081594A (en) Equipment and method for extracting enclosing rectangles of characters from portable electronic documents
CN1952955A (en) Printer having a direct printing function and printing method thereof
JP2014013534A (en) Document processor, image processor, image processing method and document processing program
CN111797595A (en) A method and device for generating OFD layout page based on XML template
CN111176650A (en) Parser generation method, search method, server, and storage medium
CN101187939A (en) Method and device for embedding font files
CN116521621A (en) Data processing method and device, electronic equipment and storage medium
CN102063416B (en) Method and system for embedding double-byte fonts into PDF file
CN105335356A (en) Semantic recognition-oriented paper translation method and translation pen device
CN102063415B (en) Method and system for embedding single-byte fonts in PDF (Portable Document Format) file
US8930808B2 (en) Processing rich text data for storing as legacy data records in a data storage system
Hasnat et al. An open source tesseract based optical character recognizer for bangla script
CN113723063B (en) Method for converting RTF (real time transport format) into HTML (hypertext markup language) and realizing effect in PDF (portable document format) file
CN102467490B (en) Method and device for processing font data
CN103853849A (en) Method for establishing and drawing high-compression reflowable file

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220908

Address after: 3007, Hengqin international financial center building, No. 58, Huajin street, Hengqin new area, Zhuhai, Guangdong 519031

Patentee after: New founder holdings development Co.,Ltd.

Patentee after: PEKING University FOUNDER R & D CENTER

Patentee after: Peking University

Address before: 100871, fangzheng building, 298 Fu Cheng Road, Beijing, Haidian District

Patentee before: PEKING UNIVERSITY FOUNDER GROUP Co.,Ltd.

Patentee before: PEKING University FOUNDER R & D CENTER

Patentee before: Peking University

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20130102
