JP6790712B2

JP6790712B2 - Shape extraction program, shape extraction method and shape extraction device

Info

Publication number: JP6790712B2
Application number: JP2016205442A
Authority: JP
Inventors: 隆寛福田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2016-10-19
Filing date: 2016-10-19
Publication date: 2020-11-25
Anticipated expiration: 2036-10-19
Also published as: JP2018067153A

Description

本発明は、形状抽出プログラム、形状抽出方法及び形状抽出装置に関する。 The present invention relates to a shape extraction program, a shape extraction method, and a shape extraction device.

新聞等の編集作業においては、予め編集する情報の内容等に応じて各ページの所望の位置に、文字や画像を配置するためのフレームやボックスと呼ばれる所望のサイズの枠を設定した後、枠内に文字や画像を流し込むことにより、作業が行われることが多い。また、最近では、フレームやボックスが予め所望の位置に配置されたひな形を用意しておき、選択して作業に利用することで、省力化を図ることも行われている。 In the editing work of newspapers, etc., after setting a frame of a desired size called a frame or box for arranging characters and images at a desired position on each page according to the content of information to be edited in advance, the frame Work is often done by pouring characters and images inside. Further, recently, labor saving has been achieved by preparing a template in which a frame or a box is arranged at a desired position in advance, selecting the template, and using it for work.

なお、特許文献１等には、文書画像処理を自動的に行う技術について開示されている。 In addition, Patent Document 1 and the like disclose a technique for automatically performing document image processing.

特開平１−１８３７８４号公報Japanese Unexamined Patent Publication No. 1-183784

しかしながら、実際に製作した新聞等を人手でひな形化する場合、製作した新聞等から記事が配置されている領域を特定し、特定した領域の形状を抽出する必要があり、手間と労力がかかる。 However, when a newspaper or the like actually produced is manually modeled, it is necessary to identify the area where the article is arranged from the produced newspaper or the like and extract the shape of the specified area, which takes time and effort. ..

１つの側面では、本発明は、文書情報から文字が配置された領域の形状を簡易に抽出することが可能な形状抽出プログラム、形状抽出方法及び形状抽出装置を提供することを目的とする。 In one aspect, it is an object of the present invention to provide a shape extraction program, a shape extraction method, and a shape extraction device capable of easily extracting the shape of a region in which characters are arranged from document information.

一つの態様では、形状抽出プログラムは、文字を含む文書情報から、区切り線、前記文字のフォント、フォントサイズ、前記文字の色の１又は複数の組み合わせに基づいて部分領域を特定し、特定した前記部分領域から文字を除いた形状を抽出し、特定した前記部分領域に配置された文字を形態素解析して抽出した名詞を属性情報とし、該属性情報を抽出した前記形状に対応づけて記憶部に記憶させる、処理をコンピュータに実行させるプログラムである。 In one embodiment, the shape extraction program identifies and identifies a partial area from document information including characters based on a separator line, the font of the characters, a font size, and one or more combinations of the colors of the characters. The shape obtained by removing the characters from the partial area is extracted , the characters arranged in the specified partial area are morphologically analyzed, and the extracted nomenclature is used as attribute information, and the attribute information is associated with the extracted shape and stored in the storage unit. It is a program that causes a computer to perform processing that is stored .

文書情報から文字が配置された領域の形状を簡易に抽出することができる。 The shape of the area where characters are arranged can be easily extracted from the document information.

一実施形態に係る新聞製作システムの構成を概略的に示す図である。It is a figure which shows roughly the structure of the newspaper production system which concerns on one Embodiment. 図２（ａ）は、情報処理装置のハードウェア構成を示す図であり、図２（ｂ）は、ユーザ端末のハードウェア構成を示す図である。FIG. 2A is a diagram showing a hardware configuration of an information processing device, and FIG. 2B is a diagram showing a hardware configuration of a user terminal. 情報処理装置及びユーザ端末の機能ブロック図である。It is a functional block diagram of an information processing apparatus and a user terminal. 蓄積対象テーブルの一例を示す図である。It is a figure which shows an example of the storage target table. ひな形ＤＢのデータ構造の一例を示す図である。It is a figure which shows an example of the data structure of a template DB. ひな形生成処理の一例を示すフローチャートである。It is a flowchart which shows an example of a template generation process. 図６のステップＳ１２の具体的な処理を示すフローチャートである。It is a flowchart which shows the specific process of step S12 of FIG. 図７のステップＳ１２２の処理を説明するための図である。It is a figure for demonstrating the process of step S122 of FIG. ユーザが製作した新聞紙面の一例を示す図である。It is a figure which shows an example of the newspaper page produced by a user. 新聞紙面のレイアウトデータから得られる、配置領域の位置や大きさと種別を示す図である。It is a figure which shows the position, size and type of the arrangement area obtained from the layout data of a newspaper page. モジュール分割により新聞紙面の配置領域が分割された状態を太線枠にて示す図である。It is a figure which shows the state which the arrangement area of a newspaper surface is divided by a module division by a thick line frame. 書き割り化後の新聞レイアウトを示す図である。It is a figure which shows the newspaper layout after the writing division. 図１３（ａ）は、組み上がり状態（新聞発行時の状態）を示す表示画の一例であり、図１３（ｂ）は、書き割り状態（配置領域のみの状態）を示す表示画での一例である。FIG. 13A is an example of a display image showing an assembled state (state at the time of newspaper publication), and FIG. 13B is an example of a display image showing a writing division state (state of only the arrangement area). Is. ひな形提示処理の一例を示すフローチャートである。It is a flowchart which shows an example of the template presentation processing. 図１５（ａ）、図１５（ｂ）は、図１４のステップＳ５０において紙面の情報を取得するために用いる画面の例を示す図である。15 (a) and 15 (b) are diagrams showing an example of a screen used for acquiring information on the paper surface in step S50 of FIG. ひな形を提示する一覧画面の例を示す図（その１）である。It is a figure (the 1) which shows the example of the list screen which presents a model. ひな形を提示する一覧画面の例を示す図（その２）である。It is a figure (the 2) which shows the example of the list screen which presents a model. ひな形を表示部上に展開した状態を示す図である。It is a figure which shows the state which the model is expanded on the display part.

以下、新聞製作システムの一実施形態について、図１〜図１８に基づいて詳細に説明する。 Hereinafter, one embodiment of the newspaper production system will be described in detail with reference to FIGS. 1 to 18.

図１には、一実施形態に係る新聞製作システム１００の構成が概略的に示されている。図１に示すように、新聞製作システム１００は、形状抽出装置及び編集支援装置としての情報処理装置１０と、ユーザ端末６０と、レイアウトデータ格納装置７０と、を備える。新聞製作システム１００に含まれる各装置は、インターネットなどのネットワーク８０に接続されている。 FIG. 1 schematically shows the configuration of the newspaper production system 100 according to the embodiment. As shown in FIG. 1, the newspaper production system 100 includes an information processing device 10 as a shape extraction device and an editing support device, a user terminal 60, and a layout data storage device 70. Each device included in the newspaper production system 100 is connected to a network 80 such as the Internet.

情報処理装置１０は、サーバ等を含み、ユーザが製作した文書情報（ここでは、新聞紙面のデータ（レイアウトデータ））から新聞製作に利用可能なひな形を自動的に生成するとともに、ユーザ端末６０に対して適切なひな形を提供するための装置である。 The information processing device 10 includes a server and the like, and automatically generates a template that can be used for newspaper production from document information produced by the user (here, data on newspaper (layout data)), and the user terminal 60. It is a device for providing an appropriate model for.

図２（ａ）には、情報処理装置１０のハードウェア構成が示されている。図２（ａ）に示すように、情報処理装置１０は、ＣＰＵ（Central Processing Unit）９０、ＲＯＭ（Read Only Memory）９２、ＲＡＭ（Random Access Memory）９４、記憶部（ここではＨＤＤ（Hard Disk Drive））９６、ネットワークインタフェース９７、及び可搬型記憶媒体用ドライブ９９等を備えている。これら情報処理装置１０の構成各部は、バス９８に接続されている。情報処理装置１０では、ＲＯＭ９２あるいはＨＤＤ９６に格納されているプログラム（形状抽出プログラムや編集支援プログラムを含む）、或いは可搬型記憶媒体用ドライブ９９が可搬型記憶媒体９１から読み取ったプログラム（形状抽出プログラムや編集支援プログラムを含む）をＣＰＵ９０が実行することにより、図３に示す、各部の機能が実現される。なお、図３には、情報処理装置１０のＨＤＤ９６等に格納されているデータベースやテーブルについても図示されている。 FIG. 2A shows the hardware configuration of the information processing device 10. As shown in FIG. 2A, the information processing device 10 includes a CPU (Central Processing Unit) 90, a ROM (Read Only Memory) 92, a RAM (Random Access Memory) 94, and a storage unit (here, an HDD (Hard Disk Drive)). )) 96, a network interface 97, a drive 99 for a portable storage medium, and the like. Each component of the information processing device 10 is connected to the bus 98. In the information processing apparatus 10, a program stored in the ROM 92 or the HDD 96 (including a shape extraction program and an editing support program), or a program read from the portable storage medium 91 by the portable storage medium drive 99 (shape extraction program or By executing the CPU 90 (including the editing support program), the functions of each part shown in FIG. 3 are realized. Note that FIG. 3 also shows a database and a table stored in the HDD 96 or the like of the information processing apparatus 10.

図３には、情報処理装置１０及びユーザ端末６０の機能ブロック図が示されている。図３に示すように、情報処理装置１０は、ひな形生成部２０と、ひな形提示部２２として機能する。 FIG. 3 shows a functional block diagram of the information processing device 10 and the user terminal 60. As shown in FIG. 3, the information processing device 10 functions as a model generation unit 20 and a model presentation unit 22.

ひな形生成部２０は、蓄積対象テーブル３０を参照して、ユーザが製作した新聞紙面が蓄積対象か否かを判定し、蓄積対象である場合には、ユーザが製作した新聞紙面に基づいて、次回以降の新聞紙面製作に利用可能なひな形を自動生成する。ひな形生成部２０が自動生成したひな形（レイアウトデータ）は、ひな形ＤＢ３２に格納される。ここで、ひな形生成部２０は、ひな形を自動生成する際に、ユーザが製作した新聞紙面を解析する。具体的には、ひな形生成部２０は、ユーザが製作した新聞紙面の記事や見出し、画像が配置された部分領域（配置領域）の属性（配置領域が、記事、見出し、画像のいずれが配置された領域であるか）を判定する。また、各配置領域のカテゴリ（記事や見出し、画像の内容）を判定する。 The template generation unit 20 refers to the accumulation target table 30 to determine whether or not the newspaper surface produced by the user is the accumulation target, and if it is the accumulation target, the template generation unit 20 is based on the newspaper surface produced by the user. Automatically generate a template that can be used for newspaper production from the next time onward. The stationery (layout data) automatically generated by the stationery generation unit 20 is stored in the stationery DB 32. Here, the model generator 20 analyzes the newspaper surface produced by the user when automatically generating the model. Specifically, the template generation unit 20 arranges an attribute (arrangement area is an article, a headline, or an image) of a partial area (arrangement area) in which an article, a headline, or an image is arranged on a newspaper produced by a user. Whether it is the area that has been created) is determined. In addition, the category (article, headline, image content) of each arrangement area is determined.

また、ひな形生成部２０は、ユーザが製作した新聞紙面の解析結果を用いて、新聞紙面をカテゴリごとに分割してモジュール化し、各モジュールにおいて配置領域を抽出する「モジュール分割」を実行する。なお、モジュールとは、同一のテーマ（カテゴリ）に属する記事（文字）や写真等のまとまりを意味する。更に、ひな形生成部２０は、抽出した配置領域から文字や画像を削除して配置領域だけの状態にする「書き割り化」を実行し、書き割り化後の配置領域のレイアウトデータをひな形とする。 In addition, the template generation unit 20 divides the newspaper surface into modules by using the analysis result of the newspaper surface produced by the user, and executes "module division" to extract the arrangement area in each module. The module means a group of articles (characters), photographs, etc. that belong to the same theme (category). Further, the template generation unit 20 executes "writing division" in which characters and images are deleted from the extracted arrangement area to make only the arrangement area, and the layout data of the arrangement area after the writing division is used as a template. And.

ひな形提示部２２は、ユーザが新聞紙面の製作を開始する際に、ひな形ＤＢ３２を参照して、ユーザが製作しようとしている新聞紙面（編集対象の文書ページ）に適したひな形を選定し、選定したひな形の一覧画面をユーザ端末６０に送信する。また、ひな形提示部２２は、一覧画面でユーザにより選択されたひな形をユーザ端末６０に送信する。 When the user starts producing a newspaper page, the template presentation unit 22 refers to the template DB 32 and selects a template suitable for the newspaper page (document page to be edited) that the user intends to produce. , The list screen of the selected template is transmitted to the user terminal 60. In addition, the template presentation unit 22 transmits the template selected by the user on the list screen to the user terminal 60.

図４には、蓄積対象テーブル３０のデータ構造の一例が示されている。図４に示すように、蓄積対象テーブル３０は、「面名」と、「蓄積対象／対象外」のフィールドを有する。「面名」のフィールドには、面の名称（一面、二面、社会面、くらし面など）が格納される。「蓄積対象／対象外」のフィールドには、ひな形の情報を生成し、蓄積する対象であるか否か（対象／対象外）が格納される。したがって、ひな形生成部２０は、蓄積対象テーブル３０を参照することで、新聞紙面のうち、どの面のひな形を生成し、蓄積すればよいのかを判定することができる。 FIG. 4 shows an example of the data structure of the storage target table 30. As shown in FIG. 4, the storage target table 30 has fields of “face name” and “storage target / non-target”. In the "face name" field, the name of the face (one face, two faces, social face, living face, etc.) is stored. In the "accumulation target / non-target" field, template information is generated and whether or not it is a storage target (target / non-target) is stored. Therefore, the model generation unit 20 can determine which side of the newspaper surface the model should be generated and stored by referring to the storage target table 30.

図５には、ひな形ＤＢ３２の一例が示されている。ひな形ＤＢ３２は、ひな形のデータをひな形を生成する際に利用した新聞紙面の属性と対応付けて記憶するデータベースである。具体的には、ひな形ＤＢ３２は、図５に示すように「データ種別」、「作成日時」、「最終使用日時」、「使用回数」、「掲載年月日」、「掲載曜日」、「左右ページ」、「面名」、「版名」、「モジュール名」、「縦大きさ」、「横大きさ」、「記事部段数」、「レイアウト格納パス」、「表示画格納パス（書き割り）」、「表示画格納パス（組み上がり）」、「モジュール構成情報格納パス」、「記事数」、「記事行数」、「画像数」、「画像の縦横」の各フィールドを有する。 FIG. 5 shows an example of the model DB 32. The template DB 32 is a database that stores template data in association with the attributes of newspaper pages used when generating the template. Specifically, as shown in FIG. 5, the template DB 32 has "data type", "creation date", "last use date", "number of uses", "post date", "post date", and "post date". Left and right pages, "face name", "version name", "module name", "vertical size", "horizontal size", "number of article copies", "layout storage path", "display image storage path (writing) It has fields of "split)", "display image storage path (assembled)", "module configuration information storage path", "number of articles", "number of article lines", "number of images", and "vertical and horizontal images".

「データ種別」のフィールドには、「面」又は「モジュール」が格納される。「面」が格納されているデータ（行）は、新聞の面（１ページ）全体のレイアウト（ひな形）に関するデータを意味し、「モジュール」が格納されているデータ（行）は、面に含まれる各記事のレイアウト（ひな形）に関するデータを意味する。 A "face" or "module" is stored in the "data type" field. The data (row) in which the "face" is stored means the data related to the layout (template) of the entire newspaper face (1 page), and the data (row) in which the "module" is stored is in the face. It means data about the layout (template) of each contained article.

「作成日時」のフィールドには、ひな形が作成された日時の情報が格納され、「最終使用日時」のフィールドには、ひな形が最後に使用された日時の情報が格納される。「使用回数」のフィールドには、ひな形が使用された回数の情報が格納され、「掲載年月日」、「掲載曜日」のフィールドには、ひな形の作成に利用した新聞が発行された日付及び曜日が格納される。なお、「作成日時」や「最終使用日時」、「使用回数」のフィールドに格納されている情報は、あまり利用されないひな形のデータを削除するか否かを判断する判断基準として利用することができる。 The "Created date and time" field stores information on the date and time when the stationery was created, and the "Last used date and time" field stores information on the date and time when the stationery was last used. Information on the number of times the stationery was used was stored in the "Number of uses" field, and the newspaper used to create the stationery was published in the "Date of publication" and "Day of the week" fields. The date and day of the week are stored. The information stored in the "Created date", "Last used date", and "Number of uses" fields can be used as a criterion for determining whether to delete the template data that is rarely used. it can.

「左右ページ」のフィールドには、面及びフィールドが紙面の左側のページ及び右側のページのいずれに配置されたかの情報が格納され、「面名」のフィールドには、面の名称又はモジュールが配置された面の名称が格納される。「版名」のフィールドには、面及びモジュールが掲載された版の名称が格納され、「モジュール名」のフィールドには、モジュールの名称が格納される。なお、データ種別が「面」のデータ（行）においては、モジュール名は、面名と同一となる。 The "left and right page" field stores information on whether the face and the field are placed on the left page or the right page of the page, and the "face name" field contains the face name or module. The name of the face is stored. The name of the plate on which the face and the module are posted is stored in the "version name" field, and the name of the module is stored in the "module name" field. In the data (row) whose data type is "face", the module name is the same as the face name.

「縦大きさ」、「横大きさ」のフィールドには、面やモジュールの縦横の大きさが格納され、「記事部段数」のフィールドには、面において記事（広告を除く部分）が何段配置されているかの情報が格納される。記事部段数は、面が有する段数から広告の段数を差し引いた段数であるといえる。 The vertical and horizontal sizes of faces and modules are stored in the "vertical size" and "horizontal size" fields, and the number of articles (parts excluding advertisements) on the face is stored in the "article number of columns" field. Information on whether it is placed is stored. It can be said that the number of article copies is the number of stages obtained by subtracting the number of advertisement stages from the number of stages of the surface.

「レイアウト格納パス」のフィールドには、レイアウトデータ（ひな形情報）が保存されている保存先のパスが格納される。なお、レイアウトデータのファイルは、情報処理装置１０のストレージに格納される。なお、ストレージは、図２（ａ）のＨＤＤ９６等であるため、以下においては、「ストレージ９６」とも表記するものとする。例えば、レイアウトデータのファイルは、見出しや記事、画像、区切り線などが配置される配置領域のレイアウト（位置、大きさ、形状等）を管理するとともに、配置領域の属性（見出し、記事、画像、区切り線のいずれが配置されるか）を管理している。なお、本実施形態では、ストレージ９６にレイアウトデータを保存し、ひな形ＤＢ３２において保存先のパスを管理するため、ひな形ＤＢ３２及びストレージ９６においてレイアウトデータを読出し可能に記憶しているといえる。 In the "Layout storage path" field, the save destination path in which the layout data (template information) is saved is stored. The layout data file is stored in the storage of the information processing device 10. Since the storage is the HDD 96 or the like shown in FIG. 2A, it will also be referred to as “storage 96” in the following. For example, the layout data file manages the layout (position, size, shape, etc.) of the placement area where headlines, articles, images, dividing lines, etc. are placed, and also manages the attributes of the placement area (headlines, articles, images, etc.). Which of the dividing lines is placed) is managed. In this embodiment, the layout data is stored in the storage 96, and the path of the storage destination is managed in the model DB 32. Therefore, it can be said that the layout data is readable and stored in the model DB 32 and the storage 96.

「表示画格納パス（書き割り）」、「表示画格納パス（組み上がり）」のフィールドには、書き割り後の表示画（例えば、図１３（ｂ）参照）と、書き割り前の表示画（例えば、図１３（ａ）参照）を保存する保存先のパスが格納される。なお、表示画のファイルは、例えばjpeg形式のファイルであり、情報処理装置１０のストレージ９６に格納される。「モジュール構成情報格納パス」のフィールドには、モジュールの構成情報が保存されている保存先のパスが格納される。モジュールの構成情報は、モジュールに含まれるオブジェクト（見出し、記事、画像など）の識別番号のリストである。オブジェクトの識別番号を用いて、レイアウトデータのファイルを検索することで、オブジェクトの位置や大きさ、形状を取り出すことができる。なお、モジュールの構成情報のファイルについても、情報処理装置１０のストレージ９６に格納される。なお、本実施形態では、表示画やモジュール構成情報をストレージ９６に保存し、ひな形ＤＢ３２において保存先のパスを管理するため、ひな形ＤＢ３２及びストレージ９６において表示画やモジュール構成情報を読出し可能に記憶しているといえる。 In the fields of "Display image storage path (writing division)" and "Display image storage path (assembled)", the display image after writing division (see, for example, FIG. 13B) and the display image before writing division are displayed. (For example, see FIG. 13 (a)) is stored in the save destination path. The display image file is, for example, a jpeg format file and is stored in the storage 96 of the information processing device 10. In the "Module configuration information storage path" field, the save destination path in which the module configuration information is stored is stored. The module configuration information is a list of identification numbers of objects (headlines, articles, images, etc.) contained in the module. By searching the layout data file using the object identification number, the position, size, and shape of the object can be retrieved. The module configuration information file is also stored in the storage 96 of the information processing device 10. In the present embodiment, since the display image and the module configuration information are stored in the storage 96 and the save destination path is managed in the template DB 32, the display image and the module configuration information can be read in the template DB 32 and the storage 96. It can be said that I remember it.

「記事数」のフィールドには、モジュールに含まれる記事の数が格納され、「記事行数」のフィールドには、モジュールに含まれる記事の行数が格納される。また、「画像数」のフィールドには、モジュールに含まれる画像の数が格納され、「画像の縦横」のフィールドには、モジュールに含まれる画像が縦長であるか横長であるかが格納される。 The number of articles contained in the module is stored in the "number of articles" field, and the number of lines of articles contained in the module is stored in the "number of article lines" field. In addition, the number of images included in the module is stored in the "number of images" field, and whether the images included in the module are vertically long or horizontally long is stored in the "vertical and horizontal images" field. ..

図１に戻り、ユーザ端末６０は、新聞紙面を製作するユーザ（新聞社の社員等）が利用する端末である。ユーザ端末６０では、情報処理装置１０に対して、ユーザが製作しようとしている新聞紙面に関する情報を提供し、情報処理装置１０から提示されたひな形の一覧画面を表示する。また、ユーザ端末６０は、一覧画面においてユーザにより選択されたひな形の情報を情報処理装置１０に送信し、情報処理装置１０から送信されてくるひな形及び／又は情報処理装置１０にあるひな形を表示部１９３上に展開する。また、ユーザ端末６０は、ユーザが製作した新聞紙面のレイアウトデータをレイアウトデータ格納装置７０に記憶する。この場合、ユーザ端末６０は、レイアウトデータに関連付けて、ユーザが製作した新聞紙面やモジュールに関する情報（掲載年月日、掲載曜日、左右ページ、面名、版名、縦大きさ、横大きさ、記事部段数、記事数、記事行数、画像数、画像の縦横など）をレイアウトデータ格納装置７０に記憶する。 Returning to FIG. 1, the user terminal 60 is a terminal used by a user (employee of a newspaper company, etc.) who produces a newspaper page. The user terminal 60 provides the information processing device 10 with information about the newspaper page that the user intends to produce, and displays a list screen of the model presented by the information processing device 10. Further, the user terminal 60 transmits the information of the template selected by the user on the list screen to the information processing device 10, and the template and / or the template in the information processing device 10 transmitted from the information processing device 10. Is expanded on the display unit 193. Further, the user terminal 60 stores the layout data of the newspaper surface produced by the user in the layout data storage device 70. In this case, the user terminal 60 associates the layout data with information about the newspaper page and the module produced by the user (posting date, posting day of the week, left and right pages, face name, version name, vertical size, horizontal size, etc. The number of article sections, the number of articles, the number of article lines, the number of images, the vertical and horizontal directions of images, etc.) are stored in the layout data storage device 70.

図２（ｂ）には、ユーザ端末６０のハードウェア構成が示されている。図２（ｂ）に示すように、ユーザ端末６０は、ＣＰＵ１９０、ＲＯＭ１９２、ＲＡＭ１９４、記憶部（ＨＤＤ）１９６、ネットワークインタフェース１９７、表示部１９３、入力部１９５、及び可搬型記憶媒体用ドライブ１９９等を備えている。これらユーザ端末６０の構成各部は、バス１９８に接続されている。表示部１９３は液晶ディスプレイ等を含み、入力部１９５は、キーボードやマウスタッチパネル等を含む。ユーザ端末６０では、ＲＯＭ１９２あるいはＨＤＤ１９６に格納されているプログラム、或いは可搬型記憶媒体用ドライブ１９９が可搬型記憶媒体１９１から読み取ったプログラムをＣＰＵ１９０が実行することにより、図３に示す、紙面製作部６２、紙面送信部６４の機能が実現される。紙面製作部６２は、ユーザの操作に従って、新聞紙面を製作する処理を実行する。例えば、紙面製作部６２は、ユーザが配置した記事や見出し、区切り線に基づいて、新聞紙面を製作する。また、紙面製作部６２は、例えば、ユーザにより選択されたひな形に対するユーザの操作に応じて紙面を製作する。紙面送信部６４は、紙面製作部６２が製作した紙面をレイアウトデータ格納装置７０に格納する。 FIG. 2B shows the hardware configuration of the user terminal 60. As shown in FIG. 2B, the user terminal 60 includes a CPU 190, a ROM 192, a RAM 194, a storage unit (HDD) 196, a network interface 197, a display unit 193, an input unit 195, a portable storage medium drive 199, and the like. I have. Each component of the user terminal 60 is connected to the bus 198. The display unit 193 includes a liquid crystal display and the like, and the input unit 195 includes a keyboard, a mouse touch panel and the like. In the user terminal 60, when the CPU 190 executes a program stored in the ROM 192 or the HDD 196 or a program read from the portable storage medium 191 by the portable storage medium drive 199, the paper production unit 62 shown in FIG. , The function of the paper transmission unit 64 is realized. The paper surface production unit 62 executes a process of producing a newspaper surface according to a user's operation. For example, the paper production unit 62 produces a newspaper surface based on articles, headlines, and dividing lines arranged by the user. In addition, the paper surface production unit 62 produces a paper surface according to the user's operation on the template selected by the user, for example. The paper surface transmission unit 64 stores the paper surface produced by the paper surface production unit 62 in the layout data storage device 70.

レイアウトデータ格納装置７０は、データサーバ等であり、ユーザ端末６０において製作された新聞紙面のレイアウトデータや、紙面及びモジュールの情報を管理する。 The layout data storage device 70 is a data server or the like, and manages layout data of newspaper pages produced by the user terminal 60, and information on the pages and modules.

（情報処理装置１０のひな形生成処理）
次に、情報処理装置１０によるひな形生成処理について、図６のフローチャートに沿って詳細に説明する。なお、本処理が行われるタイミングは、ユーザ端末６０において新規の新聞紙面が製作され、新規新聞紙面のデータ（レイアウトデータ）がレイアウトデータ格納装置７０に格納されたタイミングであるものとする。ただし、これに限らず、例えば、新規の新聞紙面が所定数だけ製作された場合や、所定期間経過した場合（例えば１日１回）など、所定の条件を満たしたタイミングで図６の処理を実行することとしてもよい。また、ユーザや情報処理装置１０の管理者等が手動で図６の処理の実行開始を指示してもよい。 (Model generation process of information processing device 10)
Next, the model generation process by the information processing apparatus 10 will be described in detail with reference to the flowchart of FIG. It is assumed that the timing at which this process is performed is the timing at which a new newspaper page is produced on the user terminal 60 and the data (layout data) on the new newspaper page is stored in the layout data storage device 70. However, the present invention is not limited to this, and the processing of FIG. 6 is performed at the timing when a predetermined condition is satisfied, for example, when a predetermined number of new newspaper pages are produced or when a predetermined period has passed (for example, once a day). You may do it. Further, the user, the administrator of the information processing apparatus 10, or the like may manually instruct the start of execution of the process of FIG.

図６の処理では、まず、ステップＳ１０において、情報処理装置１０のひな形生成部２０は、新たにレイアウトデータ格納装置７０に格納された新聞紙面のレイアウトデータを取得する。 In the process of FIG. 6, first, in step S10, the model generation unit 20 of the information processing device 10 acquires the layout data of the newspaper page newly stored in the layout data storage device 70.

次いで、ステップＳ１１では、ひな形生成部２０が、ユーザが製作した新聞紙面の中から蓄積対象面を決定する。ひな形生成部２０は、蓄積対象テーブル３０を参照して、１日分の新聞紙面から、ひな形を生成する面を決定する。図４の蓄積対象テーブル３０からは、「生活」、「くらし」、「文化」の面がひな形を生成する面として決定される。なお、以下においては、「文化」の面のひな形生成を例にとり説明する。 Next, in step S11, the template generation unit 20 determines the storage target surface from the newspaper surface produced by the user. The model generation unit 20 determines the surface on which the model is to be generated from the newspaper surface for one day with reference to the accumulation target table 30. From the accumulation target table 30 of FIG. 4, the aspects of "life", "living", and "culture" are determined as the aspects that generate the template. In the following, the model generation of the "culture" aspect will be described as an example.

次いで、ステップＳ１２では、ひな形生成部２０が、ユーザが製作した新聞紙面の解析処理を実行する。ステップＳ１２の解析処理においては、具体的には、ひな形生成部２０は、図７のフローチャートに沿った処理を実行する。 Next, in step S12, the template generation unit 20 executes an analysis process of the newspaper surface produced by the user. In the analysis process of step S12, specifically, the template generation unit 20 executes the process according to the flowchart of FIG. 7.

ひな形生成部２０は、図７の処理において、配置領域の特定処理（Ｓ１２０）と、配置領域の種別判定処理（Ｓ１２２）と、配置領域のカテゴリ判定処理（Ｓ１２４）とを実行する。 In the process of FIG. 7, the template generation unit 20 executes the arrangement area identification process (S120), the arrangement area type determination process (S122), and the arrangement area category determination process (S124).

配置領域の特定（Ｓ１２０）においては、ひな形生成部２０は、区切り線、文字のフォント、フォントサイズ、文字の色の少なくとも１つに基づいて、配置領域を抽出する。例えば、ひな形生成部２０は、区切り線を用いて、新聞紙面の領域を区画するとともに、区画した範囲内で、文字のフォント、フォントサイズや色に基づいて、更に領域を区画することで、配置領域を抽出する。また、画像が配置されている領域についても、配置領域として抽出する。 In specifying the arrangement area (S120), the template generation unit 20 extracts the arrangement area based on at least one of the dividing line, the font of the character, the font size, and the color of the character. For example, the template generation unit 20 divides the area of the newspaper surface by using the dividing line, and further divides the area based on the font, font size, and color of the characters within the divided range. Extract the placement area. In addition, the area where the image is arranged is also extracted as the arrangement area.

次いで、配置領域の種別判定処理（Ｓ１２２）においては、まず、ステップＳ１２２１において、ひな形生成部２０が、記事領域判定を実行する。具体的には、ひな形生成部２０は、割り付いた文字のフォントが記事本文用のフォントである配置領域を記事領域として認識する。 Next, in the arrangement area type determination process (S122), first, in step S1221, the template generation unit 20 executes the article area determination. Specifically, the template generation unit 20 recognizes the arrangement area in which the font of the assigned characters is the font for the article body as the article area.

次いで、ステップＳ１２２２では、ひな形生成部２０は、見出し領域判定処理を実行する。具体的には、ひな形生成部２０は、割り付いた文字のフォントが見出し用のフォントである配置領域、記事フォントよりもサイズが大きい配置領域、フォントの色が見出し用の色である配置領域、例えば白抜き文字など装飾されている文字が割りついた配置領域、などを見出し領域と認識する。 Next, in step S1222, the template generation unit 20 executes the heading area determination process. Specifically, the template generation unit 20 has an arrangement area in which the font of the assigned characters is a headline font, an arrangement area in which the size is larger than the article font, and an arrangement area in which the font color is the headline color. For example, an arrangement area to which decorated characters such as outline characters are assigned is recognized as a heading area.

次いで、ステップＳ１２２３では、ひな形生成部２０は、画像領域判定処理を実行する。具体的には、ひな形生成部２０は、文字が割り付いておらず、画像のみが割り付いている配置領域または画像と写真説明文の領域とを合わせた領域を画像領域と認識する。 Next, in step S1223, the template generation unit 20 executes the image area determination process. Specifically, the template generation unit 20 recognizes an arrangement area in which characters are not assigned and only an image is assigned, or an area in which an image and a photo description area are combined as an image area.

以上のようにして配置領域の種別判定処理（Ｓ１２２）が行われた後は、ひな形生成部２０は、ステップＳ１２４において、配置領域のカテゴリ判定を実行する。具体的には、ひな形生成部２０は、次のようにして、配置領域のカテゴリ判定を行う。 After the type determination process (S122) of the arrangement area is performed as described above, the template generation unit 20 executes the category determination of the arrangement area in step S124. Specifically, the template generation unit 20 determines the category of the arrangement area as follows.

（１）まず、ひな形生成部２０は、見出し領域の文字（見出し文）を形態素解析し、名詞を抽出する。例えば、図８に示すように、見出し文が「日本総人口初の減少」であったとする。この場合、ひな形生成部２０は、形態素解析により、名詞「日本」「人口」「初」「減少」を抽出する。
（２）次いで、ひな形生成部２０は、記事領域の文字（記事本文）の中から、（１）で抽出された名詞「日本」「人口」「初」「減少」を検索する。
（３）次いで、ひな形生成部２０は、見出し文と記事本文の文字が所定の類似関係を満たす場合に、同一のカテゴリとする。具体的には、ひな形生成部２０は、一致する名詞が最も多い記事本文と見出しを関連付け、同一のカテゴリ名「日本／人口／初／減少」を付与する。 (1) First, the template generation unit 20 performs morphological analysis of characters (heading sentences) in the heading area and extracts nouns. For example, as shown in Fig. 8, the headline is "the first decrease in the total population of Japan". In this case, the template generation unit 20 extracts the nouns "Japan", "population", "first", and "decrease" by morphological analysis.
(2) Next, the template generation unit 20 searches for the nouns “Japan”, “population”, “first”, and “decrease” extracted in (1) from the characters (article text) in the article area.
(3) Next, the template generation unit 20 sets the same category when the characters in the headline and the article body satisfy a predetermined similarity relationship. Specifically, the template generation unit 20 associates the headline with the article text having the most matching nouns, and assigns the same category name "Japan / population / first / decrease".

以上の処理により、図７の処理（ステップＳ１２の処理）が終了する。この図７の処理により、各配置領域の種別が判定できるとともに、配置領域のカテゴリが判定できるので、以降においては、カテゴリ名が共通する配置領域を１つのまとまり（モジュール）として扱うことが可能になる。この場合、共通するカテゴリ名がモジュール名として扱われる。なお、ひな形生成部２０は、ステップＳ１２の判定結果を、配置領域のレイアウトデータとして管理する。なお、配置領域のレイアウトデータには、配置領域の位置や大きさも含まれている。 By the above processing, the processing of FIG. 7 (processing of step S12) is completed. By the process of FIG. 7, the type of each arrangement area can be determined and the category of the arrangement area can be determined. Therefore, thereafter, the arrangement areas having the same category name can be treated as one group (module). Become. In this case, the common category name is treated as the module name. The template generation unit 20 manages the determination result in step S12 as layout data of the arrangement area. The layout data of the arrangement area also includes the position and size of the arrangement area.

図６に戻り、ステップＳ１３では、ひな形生成部２０が、モジュール分割を実行する。例えば、ユーザ端末６０において、新聞紙面の「文化」の面が、図９に示すように製作されたものとする。この場合、ひな形生成部２０は、文化の面をカテゴリ（＝記事、話題）ごとに分割する。なお、本実施形態においては、新聞紙面のレイアウトデータでは、前述のように配置領域の位置や大きさと種別（見出し、記事、画像、区切り線の別）とが管理されている。すなわち、レイアウトデータからは、図１０に示すように、配置領域の位置や大きさと種別とが得られる。また、レイアウトデータにおいては、配置領域内に配置された記事や見出し、画像のカテゴリ名（何に関する記事であるかを示す名称）が管理されている。したがって、ひな形生成部２０は、カテゴリ名ごとのまとまりで、配置領域を分割する。図１１には、モジュール分割により新聞紙面の配置領域が分割された状態が太線枠にて示されている。 Returning to FIG. 6, in step S13, the template generation unit 20 executes the module division. For example, in the user terminal 60, it is assumed that the "culture" aspect of the newspaper is produced as shown in FIG. In this case, the template generation unit 20 divides the cultural aspect into categories (= articles, topics). In the present embodiment, in the layout data of the newspaper page, the position, size, and type (separate of headline, article, image, and dividing line) of the arrangement area are managed as described above. That is, as shown in FIG. 10, the position, size, and type of the arrangement area can be obtained from the layout data. Further, in the layout data, articles, headlines, and image category names (names indicating what the articles are related to) arranged in the arrangement area are managed. Therefore, the template generation unit 20 divides the arrangement area by a group for each category name. In FIG. 11, a state in which the arrangement area of the newspaper surface is divided by the module division is shown by a thick line frame.

次いで、ステップＳ１４では、ひな形生成部２０が、書き割り化を実行する。具体的には、ひな形生成部２０は、新聞紙面のレイアウトデータを編集し、文字や画像を削除して配置領域だけの状態にする。図１２には、書き割り化後の新聞レイアウトが示されている。なお、書き割り化後の新聞レイアウトは、新聞紙面から抽出された形状データであるといえる。 Next, in step S14, the template generation unit 20 executes the writing division. Specifically, the template generation unit 20 edits the layout data on the newspaper surface and deletes characters and images to make only the arrangement area. FIG. 12 shows the newspaper layout after the writing. It can be said that the newspaper layout after the writing is divided is the shape data extracted from the newspaper surface.

次いで、ステップＳ１６では、ひな形生成部２０が、レイアウトデータの書き出しを行う。すなわち、ひな形生成部２０は、書き割り化した新聞紙面のレイアウトデータを書き出し、ストレージ９６に格納するとともに、格納先のパスを取得する。この場合、ひな形生成部２０は、面全体のレイアウトデータを書き出すとともに、面の一部であるモジュールごとのレイアウトデータ(図１１の太線枠ごとのレイアウトデータ)についても書き出す。 Next, in step S16, the template generation unit 20 writes out the layout data. That is, the template generation unit 20 writes out the layout data of the newspaper page that has been divided into writing, stores it in the storage 96, and acquires the path of the storage destination. In this case, the template generation unit 20 writes out the layout data of the entire surface and also writes out the layout data for each module (layout data for each thick line frame in FIG. 11) which is a part of the surface.

次いで、ステップＳ１８では、ひな形生成部２０が、管理情報の収集を行う。この場合、ひな形生成部２０は、レイアウトデータ格納装置７０に格納されているユーザが製作した新聞紙面の情報を参照し、蓄積する必要のある管理情報を収集する。具体的には、ひな形生成部２０は、レイアウトデータ格納装置７０から、新聞紙面の管理情報として、図５のひな形ＤＢ３２の「掲載年月日」、「掲載曜日」、「左右ページ」、「面名」、「版名」、「モジュール名」、「縦大きさ」、「横大きさ」、「記事部段数」のフィールドに格納する情報などを収集する。 Next, in step S18, the template generation unit 20 collects management information. In this case, the template generation unit 20 refers to the information on the newspaper page produced by the user stored in the layout data storage device 70, and collects the management information that needs to be accumulated. Specifically, the template generation unit 20 uses the layout data storage device 70 to use the “posting date”, “posting day of the week”, “left and right pages” of the template DB 32 of FIG. Collect information to be stored in the fields of "face name", "version name", "module name", "vertical size", "horizontal size", and "number of article copies".

次いで、ステップＳ２０では、ひな形生成部２０が、表示画を作成する。この場合、図１３（ａ）、図１３（ｂ）に示すような、モジュールごとの表示画を書き出す。より具体的には、ひな形生成部２０は、図１３（ａ）に示すような組み上がり状態（新聞発行時の状態）を示す表示画と、図１３（ｂ）に示すような書き割り状態（配置領域のみの状態）を示す表示画の２種類の表示画を作成する。また、ひな形生成部２０は、図９の新聞紙面そのものの表示画についても作成する。なお、ひな形生成部２０は、作成した表示画をストレージ９６に格納し、格納先のパスを取得する。 Next, in step S20, the template generation unit 20 creates a display image. In this case, the display image for each module is written out as shown in FIGS. 13 (a) and 13 (b). More specifically, the template generation unit 20 has a display image showing an assembled state (state at the time of newspaper publication) as shown in FIG. 13 (a) and a writing division state as shown in FIG. 13 (b). Two types of display images are created, which are display images showing (state of only the arrangement area). In addition, the template generation unit 20 also creates a display image of the newspaper surface itself of FIG. The template generation unit 20 stores the created display image in the storage 96 and acquires the path of the storage destination.

次いで、ステップＳ２２では、ひな形生成部２０が、ひな形ＤＢ３２へデータを格納する。すなわち、ひな形生成部２０は、これまでの処理において作成、収集したデータをひな形ＤＢ３２へ格納する。なお、図９の新聞紙面そのものの表示画の格納パスについては、図５のひな形ＤＢ３２のデータ種別が「面」のデータ（行）の「表示画格納パス（組み上がり）」のフィールドに格納される。 Next, in step S22, the model generator 20 stores the data in the model DB 32. That is, the template generation unit 20 stores the data created and collected in the processing so far in the template DB 32. The storage path of the display image of the newspaper surface itself in FIG. 9 is stored in the "display image storage path (assembled)" field of the data (row) whose data type is "face" in the template DB32 of FIG. Will be done.

（情報処理装置１０によるひな形提示処理）
次に、図１４のフローチャートに沿って、情報処理装置１０により実行されるひな形提示処理について説明する。図１４の処理は、例えば、ユーザがユーザ端末６０において新規の新聞紙面の製作を開始するタイミングで実行される処理である。 (Model presentation processing by information processing device 10)
Next, the template presentation process executed by the information processing apparatus 10 will be described with reference to the flowchart of FIG. The process of FIG. 14 is, for example, a process executed at the timing when the user starts producing a new newspaper on the user terminal 60.

図１４の処理では、まず、ステップＳ５０において、ひな形提示部２２は、ユーザが製作しようとしている紙面の情報を取得する。この場合、紙面編集開始時に自動で紙面の情報を取得する場合と、ユーザが手入力した紙面の情報を取得する場合とがある。 In the process of FIG. 14, first, in step S50, the template presenting unit 22 acquires the information on the paper surface to be produced by the user. In this case, there are cases where the information on the paper surface is automatically acquired at the start of editing the paper surface, and there are cases where the information on the paper surface manually input by the user is acquired.

紙面編集開始時に自動で紙面の情報を取得する場合、ひな形提示部２２は、ユーザ端末６０から、製作しようとしている紙面の情報と作成済みで新聞紙面に配置しようとしているコンテンツ（記事）の属性を自動的に取得する。なお、ユーザ端末６０において紙面編集を開始する際に表示される画面は、図１５（ａ）に示すような画面である。ユーザは、紙面を製作するときに、図１５（ａ）の画面の入力欄７２において日付や、製作する面の情報などを入力、選択等するため、ひな形提示部２２は、図１５（ａ）の画面に入力された情報を取得する。また、ひな形提示部２２は、ユーザ端末６０において作成されたコンテンツ（記事）の情報（モジュール名や、モジュールの大きさ、画像数、画像の縦横の情報など）を取得する。なお、コンテンツ（記事）の情報は、文字、文字の構成情報、写真の構成情報を含んでいる。 When the information on the paper is automatically acquired at the start of editing the paper, the template presentation unit 22 uses the user terminal 60 to display the information on the paper to be produced and the attributes of the created content (article) to be placed on the newspaper. Is automatically acquired. The screen displayed when starting paper editing on the user terminal 60 is a screen as shown in FIG. 15A. Since the user inputs and selects the date, information on the surface to be produced, and the like in the input field 72 of the screen of FIG. 15 (a) when producing the paper, the template presentation unit 22 is shown in FIG. 15 (a). ) Get the information entered on the screen. In addition, the template presentation unit 22 acquires information (module name, module size, number of images, vertical and horizontal information of images, etc.) of the content (article) created in the user terminal 60. The content (article) information includes characters, character composition information, and photo composition information.

一方、ユーザが手動で紙面の情報を入力する画面は、図１５（ｂ）に示すような画面である。ユーザは、ひな形を呼び出す際に、図１５（ｂ）の画面において、掲載年月日や掲載曜日、記事数、画像数、モジュール名、面名、版名、左右ページ、記事部段数、記事行数、画像縦横などの情報を入力し、確定ボタンを押す。ユーザによって確定ボタンが押されると、ユーザ端末６０からひな形提示部２２に図１５（ｂ）の画面において入力された情報が送信される。 On the other hand, the screen on which the user manually inputs the information on the paper is the screen as shown in FIG. 15B. When the user calls the template, on the screen shown in FIG. 15B, the publication date, the publication day of the week, the number of articles, the number of images, the module name, the face name, the version name, the left and right pages, the number of article sections, and the article Enter information such as the number of lines and image height and width, and press the confirm button. When the confirmation button is pressed by the user, the information input on the screen of FIG. 15B is transmitted from the user terminal 60 to the template presentation unit 22.

図１４に戻り、次のステップＳ５２では、ひな形提示部２２が、紙面の属性（掲載曜日、左右ページ、面名、版名など）でひな形を絞り込む。この場合、ひな形提示部２２は、ひな形ＤＢ３２を参照して、ユーザが製作しようとしている紙面の属性と一致するひな形を抽出する。なお、ステップＳ５２では、ユーザが製作しようとしている紙面の属性と完全一致するひな形を抽出する場合に限らず、属性がある程度類似するひな形を抽出することとしてもよい。 Returning to FIG. 14, in the next step S52, the template presentation unit 22 narrows down the template by the attributes of the paper (posting day of the week, left and right pages, surface name, version name, etc.). In this case, the template presenting unit 22 refers to the template DB 32 and extracts a template that matches the attributes of the paper surface to be produced by the user. In step S52, not only the case of extracting a template that completely matches the attributes of the paper surface that the user is trying to produce, but also the template that has some similar attributes may be extracted.

次いで、ステップＳ５４では、ひな形提示部２２が、記事（コンテンツ）の属性でひな形を絞り込む。この場合、ひな形提示部２２は、ステップＳ５２で抽出したひな形のうち、ステップＳ５０で取得した記事の属性（例えば、モジュール名、記事数、行数、画像数、画像の縦横など）と一致するひな形を抽出する。なお、ステップＳ５２では、記事の属性と完全一致するひな形を抽出する場合に限らず、属性がある程度類似するひな形を抽出することとしてもよい。 Next, in step S54, the template presentation unit 22 narrows down the template by the attribute of the article (content). In this case, the template presentation unit 22 matches the attributes of the articles acquired in step S50 (for example, module name, number of articles, number of lines, number of images, vertical and horizontal images) among the templates extracted in step S52. Extract the template to be used. In step S52, not only the case of extracting the model that completely matches the attribute of the article, but also the case of extracting the model whose attributes are similar to some extent may be extracted.

次いで、ステップＳ５６では、ひな形提示部２２が、ひな形の候補を一覧表示する。具体的には、ひな形提示部２２は、図１６や図１７に示すような一覧画面を情報処理装置１０の紙面製作部６２に送信する。これにより、紙面製作部６２が、表示部１９３上に一覧表示を行う。図１６の一覧画面では、表示画として「組み上がり」、「書き割り」が表示される。一方、図１７の一覧画面では、表示画として「組み上がり」、「書き割り」に加え、「出来上がり」が表示される。表示画（出来上がり）は、ひな形を生成した際に用いた新聞紙面そのものの画像（図９の新聞紙面から生成した表示画）である。表示画（出来上がり）は、ひな形ＤＢ３２のうちデータ種別が「面」のデータの「表示画格納パス（組み上がり）」のフィールドに格納されているパスで管理されている表示画である。なお、図１６と図１７のいずれの一覧画面を表示するかは、ユーザにより予め設定されているものとする。なお、ユーザからの入力により、一覧画面を図１６から図１７へ、又は図１７から図１６へ切り替えられるようにしてもよい。ユーザは、図１６や図１７の一覧画面を参照することで、製作しようとしている新聞紙面において記事や画像がどのように配置されるかを確認することができるので、一覧画面からのひな形選択がしやすくなっている。 Next, in step S56, the model presentation unit 22 displays a list of model candidates. Specifically, the template presentation unit 22 transmits a list screen as shown in FIGS. 16 and 17 to the paper production unit 62 of the information processing device 10. As a result, the paper manufacturing unit 62 displays a list on the display unit 193. On the list screen of FIG. 16, "assembled" and "written division" are displayed as display images. On the other hand, on the list screen of FIG. 17, in addition to "assembled" and "written division", "finished" is displayed as a display image. The display image (completed) is an image of the newspaper surface itself (display image generated from the newspaper surface of FIG. 9) used when the template was generated. The display image (completed) is a display image managed by the path stored in the "display image storage path (assembled)" field of the data whose data type is "face" in the template DB 32. It is assumed that which of the list screens of FIGS. 16 and 17 is displayed is preset by the user. The list screen may be switched from FIG. 16 to FIG. 17 or from FIG. 17 to FIG. 16 by input from the user. By referring to the list screens of FIGS. 16 and 17, the user can confirm how the articles and images are arranged on the newspaper to be produced, so that a template can be selected from the list screen. It is easy to remove.

次いで、ステップＳ５８では、ひな形提示部２２が、一覧画面の中からユーザがいずれかのひな形を選択するまで待機する。ユーザが入力部１９５を介していずれかのひな形を選択し、選択したひな形の情報が紙面製作部６２からひな形提示部２２に送信されると、ステップＳ６０に移行する。 Next, in step S58, the template presentation unit 22 waits until the user selects one of the templates from the list screen. When the user selects one of the models via the input unit 195 and the information of the selected model is transmitted from the paper production unit 62 to the model presentation unit 22, the process proceeds to step S60.

ステップＳ６０に移行すると、ひな形提示部２２は、ひな形をユーザ端末６０の表示部１９３上に展開する。図１８には、選択されたひな形を表示部１９３上に展開した状態が示されている。ユーザは、展開されたひな形の各配置領域に対して、見出しや記事を配置したり、画像を配置することが可能となっている。なお、展開されたひな形の配置領域の大きさは、配置時に更新またはユーザが適宜変更できるようになっている。すなわち、ステップＳ６０では、ひな形提示部２２は、ユーザによって選択されたひな形を、ユーザが製作しようとしている新聞紙面の構成要素として提示しているといえる。 When the process proceeds to step S60, the model presentation unit 22 develops the model on the display unit 193 of the user terminal 60. FIG. 18 shows a state in which the selected template is expanded on the display unit 193. The user can place a headline or an article, or place an image in each of the arranged areas of the expanded template. The size of the arrangement area of the expanded template can be updated or changed as appropriate by the user at the time of arrangement. That is, in step S60, it can be said that the template presenting unit 22 presents the template selected by the user as a component of the newspaper page that the user intends to produce.

以上のように、図１４の処理が実行されることで、本実施形態では、ユーザが、適切なひな形を利用することができるので、簡易に新聞紙面を製作することが可能となっている。 As described above, by executing the process of FIG. 14, in the present embodiment, the user can use an appropriate template, so that it is possible to easily produce a newspaper page. ..

これまでの説明からわかるように、本実施形態では、ひな形生成部２０は、製作した新聞紙面から、区切り線、文字のフォント、フォントサイズ、文字の色の少なくとも１つに基づいて配置領域を特定する特定部、及び特定した配置領域から文字を除いた形状を抽出する抽出部として機能する。また、ひな形生成部２０は、特定した配置領域に配置された文字に基づいて配置領域の属性を生成し、生成した属性を抽出した形状に対応づけてひな形ＤＢ３２に記憶させる生成部として機能する。また、ひな形生成部２０は、新聞紙面から、区切り線で区切られた領域（図１１の太枠参照）を特定する特定部、及び特定した領域のデータと、該領域から文字を除いた形状とをひな形ＤＢ３２やストレージ９６に記憶させる記憶制御部として機能する。 As can be seen from the above description, in the present embodiment, the template generation unit 20 determines the arrangement area from the produced newspaper surface based on at least one of the dividing line, the character font, the font size, and the character color. It functions as a specific part to be specified and an extraction part to extract the shape excluding characters from the specified arrangement area. Further, the template generation unit 20 functions as a generation unit that generates attributes of the arrangement area based on the characters arranged in the specified arrangement area, and stores the generated attributes in the template DB 32 in association with the extracted shape. To do. Further, the template generation unit 20 includes a specific unit that specifies an area separated by a dividing line (see the thick frame in FIG. 11) from the newspaper surface, data of the specified area, and a shape obtained by removing characters from the area. It functions as a storage control unit that stores and in the template DB 32 and the storage 96.

以上、詳細に説明したように、本実施形態によると、ひな形生成部２０は、ユーザがユーザ端末６０において製作した新聞紙面から、区切り線、文字のフォント、フォントサイズ、色等に基づいて配置領域を特定し（Ｓ１２０）、特定した配置領域から文字を除いた形状を抽出する（Ｓ１３）。これにより、ユーザが製作した新聞紙面において配置領域が指定されていなくても、自動的に配置領域を特定し、ひな形に利用可能な配置領域の形状を簡易に抽出することができる。 As described in detail above, according to the present embodiment, the template generator 20 is arranged from the newspaper surface produced by the user on the user terminal 60 based on a dividing line, a character font, a font size, a color, and the like. An area is specified (S120), and a shape excluding characters from the specified arrangement area is extracted (S13). As a result, even if the arrangement area is not specified on the newspaper surface produced by the user, the arrangement area can be automatically specified and the shape of the arrangement area that can be used as a model can be easily extracted.

また、本実施形態によると、ひな形生成部２０は、特定した配置領域に配置された文字に基づいて属性情報（見出し、記事、画像の別や、カテゴリ（モジュール名））を生成し（Ｓ１２２）、配置領域の形状等に対応付けてレイアウトデータとしてストレージ９３に記憶する。これにより、ユーザが製作した新聞紙面において配置領域の属性が定められていない場合であっても、新聞紙面から配置領域の属性情報と配置領域の形状等を対応付けたレイアウトデータを自動的に生成することが可能である。 Further, according to the present embodiment, the template generation unit 20 generates attribute information (headline, article, image classification, category (module name)) based on the characters arranged in the specified arrangement area (S122). ), Stored in the storage 93 as layout data in association with the shape of the arrangement area and the like. As a result, even if the attributes of the arrangement area are not defined on the newspaper surface produced by the user, layout data in which the attribute information of the arrangement area and the shape of the arrangement area are associated with each other is automatically generated from the newspaper surface. It is possible to do.

また、本実施形態によると、２つの配置領域のそれぞれに含まれる文字が所定の類似関係を満たす場合（記事本文が、見出し文に含まれる名詞を多く含む場合）に、２つの配置領域を同じグループ（モジュール）としてストレージ９６及びひな形ＤＢ３２に記憶する。これにより、ユーザが製作した新聞紙面において配置領域間の関係が定められていない場合であっても、新聞紙面から配置領域のグループ（モジュール）を自動的に生成することが可能である。 Further, according to the present embodiment, when the characters included in each of the two arrangement areas satisfy a predetermined similarity relationship (when the article body contains many nouns included in the headline sentence), the two arrangement areas are the same. It is stored in the storage 96 and the template DB 32 as a group (module). As a result, even if the relationship between the arrangement areas is not defined in the newspaper surface produced by the user, it is possible to automatically generate a group (module) of the arrangement area from the newspaper surface.

また、本実施形態では、新聞紙面から、区切り線で区切られた領域を特定し（図１１）、特定した領域のデータ（例えばモジュール名や記事数、画像数など）と、特定した領域から文字を除いた形状データ（ひな形）とをストレージ９６及びひな形ＤＢ３２に記憶している。これにより、モジュール名や記事数、画像数などに基づいて、ストレージ９６からユーザが製作しようとしている新聞紙面に適したひな形をモジュール単位で読み出すことが可能である。また、本実施形態では、ひな形提示部２２がひな形をユーザに提示する際に、ひな形に対応する特定した領域のデータ（モジュール名や記事数、画像数など）を併せて表示する（図１６、図１７）ので、ユーザは、ひな形が複数提示された場合でも、適切なひな形を選択することが可能である。 Further, in the present embodiment, an area separated by a dividing line is specified from the newspaper surface (FIG. 11), data of the specified area (for example, module name, number of articles, number of images, etc.) and characters from the specified area are specified. The shape data (template) excluding the above is stored in the storage 96 and the template DB 32. As a result, based on the module name, the number of articles, the number of images, and the like, it is possible to read out a template suitable for the newspaper page to be produced by the user from the storage 96 in module units. Further, in the present embodiment, when the model presenting unit 22 presents the model to the user, the data (module name, number of articles, number of images, etc.) of the specified area corresponding to the model is also displayed ( 16 and 17), the user can select an appropriate model even when a plurality of models are presented.

また、本実施形態では、ひな形ＤＢ３２において、モジュールが配置された面の面名、面の記事部段数、モジュールの行数情報をモジュールのひな形に対応づけて記憶している。これにより、ひな形に対応付けて記憶されている情報に基づいて、ひな形ＤＢ３２からユーザが製作しようとしている新聞紙面に適したひな形を抽出し、ユーザに提供することが可能である。 Further, in the present embodiment, in the template DB 32, the surface name of the surface on which the module is arranged, the number of article sections of the surface, and the line number information of the module are stored in association with the module template. Thereby, based on the information stored in association with the template, it is possible to extract a template suitable for the newspaper surface to be produced by the user from the template DB 32 and provide the template to the user.

また、本実施形態では、ひな形ＤＢ３２において、モジュールが配置された面が新聞の左右いずれの面であるかの情報を、モジュールのひな形に対応づけて記憶している。新聞の場合、左右いずれの面であるかによってレイアウトが異なるため、上記のように左右いずれの面であるかの情報をひな形に対応付けて記憶しておくことで、ユーザが製作しようとしている新聞紙面に適したひな形を抽出し、ユーザに提供することが可能である。 Further, in the present embodiment, in the model DB 32, information on whether the side on which the module is arranged is the left or right side of the newspaper is stored in association with the model of the module. In the case of newspapers, the layout differs depending on which side is left or right, so the user is trying to produce by storing the information on which side is left or right in association with the template as described above. It is possible to extract a template suitable for the newspaper surface and provide it to the user.

また、本実施形態では、ひな形ＤＢ３２において、モジュールが掲載された新聞紙面が発行された曜日や日付を、モジュールのひな形に対応づけて記憶している。新聞の場合、曜日や日付によってレイアウトが異なることが多いため、上記のように曜日や日付の情報をひな形に対応付けて記憶しておくことで、ユーザが製作しようとしている新聞紙面に適したひな形を抽出し、ユーザに提供することが可能である。 Further, in the present embodiment, in the template DB 32, the day of the week and the date on which the newspaper on which the module is published are published are stored in association with the template of the module. In the case of newspapers, the layout often differs depending on the day of the week and date, so by storing the information on the day of the week and date in association with the template as described above, it is suitable for the newspaper page that the user is trying to produce. It is possible to extract a template and provide it to the user.

また、本実施形態によれば、ユーザ端末６０で生成された新聞紙面のレイアウトデータを取得し（Ｓ１０）、取得したデータのうち、区切り線と、各記事の配置領域を抽出し（Ｓ１３〜Ｓ１６）、抽出した区切り線と、各記事の配置領域の形状とをひな形（レイアウトデータ）として、読み出し可能にひな形ＤＢ３２及びストレージ９６に記憶させる（Ｓ２２）。これにより、ユーザが新聞紙面を製作する際に利用可能なひな形を自動的に生成して、ひな形ＤＢ３２及びストレージ９６に記憶させることができる。この場合、ひな形を一から手作業で生成したり、製作済みの新聞紙面から文字や画像を除外してひな形を生成したりする場合に比べて、手間や時間をかけずにひな形を生成することができ、新聞紙面の製作作業を適切に支援することが可能である。 Further, according to the present embodiment, the layout data of the newspaper page generated by the user terminal 60 is acquired (S10), and the dividing line and the arrangement area of each article are extracted from the acquired data (S13 to S16). ), The extracted dividing line and the shape of the arrangement area of each article are stored as a template (layout data) in the template DB 32 and the storage 96 so that they can be read (S22). As a result, a template that can be used by the user when producing a newspaper page can be automatically generated and stored in the template DB 32 and the storage 96. In this case, compared to manually generating a template from scratch or excluding characters and images from the produced newspaper to generate a template, it takes less time and effort to create a template. It can be generated, and it is possible to appropriately support the production work of newspaper pages.

また、本実施形態では、ひな形ＤＢ３２において、記事のカテゴリ（モジュール名）をひな形に対応付けて記憶している。これにより、ユーザが製作しようとしている記事のカテゴリに応じて、適切なひな形をひな形ＤＢ３２から読み出して、ユーザに対して提供することができる。 Further, in the present embodiment, in the template DB 32, the article category (module name) is stored in association with the template. As a result, an appropriate template can be read from the template DB 32 and provided to the user according to the category of the article to be produced by the user.

また、本実施形態では、ひな形ＤＢ３２において、ひな形を生成するときに用いた新聞紙面から生成した画像（表示画）をひな形に対応付けて記憶している。これにより、ユーザに対してひな形を提示するときに、表示画を表示することができるため、ユーザは、表示画を参照して、提示されたひな形の中から利用するひな形を選択することが可能である。 Further, in the present embodiment, in the model DB 32, an image (display image) generated from the newspaper surface used when generating the model is stored in association with the model. As a result, when the template is presented to the user, the display image can be displayed. Therefore, the user refers to the display image and selects the template to be used from the presented templates. It is possible.

また、本実施形態では、新たに製作された新聞紙面のうち、蓄積対象テーブル３０において対象とされている面を用いて、ひな形（レイアウトデータ）を生成し、ひな形ＤＢ３２及びストレージ９６に読み出し可能に記憶することとしている。これにより、ユーザが利用する可能性の高い、利便性の高いひな形を生成し、ひな形ＤＢ３２及びストレージ９６に記憶しておくことができる。 Further, in the present embodiment, a template (layout data) is generated by using the surface of the newly produced newspaper surface targeted in the storage target table 30, and read out to the template DB 32 and the storage 96. I will remember it as much as possible. As a result, a highly convenient template that is likely to be used by the user can be generated and stored in the template DB 32 and the storage 96.

また、本実施形態によると、ひな形提示部２２は、ユーザが製作しようとしている（編集対象の）紙面や記事の属性（掲載曜日、左右ページ、面名、版名、記事数、記事行数、画像数、画像の縦横など）を取得し、ひな形ＤＢ３２を参照して、取得した紙面の属性に対応するひな形を絞り込み（Ｓ５２、Ｓ５４）、絞り込んだひな形をユーザが製作しようとしている紙面のひな形として提示する（Ｓ５６）。これにより、ユーザが製作しようとしている紙面の製作に適したひな形を自動的に特定し、提示することができる。したがって、多数のひな形がひな形ＤＢ３２及びストレージ９６に格納されている場合であっても、新聞の製作に適したひな形を容易に絞り込み、ユーザに提示することができる。 Further, according to the present embodiment, the template presentation unit 22 has the attributes (posting day of the week, left and right pages, page name, version name, number of articles, number of article lines) of the paper or article (to be edited) that the user intends to produce. , Number of images, vertical and horizontal images, etc.), refer to the template DB32, narrow down the templates corresponding to the attributes of the acquired paper (S52, S54), and the user is trying to produce the narrowed template It is presented as a template on paper (S56). As a result, it is possible to automatically identify and present a template suitable for producing the paper surface to be produced by the user. Therefore, even when a large number of models are stored in the model DB 32 and the storage 96, it is possible to easily narrow down the models suitable for producing a newspaper and present them to the user.

また、本実施形態では、ひな形提示部２２は、絞り込んだひな形を提示する際に、ひな形のデータとともに、文字や写真の配置候補（表示画（組み上がり）や表示画（出来上がり））を提示する。これにより、ユーザは、提示されたひな形から利用するひな形を選択する際に、組み上がりや出来上がり後の紙面を参考にすることができる。 Further, in the present embodiment, when the model presenting unit 22 presents the narrowed-down stationery, the stationery presenting unit 22 includes the character and photo arrangement candidates (display image (assembled) and display image (completed)) together with the model data. To present. As a result, the user can refer to the assembled and finished paper when selecting the model to be used from the presented models.

また、本実施形態では、ひな形提示部２２は、ひな形を提示する際に、製作しようとしている面全体または面の一部を提示する。これにより、ユーザは、紙面作成の際に利用しやすい大きさのひな形を選択することが可能である。 Further, in the present embodiment, when presenting the template, the template presenting unit 22 presents the entire surface or a part of the surface to be manufactured. This allows the user to select a template having a size that is easy to use when creating the paper.

なお、上記実施形態では、ひな形提示部２２は、ステップＳ５２及びＳ５４において、面及びコンテンツの属性でひな形を絞り込む場合について説明したが、これに限られるものではない。すなわち、ステップＳ５２とステップＳ５４のいずれか一方を実行することとしてもよい。 In the above embodiment, the template presenting unit 22 has described the case where the template is narrowed down by the attributes of the surface and the content in steps S52 and S54, but the present invention is not limited to this. That is, either step S52 or step S54 may be executed.

なお、上記実施形態では、情報処理装置１０において生成されたひな形を情報処理装置１０内のひな形ＤＢ３２で管理する場合について説明したが、これに限らず、外部のデータサーバ等で管理してもよい。この場合、ひな形を生成する装置と、ひな形を提示する装置とが異なっていてもよい。 In the above embodiment, the case where the model generated in the information processing device 10 is managed by the model DB 32 in the information processing device 10 has been described, but the present invention is not limited to this, and the model is managed by an external data server or the like. May be good. In this case, the device for generating the stationery and the device for presenting the stationery may be different.

なお、上記実施形態では、新聞製作時に利用するひな形の生成、提示において、情報処理装置１０が上記処理を実行する場合について説明したが、これに限られるものではない。例えば、雑誌などの書籍の製作時に利用するひな形の生成、提示において、情報処理装置１０が上記処理を実行することとしてもよい。 In the above embodiment, the case where the information processing apparatus 10 executes the above processing in the generation and presentation of the template used at the time of newspaper production has been described, but the present invention is not limited to this. For example, the information processing apparatus 10 may execute the above processing in generating and presenting a template used when producing a book such as a magazine.

なお、上記の処理機能は、コンピュータによって実現することができる。その場合、処理装置が有すべき機能の処理内容を記述したプログラムが提供される。そのプログラムをコンピュータで実行することにより、上記処理機能がコンピュータ上で実現される。処理内容を記述したプログラムは、コンピュータで読み取り可能な記録媒体（ただし、搬送波は除く）に記録しておくことができる。 The above processing function can be realized by a computer. In that case, a program that describes the processing content of the function that the processing device should have is provided. By executing the program on a computer, the above processing function is realized on the computer. The program describing the processing content can be recorded on a computer-readable recording medium (however, the carrier wave is excluded).

プログラムを流通させる場合には、例えば、そのプログラムが記録されたＤＶＤ（Digital Versatile Disc）、ＣＤ−ＲＯＭ（Compact Disc Read Only Memory）などの可搬型記録媒体の形態で販売される。また、プログラムをサーバコンピュータの記憶装置に格納しておき、ネットワークを介して、サーバコンピュータから他のコンピュータにそのプログラムを転送することもできる。 When a program is distributed, it is sold in the form of a portable recording medium such as a DVD (Digital Versatile Disc) or a CD-ROM (Compact Disc Read Only Memory) on which the program is recorded. It is also possible to store the program in the storage device of the server computer and transfer the program from the server computer to another computer via the network.

プログラムを実行するコンピュータは、例えば、可搬型記録媒体に記録されたプログラムもしくはサーバコンピュータから転送されたプログラムを、自己の記憶装置に格納する。そして、コンピュータは、自己の記憶装置からプログラムを読み取り、プログラムに従った処理を実行する。なお、コンピュータは、可搬型記録媒体から直接プログラムを読み取り、そのプログラムに従った処理を実行することもできる。また、コンピュータは、サーバコンピュータからプログラムが転送されるごとに、逐次、受け取ったプログラムに従った処理を実行することもできる。 The computer that executes the program stores, for example, the program recorded on the portable recording medium or the program transferred from the server computer in its own storage device. Then, the computer reads the program from its own storage device and executes the processing according to the program. The computer can also read the program directly from the portable recording medium and execute the processing according to the program. In addition, the computer can sequentially execute processing according to the received program each time the program is transferred from the server computer.

上述した実施形態は本発明の好適な実施の例である。但し、これに限定されるものではなく、本発明の要旨を逸脱しない範囲内において種々変形実施可能である。 The embodiments described above are examples of preferred embodiments of the present invention. However, the present invention is not limited to this, and various modifications can be made without departing from the gist of the present invention.

なお、以上の実施形態の説明に関して、更に以下の付記を開示する。
（付記１）文字を含む文書情報から、区切り線、前記文字のフォント、フォントサイズ、前記文字の色の１又は複数の組み合わせに基づいて部分領域を特定し、
特定した前記部分領域から文字を除いた形状を抽出する、
処理をコンピュータに実行させることを特徴とする形状抽出プログラム。
（付記２）特定した前記部分領域に配置された文字に基づいて属性情報を生成し、生成した前記属性情報を抽出した前記形状に対応づけて記憶部に記憶させる、
ことを特徴とする付記１に記載の形状抽出プログラム。
（付記３）前記部分領域が複数特定され、２つの部分領域のそれぞれに含まれる文字が所定の類似関係を満たす場合に、前記２つの部分領域を同じグループとして記憶部に記憶させる、
ことを特徴とする付記１に記載の形状抽出プログラム。
（付記４）前記属性情報の生成に用いられる文字は、フォントサイズが所定のサイズより大きい文字、又は、見出し文字、又は、特定の色に対応する文字である、
ことを特徴とする付記２に記載の形状抽出プログラム。
（付記５）前記文書情報は複数ページの文書情報を含み、特定した前記部分領域が配置されたページに対応するカテゴリ情報、前記部分領域が配置されたページに配置された広告の段数情報、特定した前記部分領域の文字の行数情報の少なくともいずれかが、特定した前記部分領域に対応づけて前記記憶部に記憶される、
ことを特徴とする付記２〜４のいずれかに記載の形状抽出プログラム。
（付記６）前記文書情報は見開きのページ構成を有し、特定した前記部分領域が配置されたページが見開きの右側か、左側かの別を示す情報を前記部分領域に対応づけて前記記憶部に記憶させる、ことを特徴とする付記２〜５のいずれかに記載の形状抽出プログラム。
（付記７）前記文書情報の曜日又は日付情報が、前記部分領域に対応づけて前記記憶部に記憶される、ことを特徴とする付記２〜６のいずれかに記載の形状抽出プログラム。
（付記８）編集対象の文書ページの構成要素として前記記憶部に記憶された情報を選択し、表示することを特徴とする付記２〜７のいずれかに記載の形状抽出プログラム。
（付記９）区切り線で区切られて配置された文字を含む文書情報から、区切り線で区切られた部分領域を特定し、
特定した前記部分領域のデータと、該部分領域から文字を除いた形状とを記憶部に記憶させる、
処理をコンピュータに実行させることを特徴とする形状抽出プログラム。
（付記１０）編集対象の文書ページの構成要素として前記記憶部に記憶された前記部分領域から文字を除いた形状を表示する際に、前記部分領域のデータも併せて表示させる、
処理をコンピュータに実行させることを特徴とする付記９に記載の形状抽出プログラム。
（付記１１）文字を含む文書情報から、区切り線、前記文字のフォント、フォントサイズ、前記文字の色の１又は複数の組み合わせに基づいて部分領域を特定し、
特定した前記部分領域から文字を除いた形状を抽出する、
処理をコンピュータが実行することを特徴とする形状抽出方法。
（付記１２）区切り線で区切られて配置された文字を含む文書情報から、区切り線で区切られた部分領域を特定し、
特定した前記部分領域のデータと、該部分領域から文字を除いた形状とを記憶部に記憶させる、
処理をコンピュータが実行することを特徴とする形状抽出方法。
（付記１３）文字を含む文書情報から、区切り線、前記文字のフォント、フォントサイズ、前記文字の色の１又は複数の組み合わせに基づいて部分領域を特定する特定部と、
特定した前記部分領域から文字を除いた形状を抽出する抽出部と、
を備える形状抽出装置。
（付記１４）特定した前記部分領域に配置された文字に基づいて属性情報を生成し、生成した前記属性情報を抽出した前記形状に対応づけて記憶部に記憶させる生成部、を備える付記１３に記載の形状抽出装置。
（付記１５）前記部分領域が複数特定され、２つの部分領域のそれぞれに含まれる文字が所定の類似関係を満たす場合に、前記２つの部分領域を同じグループとして記憶部に記憶させる生成部、を備える付記１３に記載の形状抽出装置。
（付記１６）前記属性情報の生成に用いられる文字は、フォントサイズが所定のサイズより大きい文字、又は、見出し文字、又は、特定の色に対応する文字である、ことを特徴とする付記１４に記載の形状抽出装置。
（付記１７）区切り線で区切られて配置された文字を含む文書情報から、区切り線で区切られた部分領域を特定する特定部と、
特定した前記部分領域のデータと、該部分領域から文字を除いた形状とを記憶部に記憶させる記憶制御部と、を備える形状抽出装置。 Regarding the description of the above embodiments, the following additional notes will be further disclosed.
(Appendix 1) From the document information including characters, a partial area is specified based on a dividing line, the font of the characters, the font size, and one or a plurality of combinations of the colors of the characters.
Extract the shape excluding characters from the specified partial area,
A shape extraction program characterized by having a computer execute processing.
(Appendix 2) Attribute information is generated based on the characters arranged in the specified partial area, and the generated attribute information is stored in the storage unit in association with the extracted shape.
The shape extraction program according to Appendix 1, characterized in that.
(Appendix 3) When a plurality of the subregions are specified and the characters included in each of the two subregions satisfy a predetermined similarity relationship, the two subregions are stored in the storage unit as the same group.
The shape extraction program according to Appendix 1, characterized in that.
(Appendix 4) The characters used to generate the attribute information are characters whose font size is larger than a predetermined size, heading characters, or characters corresponding to a specific color.
The shape extraction program according to Appendix 2, characterized in that.
(Appendix 5) The document information includes a plurality of pages of document information, category information corresponding to the page on which the specified partial area is arranged, information on the number of stages of advertisement arranged on the page on which the partial area is arranged, and identification. At least one of the line number information of the characters in the partial area is stored in the storage unit in association with the specified partial area.
The shape extraction program according to any one of Supplementary note 2 to 4, wherein the shape extraction program is characterized by the above.
(Appendix 6) The document information has a spread page structure, and the storage unit associates information indicating whether the page on which the specified partial area is arranged is on the right side or the left side of the spread with the partial area. The shape extraction program according to any one of Supplementary notes 2 to 5, wherein the shape extraction program is stored in.
(Supplementary Note 7) The shape extraction program according to any one of Supplementary notes 2 to 6, wherein the day of the week or date information of the document information is stored in the storage unit in association with the partial area.
(Supplementary Note 8) The shape extraction program according to any one of Supplementary notes 2 to 7, wherein the information stored in the storage unit is selected and displayed as a component of the document page to be edited.
(Appendix 9) From the document information including the characters separated by the dividing line, the partial area separated by the dividing line is specified.
The data of the specified partial area and the shape obtained by removing the characters from the partial area are stored in the storage unit.
A shape extraction program characterized by having a computer execute processing.
(Appendix 10) When displaying the shape obtained by removing characters from the partial area stored in the storage unit as a component of the document page to be edited, the data of the partial area is also displayed.
The shape extraction program according to Appendix 9, wherein the processing is executed by a computer.
(Appendix 11) From the document information including characters, a partial area is specified based on a dividing line, a font of the characters, a font size, and one or a plurality of combinations of the colors of the characters.
Extract the shape excluding characters from the specified partial area,
A shape extraction method characterized in that processing is performed by a computer.
(Appendix 12) From the document information including the characters separated by the dividing line, the partial area separated by the dividing line is specified.
The data of the specified partial area and the shape obtained by removing the characters from the partial area are stored in the storage unit.
A shape extraction method characterized in that processing is performed by a computer.
(Appendix 13) A specific portion that specifies a partial area based on a dividing line, a font of the character, a font size, and one or a plurality of combinations of the color of the character from document information including characters.
An extraction unit that extracts the shape excluding characters from the specified partial area,
A shape extractor equipped with.
(Appendix 14) Appendix 13 includes a generation unit that generates attribute information based on the characters arranged in the specified partial area and stores the generated attribute information in the storage unit in association with the extracted shape. The shape extractor described.
(Appendix 15) When a plurality of the subregions are specified and the characters included in each of the two subregions satisfy a predetermined similarity relationship, the generation unit that stores the two subregions as the same group in the storage unit. The shape extraction device according to Appendix 13.
(Supplementary note 16) The character used for generating the attribute information is a character whose font size is larger than a predetermined size, a heading character, or a character corresponding to a specific color. The shape extractor described.
(Appendix 17) From the document information including the characters separated by the dividing line, the specific part that identifies the partial area separated by the dividing line and the specific part.
A shape extraction device including a storage control unit that stores data in the specified partial area and a shape obtained by removing characters from the partial area in a storage unit.

１０情報処理装置（形状抽出装置）
２０ひな形生成部（特定部、抽出部、生成部、記憶制御部）
３２ひな形ＤＢ（記憶部）
９６ストレージ（記憶部） 10 Information processing device (shape extraction device)
20 Model generation unit (specific unit, extraction unit, generation unit, memory control unit)
32 Stationery DB (storage unit)
96 storage (storage unit)

Claims

From the document information including characters, a partial area is specified based on the dividing line, the font of the characters, the font size, and one or more combinations of the colors of the characters.
The shape excluding the characters is extracted from the specified partial area, and the shape is extracted .
The noun extracted by morphological analysis of the characters arranged in the specified partial area is used as attribute information, and the attribute information is stored in the storage unit in association with the extracted shape.
A shape extraction program characterized by having a computer execute processing.

When a plurality of the subregions are specified and the characters included in each of the two subregions satisfy a predetermined similarity relationship, the two subregions are stored in the storage unit as the same group.
The shape extraction program according to claim 1.

The character to be morphologically analyzed is a character whose font size is larger than a predetermined size, a heading character, or a character corresponding to a specific color.
The shape extraction program according to claim 1 .

The document information includes document information of a plurality of pages, category information corresponding to the page on which the specified partial area is arranged, information on the number of stages of advertisement arranged on the page on which the partial area is arranged, and the specified partial area. At least one of the line number information of the character is stored in the storage unit in association with the specified partial area.
The shape extraction program according to any one of claims 1 to 3 , wherein the shape extraction program is characterized in that.

The document information has a spread page structure, and information indicating whether the page on which the specified partial area is arranged is on the right side or the left side of the spread is stored in the storage unit in association with the partial area. The shape extraction program according to any one of claims 1 to 4 , wherein the shape extraction program is characterized in that.

The shape extraction program according to any one of claims 1 to 5 , wherein the day of the week or date information of the document information is stored in the storage unit in association with the partial area.

The shape extraction program according to any one of claims 1 to 6 , wherein information stored in the storage unit is selected and displayed as a component of a document page to be edited.

From the document information including characters, a partial area is specified based on the dividing line, the font of the characters, the font size, and one or more combinations of the colors of the characters.
The shape excluding the characters is extracted from the specified partial area, and the shape is extracted .
The noun extracted by morphological analysis of the characters arranged in the specified partial area is used as attribute information, and the attribute information is stored in the storage unit in association with the extracted shape.
A shape extraction method characterized in that processing is performed by a computer.

A specific part that specifies a partial area based on a dividing line, a font of the character, a font size, and one or a plurality of combinations of the color of the character from the document information including the character.
An extraction unit that extracts the shape excluding characters from the specified partial area,
A storage control unit that uses a noun extracted by morphological analysis of characters arranged in the specified partial area as attribute information and stores the attribute information in the storage unit in association with the extracted shape.
A shape extractor equipped with.