JP6705284B2

JP6705284B2 - Document processor

Info

Publication number: JP6705284B2
Application number: JP2016102046A
Authority: JP
Inventors: 光晴早坂
Original assignee: Konica Minolta Inc
Current assignee: Konica Minolta Inc
Priority date: 2016-05-23
Filing date: 2016-05-23
Publication date: 2020-06-03
Anticipated expiration: 2036-05-23
Also published as: JP2017208053A

Description

本発明は、文書処理装置に関する。 The present invention relates to a document processing device.

一般に、マークアップ言語（例えばＯＯＸＭＬ）で記述されたプレゼンテーション形式の電子文書を閲覧、作成、編集することが可能なアプリケーション（例えばMicrosoft PowerPoint（登録商標））には、各スライドのタイトルや本文テキストのみを抽出して表示・印刷する、所謂「アウトライン表示」・「アウトライン印刷」という機能が備えられている。 Generally, in an application (for example, Microsoft PowerPoint (registered trademark)) capable of browsing, creating, and editing a presentation format electronic document described in a markup language (for example, OOXML), only the title and body text of each slide are included. It has a so-called “outline display”/“outline printing” function for extracting and displaying/printing.

上記のアプリケーションにおいて、各スライドのアウトラインとして認識されるコンテンツは、各スライド共通のテンプレートとしての属性（プレースホルダー）が付加されたコンテンツに限られる。従って、例えば、ユーザーが自由に挿入したテキストは、通常テンプレート属性を有しないため、アウトライン出力の対象として認識されることはない。 In the above application, the content recognized as the outline of each slide is limited to the content to which the attribute (placeholder) as the template common to each slide is added. Therefore, for example, the text freely inserted by the user does not normally have the template attribute and is not recognized as a target for outline output.

即ち、ユーザーが自由に挿入したコンテンツのみで構成されるスライドは、アウトライン出力の際、何も出力されないといった課題が生じる。
また、例えば、アプリケーションが備える各スライド共通のテンプレートに従って作成されたスライドとユーザーが自由に挿入したコンテンツのみで構成されるスライドとが同一ファイル内で混在する場合、アウトライン出力されるスライドとアウトライン出力されないスライドとが混在するため、アウトライン出力全体の統一性、視認性が確保できないといった課題が生じる。 That is, there is a problem that a slide configured only with contents freely inserted by the user outputs nothing at the time of outline output.
Further, for example, when a slide created according to a template common to each slide included in the application and a slide composed only of contents freely inserted by the user are mixed in the same file, the slide output as outline output and the outline output are not performed. Since slides and slides are mixed, there is a problem in that the uniformity and visibility of the entire outline output cannot be ensured.

そこで、行頭に「＃」から始まる特殊記号を付加することで、その特殊記号を目印として見出し行、本体行、副本体行を区別可能に構成し、統一性・視認性が確保されたアウトライン出力を可能とした技術が開示されている（例えば、特許文献１参照）。
また、マークアップ言語のタグとアウトライン表示条件とを１対１で対応させることで、タグ毎にアウトライン表示条件の設定を可能とし、所望するアウトライン出力を可能とした技術が開示されている（例えば、特許文献２参照）。 Therefore, by adding a special symbol starting with "#" at the beginning of the line, the heading line, main body line, and sub-main body line can be distinguished using the special symbol as a mark, and the outline output that ensures uniformity and visibility is provided. A technology that enables the above is disclosed (for example, see Patent Document 1).
Further, there is disclosed a technique in which a tag in a markup language and an outline display condition are associated with each other on a one-to-one basis, whereby the outline display condition can be set for each tag and desired outline output can be performed (for example, , Patent Document 2).

特開平５−１０１０５７号公報JP-A-5-101057 国際公開第２００７／０５２６８０号International Publication No. 2007/052680

しかしながら、上記特許文献１記載の技術では、全ての行の行頭に特殊記号を付加する必要があるため、手間が生じる。また、誤った記号を付加したり記号を付加し忘れたりした場合、所望するアウトライン出力を得られないという課題がある。
また、上記特許文献２記載の技術では、アウトライン出力対象の全てのコンテンツに適切なタグの付加と対応する表示設定とを行う必要があるため、手順が煩雑となり手間が生じる。また、手順を誤ったり忘れたりした場合、所望するアウトライン出力を得られないという課題がある。 However, in the technique described in Patent Document 1, it is necessary to add a special symbol to the beginning of every line, which is troublesome. There is also a problem that a desired outline output cannot be obtained when a wrong symbol is added or a symbol is omitted.
Further, in the technique described in Patent Document 2, it is necessary to add an appropriate tag and display settings corresponding to all the contents to be output as outlines, which makes the procedure complicated and troublesome. In addition, there is a problem that a desired outline output cannot be obtained when the procedure is erroneous or forgotten.

本発明は、ユーザーの手間を省きつつ、アウトライン出力全体の統一性、視認性を確保した所望のアウトライン出力を得ることが可能な文書処理装置を提供することを目的とする。 SUMMARY OF THE INVENTION An object of the present invention is to provide a document processing apparatus capable of obtaining a desired outline output that secures the uniformity and visibility of the entire outline output while saving the user's trouble.

請求項１に記載の発明は、上記目的を達成するためになされたものであり、
文書処理装置において、
マークアップ言語で記述された文書データからアウトライン出力の対象となるコンテンツを対象コンテンツとして抽出する抽出部と、
前記抽出部により抽出された対象コンテンツに基づいてアウトライン出力用データを生成する生成部と、
を備え、
前記抽出部は、前記文書データを構成する各ページにつき、前記対象コンテンツを指定するための記述であるテンプレート属性が付加されているか否かを判定し、前記テンプレート属性が付加されていると判定したページでは、前記テンプレート属性により指定された前記対象コンテンツを抽出し、前記テンプレート属性が付加されていないと判定したページでは、当該ページ内の各コンテンツの属性に基づいて前記対象コンテンツを抽出し、
前記テンプレート属性が付加されていると判定したページが存在しない場合、各ページ内の予めタイトルが入力されている領域として設定されている第１領域に存在するテキストコンテンツの座標位置及びテキストサイズに基づいて、一のテキストコンテンツを前記ページのタイトルとして抽出することを特徴とする。 The invention according to claim 1 was made to achieve the above object,
In the document processing device,
An extraction unit that extracts the content targeted for outline output from the document data described in the markup language as the target content,
A generation unit that generates outline output data based on the target content extracted by the extraction unit;
Equipped with
The extraction unit determines whether or not a template attribute that is a description for designating the target content is added to each page forming the document data, and determines that the template attribute is added. In the page, the target content specified by the template attribute is extracted, and in the page where it is determined that the template attribute is not added, the target content is extracted based on the attribute of each content in the page,
If there is no page determined to have the template attribute added, based on the coordinate position and the text size of the text content existing in the first area set as the area in which the title is input in advance in each page. Then, one text content is extracted as the title of the page .

請求項２に記載の発明は、請求項１に記載の文書処理装置において、
前記抽出部は、前記テンプレート属性が付加されていると判定したページが存在しない場合、各ページ内の第２領域に存在するテキストコンテンツのテキストサイズに基づいて、一又は複数のテキストコンテンツを前記ページの本文として抽出することを特徴とする。
請求項３に記載の発明は、上記目的を達成するためになされたものであり、
文書処理装置において、
マークアップ言語で記述された文書データからアウトライン出力の対象となるコンテンツを対象コンテンツとして抽出する抽出部と、
前記抽出部により抽出された対象コンテンツに基づいてアウトライン出力用データを生成する生成部と、
を備え、
前記抽出部は、前記文書データを構成する各ページにつき、前記対象コンテンツを指定するための記述であるテンプレート属性が付加されているか否かを判定し、前記テンプレート属性が付加されていると判定したページでは、前記テンプレート属性により指定された前記対象コンテンツを抽出し、前記テンプレート属性が付加されていないと判定したページでは、当該ページ内の各コンテンツの属性に基づいて前記対象コンテンツを抽出し、
前記テンプレート属性が付加されていると判定したページが存在しない場合、各ページ内の予め本文が入力されている領域として設定されている第２領域に存在するテキストコンテンツのテキストサイズに基づいて、一又は複数のテキストコンテンツを前記ページの本文として抽出することを特徴とする。 The invention according to claim 2 is the document processing apparatus according to claim 1 ,
When there is no page determined to have the template attribute added, the extraction unit extracts one or more text contents based on the text size of the text contents existing in the second area in each page. It is characterized in that it is extracted as the text of.
The invention described in claim 3 is made to achieve the above object,
In the document processing device,
An extraction unit that extracts the content targeted for outline output from the document data described in the markup language as the target content,
A generation unit that generates outline output data based on the target content extracted by the extraction unit;
Equipped with
The extraction unit determines whether or not a template attribute that is a description for designating the target content is added to each page forming the document data, and determines that the template attribute is added. In the page, the target content specified by the template attribute is extracted, and in the page where it is determined that the template attribute is not added, the target content is extracted based on the attribute of each content in the page,
If there is no page that is determined to have the template attribute added, one of the pages is determined based on the text size of the text content existing in the second area set as the area in which the text is input in advance in each page. Alternatively, a plurality of text contents are extracted as the body of the page.

請求項４に記載の発明は、請求項１〜３のいずれか一項に記載の文書処理装置において、
前記抽出部は、前記テンプレート属性が付加されていないと判定したページでは、前記テンプレート属性が付加されていると判定したページで抽出された対象コンテンツと類似するコンテンツを前記対象コンテンツとして抽出することを特徴とする。 The invention according to claim 4 is the document processing apparatus according to any one of claims 1 to 3 , wherein
In the page which is determined that the template attribute is not added, the extraction unit extracts, as the target content, content similar to the target content extracted in the page where the template attribute is determined to be added. Characterize.

請求項５に記載の発明は、請求項１〜４のいずれか一項に記載の文書処理装置において、
前記生成部により生成されたアウトライン出力用データを文書データに変換して記憶部に記憶させる第１出力部を備えることを特徴とする。 The invention according to claim 5 is the document processing apparatus according to any one of claims 1 to 4 ,
It is characterized by comprising a first output unit for converting the outline output data generated by the generation unit into document data and storing the document data in the storage unit.

請求項６に記載の発明は、請求項１〜５のいずれか一項に記載の文書処理装置において、
前記生成部により生成されたアウトライン出力用データを画像データに変換して画像形成部により用紙に画像を形成させる第２出力部を備えることを特徴とする。 The invention according to claim 6 is the document processing apparatus according to any one of claims 1 to 5 , wherein
It is characterized by comprising a second output section for converting the outline output data generated by the generation section into image data and causing the image forming section to form an image on a sheet.

請求項７に記載の発明は、請求項５又は６に記載の文書処理装置において、
前記第１出力部による文書データの記憶又は前記第２出力部による画像形成の前に、前記アウトライン出力用データのプレビュー画面を表示部に表示させる表示制御部を備えることを特徴とする。 The invention according to claim 7 is the document processing apparatus according to claim 5 or 6 , wherein:
A display control unit is provided for displaying a preview screen of the outline output data on a display unit before the document data is stored by the first output unit or the image is formed by the second output unit.

請求項８に記載の発明は、請求項１〜７のいずれか一項に記載の文書処理装置において、
前記対象コンテンツは、前記文書データを構成する各ページのタイトル及び本文から構成されることを特徴とする。 The invention according to claim 8 is the document processing apparatus according to any one of claims 1 to 7 , wherein
It is characterized in that the target content is composed of a title and a body of each page constituting the document data.

本発明によれば、ユーザーの手間を省きつつ、アウトライン出力全体の統一性、視認性を確保した所望のアウトライン出力を得ることができる。 According to the present invention, it is possible to obtain a desired outline output that secures uniformity and visibility of the entire outline output while saving the user's trouble.

本実施形態に係る画像形成装置の内部構成を示すブロック図である。It is a block diagram showing an internal configuration of an image forming apparatus according to the present embodiment. 本実施形態に係る画像形成装置の基本処理の一例を示すフローチャートである。6 is a flowchart illustrating an example of basic processing of the image forming apparatus according to the present exemplary embodiment. アウトライン抽出処理の一例を示すフローチャートである。It is a flow chart which shows an example of outline extraction processing. タイトル抽出処理の一例を示すフローチャートである。It is a flow chart which shows an example of title extraction processing. タイトル抽出処理の一例を示すフローチャートである。It is a flow chart which shows an example of title extraction processing. タイトル領域及び本文領域の一例を示す図である。It is a figure showing an example of a title field and a text field. 本文抽出処理の一例を示すフローチャートである。It is a flow chart which shows an example of text extraction processing. 本文抽出処理の一例を示すフローチャートである。It is a flow chart which shows an example of text extraction processing. ＯＯＸＭＬ文書をアプリケーション表示した一例を示す図である。It is a figure which shows an example which displayed the OOXML document by the application. 図９（Ａ）のＯＯＸＭＬ文書をマークアップ言語で記述した一例を示す図である。It is a figure which shows an example which described the OOXML document of FIG. 9(A) by markup language. 図９（Ｂ）のＯＯＸＭＬ文書をマークアップ言語で記述した一例を示す図である。It is a figure which shows an example which described the OOXML document of FIG. 9(B) by markup language. 図９のＯＯＸＭＬ文書を対象としたアウトライン出力の実施例を示す図である。FIG. 10 is a diagram showing an example of outline output for the OOXML document of FIG. 9.

以下、本発明の実施の形態について、図面を参照して詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

［１．構成の説明］
以下、図面を参照しながら、この発明の一実施形態について詳しく説明する。図１に、本発明の文書処理装置の実施形態である画像形成装置１の内部構成を示す。
画像形成装置１は、図１に示すように、制御部１０と、記憶部２０と、画像形成部３０と、通信部４０と、操作表示部５０と、を備えて構成されている。 [1. Description of configuration]
Hereinafter, an embodiment of the present invention will be described in detail with reference to the drawings. FIG. 1 shows an internal configuration of an image forming apparatus 1 which is an embodiment of a document processing apparatus of the present invention.
As shown in FIG. 1, the image forming apparatus 1 is configured to include a control unit 10, a storage unit 20, an image forming unit 30, a communication unit 40, and an operation display unit 50.

制御部１０は、ＣＰＵ、ＲＯＭ、ＲＡＭ等により構成される。制御部１０のＣＰＵは、ＲＯＭや記憶部２０に記憶されているシステムプログラムや各種処理プログラムを読み出してＲＡＭに展開し、展開されたプログラムに従って、画像形成装置１各部の動作を集中制御する。 The control unit 10 is composed of a CPU, a ROM, a RAM and the like. The CPU of the control unit 10 reads out the system programs and various processing programs stored in the ROM and the storage unit 20 and expands them in the RAM, and centrally controls the operation of each unit of the image forming apparatus 1 according to the expanded programs.

記憶部２０は、不揮発性の半導体メモリーやＨＤＤ（Hard Disc Drive）等により構成され、画像形成装置本体１００で実行可能なシステムプログラム、当該システムプログラムで実行可能な各種処理プログラム、これら各種処理プログラムを実行する際に使用されるデータ、制御部１１０によって演算処理された処理結果のデータ等を記憶する。
また、記憶部２０は、追加フォント、ジョブデータ等を記憶する。また、記憶部２０は、生成されたアウトライン出力用データからなる文書データを記憶する。 The storage unit 20 is composed of a nonvolatile semiconductor memory, an HDD (Hard Disc Drive), etc., and is a system program executable by the image forming apparatus main body 100, various processing programs executable by the system program, and various processing programs The data used when executing, the data of the processing result calculated by the control unit 110, and the like are stored.
The storage unit 20 also stores additional fonts, job data, and the like. The storage unit 20 also stores the document data including the generated outline output data.

画像形成部３０は、外部装置２から送信されたプリントジョブや原稿読取ユニット（図示省略）から入力された画像データに基づいて、給紙トレイ（図示省略）から供給された記録用紙などの記録媒体に画像形成を行う。画像形成部３０による画像形成の方式としては、電子写真方式、インクジェット方式、熱転写方式など、任意の周知の方法を採用することができる。 The image forming unit 30 is a recording medium such as a recording sheet supplied from a paper feed tray (not shown) based on a print job transmitted from the external device 2 or image data input from a document reading unit (not shown). Image formation. As an image forming method by the image forming unit 30, any well-known method such as an electrophotographic method, an inkjet method, a thermal transfer method can be adopted.

通信部４０は、モデム、ＬＡＮアダプターやルーター等によって構成され、通信ネットワークに接続されたＰＣ等の外部装置２との通信制御を行い、画像データやステータス情報等の受信等を行う。通信ネットワークは、イーサネット（登録商標）によるＬＡＮ（Local Area Network）に限らず、ＷＡＮ（Wide Area Network）、ＦＤＤＩ（Fiber Distributed Data Interface）やトークンリング、無線ＬＡＮ、インターネット回線、或いは公衆ＦＡＸ回線などであってもよい。また、外部装置２がモバイル端末の場合、例えば、ＬＴＥ／３Ｇ等の移動体通信回線、或いはＷｉＭＡＸ等の無線通信回線を介して、ＬＡＮ或いはインターネット回線へとアクセス可能とする構成であってもよい。 The communication unit 40 includes a modem, a LAN adapter, a router, and the like, controls communication with the external device 2 such as a PC connected to the communication network, and receives image data, status information, and the like. The communication network is not limited to a LAN (Local Area Network) by Ethernet (registered trademark), but may be WAN (Wide Area Network), FDDI (Fiber Distributed Data Interface), token ring, wireless LAN, Internet line, or public FAX line. It may be. When the external device 2 is a mobile terminal, the external device 2 may be configured to be accessible to a LAN or an internet line through a mobile communication line such as LTE/3G or a wireless communication line such as WiMAX. ..

操作表示部５０は、例えば、タッチパネル付の液晶ディスプレイ（ＬＣＤ）で構成され、表示部５１及び操作部５２として機能する。
表示部５１は、制御部１０から入力される表示制御信号に従って、各種操作画面、各機能の動作状況等の表示を行う。表示部５１は、例えば、アウトライン出力のプレビューを表示する。また、表示部５１は、ユーザーによるタッチ操作を受け付けて、操作信号を制御部１０に出力する。
操作部５２は、テンキー、スタートキー等の各種操作キーを備え、ユーザーによる各種入力操作を受け付けて、操作信号を制御部１０に出力する。ユーザーは、操作表示部５０を操作して、画質設定、倍率設定、応用設定、出力設定及び用紙設定等の画像形成に関する設定、用紙搬送指示、並びに装置の停止操作などを行うことができる。 The operation display unit 50 is composed of, for example, a liquid crystal display (LCD) with a touch panel, and functions as the display unit 51 and the operation unit 52.
The display unit 51 displays various operation screens, operating conditions of each function, and the like according to a display control signal input from the control unit 10. The display unit 51 displays, for example, a preview of outline output. The display unit 51 also receives a touch operation by the user and outputs an operation signal to the control unit 10.
The operation unit 52 includes various operation keys such as a numeric keypad and a start key, receives various input operations by the user, and outputs operation signals to the control unit 10. A user can operate the operation display unit 50 to perform image forming settings such as image quality setting, magnification setting, application setting, output setting and paper setting, paper conveyance instruction, and operation of stopping the apparatus.

外部装置２は、ユーザー操作等によりプリントするプリントジョブのデータを生成し、通信ネットワークを介して画像形成装置１に送信する。なお、外部装置２としては、例えばＰＣやサーバー装置、ＰＤＡやスマートフォン等のモバイル端末等を適用可能である。 The external device 2 generates print job data to be printed by a user operation or the like, and transmits the data to the image forming device 1 via the communication network. As the external device 2, for example, a PC, a server device, a PDA, a mobile terminal such as a smartphone, or the like can be applied.

［２．動作の説明］
次に、本実施形態に係る画像形成装置１の具体的な動作について説明する。
まず、画像形成装置１の基本処理を、図２のフローチャートを参照して説明する。 [2. Description of operation]
Next, a specific operation of the image forming apparatus 1 according to this embodiment will be described.
First, the basic processing of the image forming apparatus 1 will be described with reference to the flowchart of FIG.

まず、制御部１０は、アウトライン出力対象の文書ファイルの読み込みを行う（ステップＳ１０１）。具体的には、制御部１０は、ユーザー操作によりアウトライン出力対象として指定された文書ファイルの読み込みを行う。
次に、制御部１０は、ステップＳ１０１で読み込みを行った文書ファイルを解析し、アウトライン抽出処理を行う（ステップＳ１０２）。以下、ステップＳ１０２で行われるアウトライン抽出処理について、図３のフローチャートを参照して説明する。 First, the control unit 10 reads a document file for outline output (step S101). Specifically, the control unit 10 reads a document file designated as an outline output target by a user operation.
Next, the control unit 10 analyzes the document file read in step S101 and performs outline extraction processing (step S102). Hereinafter, the outline extraction processing performed in step S102 will be described with reference to the flowchart in FIG.

具体的には、まず、制御部１０は、参照条件に「ＮＵＬＬ」を設定する（ステップＳ２０１）。「ＮＵＬＬ」は、参照条件として特に何も設定しないことを示す値である。 Specifically, first, the control unit 10 sets “NULL” as the reference condition (step S201). "NULL" is a value indicating that nothing is specifically set as a reference condition.

次に、制御部１０は、ステップＳ１０１で読み込みを行った文書ファイル内に、テンプレート属性が付加されたスライドが存在するか否かを判定する（ステップＳ２０２）。
制御部１０は、テンプレート属性が付加されたスライドが存在すると判定した場合（ステップＳ２０２：ＹＥＳ）、参照条件に「テンプレート属性」を設定し（ステップＳ２０３）、ステップＳ２０４へと移行する。
一方、制御部１０は、テンプレート属性が付加されたスライドが存在しないと判定した場合（ステップＳ２０２：ＮＯ）、ステップＳ２０４へと移行する。 Next, the control unit 10 determines whether or not there is a slide to which the template attribute is added in the document file read in step S101 (step S202).
When determining that there is a slide to which the template attribute is added (step S202: YES), the control unit 10 sets "template attribute" in the reference condition (step S203), and proceeds to step S204.
On the other hand, when the control unit 10 determines that there is no slide to which the template attribute is added (step S202: NO), the control unit 10 proceeds to step S204.

ステップＳ２０４において、制御部１０は、文書ファイル内のスライド番号（ページ番号）を示す変数ｎに１を設定する（ステップＳ２０４）。以降、文書ファイル内の最終スライドのスライド番号をＮと表記する。 In step S204, the control unit 10 sets 1 to the variable n indicating the slide number (page number) in the document file (step S204). Hereinafter, the slide number of the final slide in the document file will be referred to as N.

次に、制御部１０は、ｎ番目のスライド内に、“ｔｉｔｌｅ” プレースホルダーが指定（付加）されたテキストコンテンツが存在するか否かを判定する（ステップＳ２０５）。“ｔｉｔｌｅ” プレースホルダーとは、各スライドのタイトルに関するプレースホルダーのことである。即ち、“ｔｉｔｌｅ” プレースホルダーが指定されたテキストコンテンツは、アウトライン出力の際、タイトルとして認識される。
制御部１０は、“ｔｉｔｌｅ” プレースホルダーが指定されたテキストコンテンツが存在すると判定した場合（ステップＳ２０５：ＹＥＳ）、当該テキストコンテンツをタイトルとして抽出し（ステップＳ２０６）、ステップＳ２０８へと移行する。
一方、制御部１０は、“ｔｉｔｌｅ” プレースホルダーが指定されたテキストコンテンツが存在しないと判定した場合（ステップＳ２０５：ＮＯ）、タイトル抽出処理を行う（ステップＳ２０７）。以下、ステップＳ２０７で行われるタイトル抽出処理について、図４及び図５のフローチャートを参照して説明する。 Next, the control unit 10 determines whether or not the text content for which the “title” placeholder is designated (added) exists in the n-th slide (step S205). The "title" placeholder is a placeholder for the title of each slide. That is, the text content for which the "title" placeholder is designated is recognized as a title when the outline is output.
When the control unit 10 determines that the text content in which the "title" placeholder is designated is present (step S205: YES), the control unit 10 extracts the text content as a title (step S206), and proceeds to step S208.
On the other hand, when the control unit 10 determines that the text content in which the “title” placeholder is designated does not exist (step S205: NO), the control unit 10 performs a title extraction process (step S207). The title extraction process performed in step S207 will be described below with reference to the flowcharts of FIGS. 4 and 5.

具体的には、まず、制御部１０は、抽出コンテンツに初期値を設定する（ステップＳ３０１）。ここで、抽出コンテンツとは、タイトル候補として抽出されたコンテンツのことである。なお、設定される初期値は、初期値であること（即ち、抽出されたコンテンツが未だ存在しないこと）が識別可能であればいかなる値であってもよい。 Specifically, first, the control unit 10 sets an initial value in the extracted content (step S301). Here, the extracted content is content extracted as a title candidate. The initial value to be set may be any value as long as it can be identified that it is the initial value (that is, the extracted content does not yet exist).

次に、制御部１０は、プレースホルダーが指定されたコンテンツに類似するコンテンツが抽出されたか否かを示す「isPlaceholderFound」に、未だ抽出されていないことを示す「ｆａｌｓｅ」を設定する（ステップＳ３０２）。 Next, the control unit 10 sets "isPlaceholderFound", which indicates whether or not content similar to the content for which the placeholder is designated, has been extracted, to "false", which indicates that the content has not yet been extracted (step S302). .

次に、制御部１０は、スライド内のコンテンツの番号を示す変数ｎに１を設定する（ステップＳ３０３）。以降、スライド内の最終コンテンツのコンテンツ番号をＮと表記する。 Next, the control unit 10 sets 1 to the variable n indicating the content number in the slide (step S303). Hereinafter, the content number of the final content in the slide will be referred to as N.

次に、制御部１０は、ｎ番目のコンテンツ（以下コンテンツｎ）がテキストコンテンツであるか否かを判定する（ステップＳ３０４）。
制御部１０は、コンテンツｎがテキストコンテンツであると判定した場合（ステップＳ３０４：ＹＥＳ）、ステップＳ３０６へと移行する。
一方、制御部１０は、コンテンツｎがテキストコンテンツでないと判定した場合（ステップＳ３０４：ＮＯ）、変数ｎに１を加算して（ステップＳ３０５）、ステップＳ３０４の処理を繰り返す。 Next, the control unit 10 determines whether the n-th content (hereinafter content n) is a text content (step S304).
When the control unit 10 determines that the content n is the text content (step S304: YES), the control unit 10 moves to step S306.
On the other hand, when the control unit 10 determines that the content n is not the text content (step S304: NO), it adds 1 to the variable n (step S305) and repeats the process of step S304.

ステップＳ３０６において、制御部１０は、参照条件がＮＵＬＬに設定されていない（即ち、参照条件がテンプレート属性に設定されている）か否かを判定する（ステップＳ３０６）。
制御部１０は、参照条件がＮＵＬＬに設定されていないと判定した場合（ステップＳ３０６：ＹＥＳ）、参照条件がテンプレート属性に設定されていると判定し、次のステップＳ３０７へと移行する。
一方、制御部１０は、参照条件がＮＵＬＬに設定されていると判定した場合（ステップＳ３０６：ＮＯ）、ステップＳ３０９へと移行する。 In step S306, the control unit 10 determines whether the reference condition is not set to NULL (that is, the reference condition is set to the template attribute) (step S306).
When determining that the reference condition is not set to NULL (step S306: YES), the control unit 10 determines that the reference condition is set to the template attribute, and proceeds to the next step S307.
On the other hand, when the control unit 10 determines that the reference condition is set to NULL (step S306: NO), the control unit 10 proceeds to step S309.

ステップＳ３０７において、制御部１０は、“ｔｉｔｌｅ”プレースホルダーが指定されたコンテンツの位置とコンテンツｎの位置とのユークリッド距離が閾値Ｄ以下であるか否かを判定する（ステップＳ３０７）。ここで、閾値Ｄは、コンテンツｎが、“ｔｉｔｌｅ”プレースホルダーが指定されたコンテンツと類似する位置に存在すると見做すことができるか否かの境界となる値のことである。なお、閾値Ｄは、固定値であってもよいし、可変であってもよい。
制御部１０は、ユークリッド距離が閾値Ｄ以下であると判定した場合（ステップＳ３０７：ＹＥＳ）、「isPlaceholderFound」に、プレースホルダーが指定されたコンテンツに類似するコンテンツが抽出されたことを示す「ｔｒｕｅ」を設定し（ステップＳ３０８）、ステップＳ３１２へと移行する。
一方、制御部１０は、ユークリッド距離が閾値Ｄを超えていると判定した場合（ステップＳ３０７：ＮＯ）、コンテンツｎはプレースホルダーが指定されたコンテンツと類似しないと判定し、ステップＳ３０９へと移行する。 In step S307, the control unit 10 determines whether the Euclidean distance between the position of the content for which the "title" placeholder is designated and the position of the content n is equal to or less than the threshold value D (step S307). Here, the threshold value D is a value that serves as a boundary as to whether or not the content n can be regarded as existing at a position similar to the content in which the “title” placeholder is designated. The threshold value D may be a fixed value or may be variable.
When the control unit 10 determines that the Euclidean distance is less than or equal to the threshold D (step S307: YES), “isPlaceholderFound” indicates that content similar to the content for which the placeholder is designated is extracted as “true”. Is set (step S308), and the process proceeds to step S312.
On the other hand, when the control unit 10 determines that the Euclidean distance exceeds the threshold value D (step S307: NO), the control unit 10 determines that the content n is not similar to the content for which the placeholder is designated, and proceeds to step S309. ..

ステップＳ３０９において、制御部１０は、コンテンツｎの位置がタイトル領域内であるか否かを判定する（ステップＳ３０９）。ここで、タイトル領域とは、各スライドにおいて、タイトルが入力されていると判定する領域として、予め設定されている領域のことである。なお、タイトル領域以外の領域は、本文が入力されていると判定する領域である本文領域として設定される。即ち、各スライドは、図６に示すように、タイトル領域Ｅ１と、本文領域Ｅ２と、により構成されている。タイトル領域Ｅ１及び本文領域Ｅ２は、固定されていてもよいし、可変であってもよい。
制御部１０は、コンテンツｎの位置がタイトル領域内であると判定した場合（ステップＳ３０９：ＹＥＳ）、次のステップＳ３１０へと移行する。
一方、制御部１０は、コンテンツｎの位置がタイトル領域外であると判定した場合（ステップＳ３０９：ＮＯ）、当該コンテンツｎはタイトルの候補ではないと判定し、ステップＳ３１３へと移行する。 In step S309, the control unit 10 determines whether the position of the content n is within the title area (step S309). Here, the title area is an area set in advance as an area for determining that a title is input in each slide. An area other than the title area is set as a body area, which is an area in which it is determined that the body has been input. That is, each slide is composed of a title area E1 and a body area E2, as shown in FIG. The title area E1 and the body area E2 may be fixed or variable.
When determining that the position of the content n is within the title area (step S309: YES), the control unit 10 moves to the next step S310.
On the other hand, when the control unit 10 determines that the position of the content n is outside the title area (step S309: NO), the control unit 10 determines that the content n is not a title candidate, and proceeds to step S313.

ステップＳ３１０において、制御部１０は、コンテンツｎのテキストサイズと、抽出コンテンツのテキストサイズと、を比較する（ステップＳ３１０）。なお、抽出コンテンツに初期値が設定されている場合には、コンテンツｎのテキストサイズが抽出コンテンツのテキストサイズよりも大きいと判定する。
制御部１０は、コンテンツｎのテキストサイズが抽出コンテンツのテキストサイズよりも大きいと判定した場合（ステップＳ３１０：＞）、当該コンテンツｎはタイトルの候補であると判定し、ステップＳ３１２へと移行する。
また、制御部１０は、コンテンツｎのテキストサイズと抽出コンテンツのテキストサイズとが同一であると判定した場合（ステップＳ３１０：＝＝）、次のステップＳ３１１へと移行する。
一方、制御部１０は、コンテンツｎのテキストサイズが抽出コンテンツのテキストサイズよりも小さいと判定した場合（ステップＳ３１０：＜）、当該コンテンツｎはタイトルの候補ではないと判定し、ステップＳ３１３へと移行する。 In step S310, the control unit 10 compares the text size of the content n with the text size of the extracted content (step S310). Note that when the initial value is set in the extracted content, it is determined that the text size of the content n is larger than the text size of the extracted content.
When the control unit 10 determines that the text size of the content n is larger than the text size of the extracted content (step S310:>), the control unit 10 determines that the content n is a title candidate, and proceeds to step S312.
When the control unit 10 determines that the text size of the content n is the same as the text size of the extracted content (step S310:==), the control unit 10 proceeds to the next step S311.
On the other hand, when the control unit 10 determines that the text size of the content n is smaller than the text size of the extracted content (step S310:<), the control unit 10 determines that the content n is not a title candidate, and proceeds to step S313. To do.

ステップＳ３１１において、制御部１０は、コンテンツｎの位置が抽出コンテンツよりも左上であるか否かを判定する（ステップＳ３１１）。
制御部１０は、コンテンツｎの位置が抽出コンテンツよりも左上であると判定した場合（ステップＳ３１１：ＹＥＳ）、当該コンテンツｎはタイトルの候補であると判定し、次のステップＳ３１２へと移行する。
一方、制御部１０は、コンテンツｎの位置が抽出コンテンツよりも左上でないと判定した場合（ステップＳ３１１：ＮＯ）、当該コンテンツｎはタイトルの候補ではないと判定し、ステップＳ３１３へと移行する。 In step S311, the control unit 10 determines whether the position of the content n is at the upper left of the extracted content (step S311).
When the control unit 10 determines that the position of the content n is at the upper left of the extracted content (step S311: YES), the control unit 10 determines that the content n is a title candidate, and proceeds to the next step S312.
On the other hand, when the control unit 10 determines that the position of the content n is not at the upper left of the extracted content (step S311: NO), the control unit 10 determines that the content n is not a title candidate, and proceeds to step S313.

ステップＳ３１２において、制御部１０は、抽出コンテンツにコンテンツｎを設定する（ステップＳ３１２）。 In step S312, the control unit 10 sets the content n as the extracted content (step S312).

ステップＳ３１３において、制御部１０は、「isPlaceholderFound」に「ｔｒｕｅ」が設定されているか否かを判定する（ステップＳ３１３）。
制御部１０は、「isPlaceholderFound」に「ｔｒｕｅ」が設定されていると判定した場合（ステップＳ３１３：ＹＥＳ）、ステップＳ３１２で抽出コンテンツに設定されたコンテンツｎをタイトルとして抽出し、図３のステップＳ２０８へと移行する。
一方、制御部１０は、「isPlaceholderFound」に「ｔｒｕｅ」が設定されていない（即ち、「ｆａｌｓｅ」が設定されている）と判定した場合（ステップＳ３１３：ＮＯ）、次のステップＳ３１４へと移行する。 In step S313, the control unit 10 determines whether "true" is set in "isPlaceholderFound" (step S313).
When the control unit 10 determines that “true” is set in “isPlaceholderFound” (step S313: YES), the content n set in the extracted content in step S312 is extracted as a title, and step S208 in FIG. Transition to.
On the other hand, when the control unit 10 determines that “true” is not set in “isPlaceholderFound” (that is, “false” is set) (step S313: NO), the process proceeds to the next step S314. ..

ステップＳ３１４において、制御部１０は、変数ｎに、最終コンテンツのコンテンツ番号であるＮが設定されているか否かを判定する（ステップＳ３１４）。
制御部１０は、変数ｎにＮが設定されていると判定した場合（ステップＳ３１４：ＹＥＳ）、ステップＳ３１２で抽出コンテンツに設定されたコンテンツｎをタイトルとして抽出し、図３のステップＳ２０８へと移行する。
一方、制御部１０は、変数ｎにＮが設定されていないと判定した場合（ステップＳ３１４：ＮＯ）、変数ｎに１を加算して（ステップＳ３１５）、ステップＳ３０４へと移行する。 In step S314, the control unit 10 determines whether the variable n is set to N, which is the content number of the final content (step S314).
When the control unit 10 determines that N is set in the variable n (step S314: YES), the control unit 10 extracts the content n set as the extraction content in step S312 as a title, and proceeds to step S208 in FIG. To do.
On the other hand, when the control unit 10 determines that N is not set in the variable n (step S314: NO), 1 is added to the variable n (step S315), and the process proceeds to step S304.

図３のステップＳ２０８において、制御部１０は、ｎ番目のスライド内に、“ｔｉｔｌｅ” プレースホルダー以外のプレースホルダー（以下“ｂｏｄｙ” プレースホルダー）が指定されたテキストコンテンツが存在するか否かを判定する（ステップＳ２０８）。“ｂｏｄｙ” プレースホルダーとは、各スライドの本文に関するプレースホルダーのことである。即ち、“ｂｏｄｙ” プレースホルダーが指定されたテキストコンテンツは、アウトライン出力の際、本文として認識される。
制御部１０は、“ｂｏｄｙ” プレースホルダーが指定されたテキストコンテンツが存在すると判定した場合（ステップＳ２０８：ＹＥＳ）、当該テキストコンテンツを本文として抽出し（ステップＳ２０９）、ステップＳ２１１へと移行する。
一方、制御部１０は、“ｂｏｄｙ” プレースホルダーが指定されたテキストコンテンツが存在しないと判定した場合（ステップＳ２０８：ＮＯ）、本文抽出処理を行う（ステップＳ２１０）。以下、ステップＳ２１０で行われる本文抽出処理について、図７及び図８のフローチャートを参照して説明する。 In step S208 of FIG. 3, the control unit 10 determines whether or not there is a text content in which a placeholder other than the “title” placeholder (hereinafter, “body” placeholder) is designated in the n-th slide. Yes (step S208). A "body" placeholder is a placeholder for the body of each slide. That is, the text content for which the "body" placeholder is designated is recognized as the body when the outline is output.
When the control unit 10 determines that the text content in which the “body” placeholder is designated exists (step S208: YES), the control unit 10 extracts the text content as the body (step S209) and proceeds to step S211.
On the other hand, when the control unit 10 determines that the text content in which the "body" placeholder is designated does not exist (step S208: NO), the control unit 10 performs a body text extraction process (step S210). Hereinafter, the text extraction process performed in step S210 will be described with reference to the flowcharts of FIGS. 7 and 8.

具体的には、まず、制御部１０は、抽出コンテンツに初期値を設定する（ステップＳ４０１）。ここで、抽出コンテンツとは、本文候補として抽出されたコンテンツのことである。
次のステップＳ４０２〜ステップＳ４０６の処理は、タイトル抽出処理について示す図４のステップＳ３０２〜ステップＳ３０６の処理と同様であるので、説明を省略する。 Specifically, first, the control unit 10 sets an initial value in the extracted content (step S401). Here, the extracted content is content extracted as a body text candidate.
The processing of the next steps S402 to S406 is the same as the processing of steps S302 to S306 of FIG. 4 showing the title extraction processing, and therefore the description thereof will be omitted.

ステップＳ４０７において、制御部１０は、“ｂｏｄｙ”プレースホルダーが指定されたコンテンツの位置とコンテンツｎの位置とのユークリッド距離が閾値Ｄ以下であるか否かを判定する（ステップＳ４０７）。ここで、閾値Ｄは、コンテンツｎが、“ｂｏｄｙ”プレースホルダーが指定されたコンテンツと類似する位置に存在すると見做すことができるか否かの境界となる値のことである。
制御部１０は、ユークリッド距離が閾値Ｄ以下であると判定した場合（ステップＳ４０７：ＹＥＳ）、抽出コンテンツに初期値を設定して抽出コンテンツの設定を初期化する（ステップＳ４０８）。次いで、制御部１０は、「isPlaceholderFound」に「ｔｒｕｅ」を設定し（ステップＳ４０９）、ステップＳ４１２へと移行する。
一方、制御部１０は、ユークリッド距離が閾値Ｄを超えていると判定した場合（ステップＳ４０７：ＮＯ）、コンテンツｎはプレースホルダーが指定されたコンテンツと類似しないと判定し、ステップＳ４１０へと移行する。 In step S407, the control unit 10 determines whether or not the Euclidean distance between the position of the content for which the "body" placeholder is designated and the position of the content n is equal to or less than the threshold D (step S407). Here, the threshold value D is a value that becomes a boundary as to whether or not it can be considered that the content n exists at a position similar to the content in which the "body" placeholder is designated.
When the control unit 10 determines that the Euclidean distance is equal to or less than the threshold value D (step S407: YES), it sets an initial value to the extracted content and initializes the setting of the extracted content (step S408). Next, the control unit 10 sets “true” in “isPlaceholderFound” (step S409), and proceeds to step S412.
On the other hand, when the control unit 10 determines that the Euclidean distance exceeds the threshold value D (step S407: NO), the control unit 10 determines that the content n is not similar to the content for which the placeholder is designated, and proceeds to step S410. ..

ステップＳ４１０において、制御部１０は、コンテンツｎの位置が本文領域（図６参照）内であるか否かを判定する（ステップＳ４１０）。
制御部１０は、コンテンツｎの位置が本文領域内であると判定した場合（ステップＳ４１０：ＹＥＳ）、次のステップＳ４１１へと移行する。
一方、制御部１０は、コンテンツｎの位置が本文領域外であると判定した場合（ステップＳ４１０：ＮＯ）、当該コンテンツｎは本文の候補ではないと判定し、ステップＳ４１３へと移行する。 In step S410, the control unit 10 determines whether or not the position of the content n is within the text area (see FIG. 6) (step S410).
When the control unit 10 determines that the position of the content n is within the text area (step S410: YES), the control unit 10 moves to the next step S411.
On the other hand, when the control unit 10 determines that the position of the content n is outside the body text area (step S410: NO), the control unit 10 determines that the content n is not a body text candidate, and proceeds to step S413.

ステップＳ４１１において、制御部１０は、コンテンツｎのテキストサイズが閾値Ｓ以上であるか否かを判定する（ステップＳ４１１）。ここで、閾値Ｓは、コンテンツｎが、本文の候補と見做すことができるか否かの境界となる値のことである。なお、閾値Ｓは、固定値であってもよいし、可変であってもよい。
制御部１０は、コンテンツｎのテキストサイズが閾値Ｓ以上であると判定した場合（ステップＳ４１１：ＹＥＳ）、当該コンテンツｎは本文の候補であると判定し、次のステップＳ４１２へと移行する。
一方、制御部１０は、コンテンツｎのテキストサイズが閾値Ｓ未満であると判定した場合（ステップＳ４１１：ＮＯ）、当該コンテンツｎは本文の候補ではないと判定し、ステップＳ４１３へと移行する。 In step S411, the control unit 10 determines whether the text size of the content n is equal to or larger than the threshold value S (step S411). Here, the threshold value S is a value serving as a boundary of whether or not the content n can be regarded as a text body candidate. The threshold value S may be a fixed value or may be variable.
When the control unit 10 determines that the text size of the content n is equal to or larger than the threshold S (step S411: YES), the control unit 10 determines that the content n is a body text candidate and moves to the next step S412.
On the other hand, when the control unit 10 determines that the text size of the content n is smaller than the threshold value S (step S411: NO), the control unit 10 determines that the content n is not a text body candidate, and proceeds to step S413.

ステップＳ４１２において、制御部１０は、コンテンツｎを抽出コンテンツに追加する（ステップＳ４１２）。即ち、本文抽出処理において、抽出対象となる本文は複数であってもよいため、抽出コンテンツにコンテンツｎを設定（更新）する処理ではなく、都度追加する処理が行われる。 In step S412, the control unit 10 adds the content n to the extracted content (step S412). That is, in the text extraction process, since there may be a plurality of texts to be extracted, the process of adding each time is performed instead of the process of setting (updating) the content n in the extracted content.

ステップＳ４１３において、制御部１０は、「isPlaceholderFound」に「ｔｒｕｅ」が設定されているか否かを判定する（ステップＳ４１３）。
制御部１０は、「isPlaceholderFound」に「ｔｒｕｅ」が設定されていると判定した場合（ステップＳ４１３：ＹＥＳ）、ステップＳ４１２で抽出コンテンツに追加されたコンテンツｎを本文として抽出し、図３のステップＳ２１１へと移行する。
一方、制御部１０は、「isPlaceholderFound」に「ｔｒｕｅ」が設定されていない（即ち、「ｆａｌｓｅ」が設定されている）と判定した場合（ステップＳ４１３：ＮＯ）、次のステップＳ４１４へと移行する。 In step S413, the control unit 10 determines whether "true" is set in "isPlaceholderFound" (step S413).
When the control unit 10 determines that “true” is set in “isPlaceholderFound” (step S413: YES), the content n added to the extracted content in step S412 is extracted as the text, and step S211 in FIG. 3 is performed. Transition to.
On the other hand, when the control unit 10 determines that “true” is not set in “isPlaceholderFound” (that is, “false” is set) (step S413: NO), the process proceeds to the next step S414. ..

ステップＳ４１４において、制御部１０は、変数ｎにＮが設定されているか否かを判定する（ステップＳ４１４）。
制御部１０は、変数ｎにＮが設定されていると判定した場合（ステップＳ４１４：ＹＥＳ）、ステップＳ４１２で抽出コンテンツに追加された全てのコンテンツｎを本文として抽出し、図３のステップＳ２１１へと移行する。
一方、制御部１０は、変数ｎにＮが設定されていないと判定した場合（ステップＳ４１４：ＮＯ）、変数ｎに１を加算して（ステップＳ４１５）、ステップＳ４０４へと移行する。 In step S414, the control unit 10 determines whether N is set in the variable n (step S414).
When the control unit 10 determines that N is set in the variable n (step S414: YES), the control unit 10 extracts all the contents n added to the extracted contents in step S412 as the text, and proceeds to step S211 in FIG. And transition.
On the other hand, when the control unit 10 determines that N is not set in the variable n (step S414: NO), 1 is added to the variable n (step S415), and the process proceeds to step S404.

制御部１０は、上記のステップＳ２０５〜ステップＳ２１０の処理により、アウトライン出力の対象となるコンテンツ（対象コンテンツ）を抽出することができる。即ち、制御部１０は、本発明の抽出部として機能する。 The control unit 10 can extract the content (target content) that is the target of outline output by the processing of steps S205 to S210 described above. That is, the control unit 10 functions as the extraction unit of the present invention.

図３のステップＳ２１１において、制御部１０は、上記のステップＳ２０５〜ステップＳ２１０の処理で抽出されたコンテンツ（タイトル及び本文）に基づいて、アウトライン出力のレイアウトを設定する処理（アウトライン出力レイアウト処理）を行う（ステップＳ２１１）。 In step S211 of FIG. 3, the control unit 10 performs a process (outline output layout process) of setting an outline output layout based on the content (title and body) extracted in the processes of steps S205 to S210. Perform (step S211).

次に、制御部１０は、変数ｎに、最終スライドのスライド番号であるＮが設定されているか否かを判定する（ステップＳ２１２）。
制御部１０は、変数ｎにＮが設定されていると判定した場合（ステップＳ２１２：ＹＥＳ）、図２のステップＳ１０３へと移行する。
一方、制御部１０は、変数ｎにＮが設定されていないと判定した場合（ステップＳ２１２：ＮＯ）、変数ｎに１を加算して（ステップＳ２１３）、ステップＳ２０５へと移行する。 Next, the control unit 10 determines whether or not N, which is the slide number of the final slide, is set in the variable n (step S212).
When the control unit 10 determines that the variable n is set to N (step S212: YES), the control unit 10 proceeds to step S103 in FIG.
On the other hand, when the control unit 10 determines that N is not set in the variable n (step S212: NO), it adds 1 to the variable n (step S213), and proceeds to step S205.

制御部１０は、上記のアウトライン抽出処理により、アウトライン出力の対象として抽出されたコンテンツ（タイトル及び本文）に基づいて、アウトライン出力用データを生成することができる。即ち、制御部１０は、本発明の生成部として機能する。 The control unit 10 can generate outline output data based on the content (title and body) extracted as an outline output target by the outline extraction process. That is, the control unit 10 functions as the generation unit of the present invention.

図２のステップＳ１０３において、制御部１０は、上記のアウトライン抽出処理により生成されたアウトライン出力用データに基づいて、アウトライン出力のプレビュー画面を表示部５１に表示させる（ステップＳ１０３）。即ち、制御部１０は、本発明の表示制御部として機能する。なお、アウトライン出力のプレビュー画面の表示先は、表示部５１に限らず、例えば、外部装置２の表示部であってもよい。 In step S103 of FIG. 2, the control unit 10 displays a preview screen of outline output on the display unit 51 based on the outline output data generated by the outline extraction process (step S103). That is, the control unit 10 functions as the display control unit of the present invention. The display destination of the outline output preview screen is not limited to the display unit 51, and may be the display unit of the external device 2, for example.

次に、制御部１０は、ユーザー操作によりアウトライン出力の実行が指示されたか否かを判定する（ステップＳ１０４）。例えば、ステップＳ１０３で表示部５１に表示されたアウトライン出力のプレビュー画面において、アウトライン出力の実行を指示する操作が行われた場合にはアウトライン出力の実行が指示されたと判定し、アウトライン出力の実行をキャンセルする操作が行われた場合にはアウトライン出力の実行が指示されていないと判定する。なお、アウトライン出力の出力方法としては、文書ファイルの出力又は画像形成出力が挙げられる。
制御部１０は、アウトライン出力の実行が指示されたと判定した場合（ステップＳ１０４：ＹＥＳ）、次のステップＳ１０５へと移行する。
一方、制御部１０は、アウトライン出力の実行が指示されていないと判定した場合（ステップＳ１０４：ＮＯ）、処理を終了する。 Next, the control unit 10 determines whether or not an instruction to execute outline output is given by a user operation (step S104). For example, in the outline output preview screen displayed on the display unit 51 in step S103, when an operation to instruct execution of outline output is performed, it is determined that execution of outline output has been instructed, and execution of outline output is executed. When the cancel operation is performed, it is determined that execution of outline output has not been instructed. Note that the output method of the outline output may be output of a document file or image forming output.
When determining that the execution of the outline output is instructed (step S104: YES), the control unit 10 proceeds to the next step S105.
On the other hand, when the control unit 10 determines that the execution of the outline output is not instructed (step S104: NO), the process ends.

ステップＳ１０５において、制御部１０は、出力方法が文書ファイルの出力であるか否かを判定する（ステップＳ１０５）。
制御部１０は、出力方法が文書ファイルの出力であると判定した場合（ステップＳ１０５：ＹＥＳ）、生成されたアウトライン出力用データを文書ファイルに変換して記憶部２０に記憶させる処理（ファイル保存処理）を行い（ステップＳ１０６）、処理を終了する。即ち、制御部１０は、本発明の第１出力部として機能する。
一方、制御部１０は、出力方法が文書ファイルの出力でない（即ち、画像形成出力である）と判定した場合（ステップＳ１０５：ＮＯ）、生成されたアウトライン出力用データを画像データに変換して画像形成部３０により用紙に画像を形成させる処理（画像形成処理）を行い（ステップＳ１０７）、処理を終了する。即ち、制御部１０は、本発明の第２出力部として機能する。 In step S105, the control unit 10 determines whether or not the output method is output of a document file (step S105).
When the control unit 10 determines that the output method is the output of the document file (step S105: YES), the control unit 10 converts the generated outline output data into a document file and stores it in the storage unit 20 (file saving process). ) Is performed (step S106), and the process ends. That is, the control unit 10 functions as the first output unit of the present invention.
On the other hand, when the control unit 10 determines that the output method is not the output of the document file (that is, the image formation output) (step S105: NO), the generated outline output data is converted into the image data and the image is output. A process of forming an image on a sheet (image forming process) is performed by the forming unit 30 (step S107), and the process ends. That is, the control unit 10 functions as the second output unit of the present invention.

図９に、ＯＯＸＭＬ文書をアプリケーション表示した図の一例を示す。図９（Ａ）は、ＯＯＸＭＬ文書の１番目のスライドであり、タイトル、本文ともプレースホルダーが指定されている。図９（Ｂ）は、ＯＯＸＭＬ文書の２番目のスライドであり、タイトル、本文ともプレースホルダーは指定されていない。 FIG. 9 shows an example of a diagram in which an OOXML document is displayed as an application. FIG. 9A is the first slide of the OOXML document, and placeholders are designated for both the title and the text. FIG. 9B is the second slide of the OOXML document, and the placeholder is not designated for the title and the text.

図１０Ａ及び図１０Ｂに、図９のＯＯＸＭＬ文書をマークアップ言語で記述した図の一例を示す。図１０Ａは、図９（Ａ）のスライドをマークアップ言語で記述した図の一例である。図１０Ｂは、図９（Ｂ）のスライドをマークアップ言語で記述した図の一例である。
ＯＯＸＭＬ文書の１番目のスライドには、図１０Ａに示すように、各スライドのタイトルに関するプレースホルダーである“ｔｉｔｌｅ” プレースホルダーＭ１と、各スライドの本文に関するプレースホルダーである“ｂｏｄｙ” プレースホルダーＭ２と、が指定されている。
一方、ＯＯＸＭＬ文書の２番目のスライドには、図１０Ｂに示すように、“ｔｉｔｌｅ” プレースホルダーＭ１及び“ｂｏｄｙ” プレースホルダーＭ２のいずれも指定されていない。 10A and 10B show an example of a diagram in which the OOXML document of FIG. 9 is described in a markup language. FIG. 10A is an example of a diagram in which the slide of FIG. 9A is described in a markup language. FIG. 10B is an example of a diagram in which the slide of FIG. 9B is described in a markup language.
As shown in FIG. 10A, the first slide of the OOXML document includes a “title” placeholder M1 which is a placeholder for the title of each slide, and a “body” placeholder M2 which is a placeholder for the body of each slide. , Are specified.
On the other hand, in the second slide of the OOXML document, as shown in FIG. 10B, neither the “title” placeholder M1 nor the “body” placeholder M2 is designated.

図１１に、図９のＯＯＸＭＬ文書を対象としたアウトライン出力の実施例を示す。図１１（Ａ）は、従来のアプリケーションを利用したアウトライン出力の一例である。図１１（Ｂ）は、本発明を利用したアウトライン出力の一例である。
従来例では、図１１（Ａ）に示すように、タイトル、本文ともプレースホルダーが指定された１番目のスライドは、タイトル、本文とも正常にアウトライン出力されるものの、タイトル、本文ともプレースホルダーが指定されていない２番目のスライドは、タイトル、本文ともアウトライン出力することができない。
一方、本発明では、タイトル、本文ともプレースホルダーが指定された１番目のスライドを参照して、タイトル、本文ともプレースホルダーが指定されていない２番目のスライドに対しタイトル抽出処理（図４及び図５参照）及び本文抽出処理を行うことで、アウトライン出力するタイトル及び本文を抽出することができる。従って、本発明によれば、図１１（Ｂ）に示すように、タイトル、本文ともプレースホルダーが指定された１番目のスライドのみならず、タイトル、本文ともプレースホルダーが指定されていない２番目のスライドにおいても、タイトル、本文を抽出して正常にアウトライン出力することができる。 FIG. 11 shows an example of outline output for the OOXML document of FIG. FIG. 11A is an example of outline output using a conventional application. FIG. 11B is an example of the outline output using the present invention.
In the conventional example, as shown in FIG. 11A, the first slide in which the placeholder is specified for both the title and the body is normally outline-outputted for both the title and the body, but the placeholder is specified for both the title and the body. The second slide that is not displayed cannot output outlines of both the title and text.
On the other hand, in the present invention, referring to the first slide in which the placeholder is specified for both the title and the text, the title extraction process is performed on the second slide for which the placeholder is not specified for the title and the text (see FIG. 4 and FIG. 5)) and text extraction processing, the title and text to be output in outline can be extracted. Therefore, according to the present invention, as shown in FIG. 11B, not only the first slide in which the placeholder is specified for the title and the text, but also the second slide in which the placeholder is not specified for the title and the text. Also in the slide, it is possible to extract the title and the body and output the outline normally.

［３．効果］
以上のように、本実施形態に係る文書処理装置（画像形成装置１）は、マークアップ言語で記述された文書データ（文書ファイル）からアウトライン出力の対象となるコンテンツを対象コンテンツとして抽出する抽出部（制御部１０）と、抽出部により抽出された対象コンテンツに基づいてアウトライン出力用データを生成する生成部（制御部１０）と、を備える。また、抽出部は、対象コンテンツがマークアップ言語により指定されている場合、指定された対象コンテンツを抽出し、対象コンテンツがマークアップ言語により指定されていない場合、文書データ内の各コンテンツの属性に基づいて対象コンテンツを抽出する。ここで、各コンテンツの属性とは、各コンテンツの座標位置やテキストサイズ等のことである。
従って、本実施形態に係る文書処理装置によれば、ユーザーによる各種の設定操作を行うことなく自動的にアウトライン出力の対象となるコンテンツを抽出することができるので、ユーザーの手間を省きつつ、アウトライン出力全体の統一性、視認性を確保した所望のアウトライン出力を得ることができる。 [3. effect]
As described above, the document processing apparatus (image forming apparatus 1) according to the present embodiment extracts the content, which is the target of outline output, from the document data (document file) described in the markup language as the target content. (Control unit 10) and a generation unit (control unit 10) that generates outline output data based on the target content extracted by the extraction unit. In addition, the extraction unit extracts the specified target content when the target content is specified by the markup language, and when the target content is not specified by the markup language, the extraction unit sets the attribute of each content in the document data. Based on this, the target content is extracted. Here, the attribute of each content is the coordinate position, text size, etc. of each content.
Therefore, according to the document processing apparatus of the present embodiment, it is possible to automatically extract the content that is the target of outline output without performing various setting operations by the user. It is possible to obtain a desired outline output that secures the uniformity and visibility of the entire output.

また、本実施形態に係る文書処理装置によれば、抽出部は、文書データを構成する各ページにつきテンプレート属性が付加されているか否かを判定し、テンプレート属性が付加されていると判定したページでは、テンプレート属性に基づいて対象コンテンツを抽出する。
従って、本実施形態に係る文書処理装置によれば、テンプレート属性の有無により対象コンテンツの抽出方法を選択することができるので、適切に対象コンテンツを抽出することができる。 Further, according to the document processing apparatus according to the present embodiment, the extraction unit determines whether or not a template attribute is added to each page forming document data, and the page determined to have the template attribute is added. Then, the target content is extracted based on the template attribute.
Therefore, according to the document processing apparatus of the present embodiment, the extraction method of the target content can be selected depending on the presence or absence of the template attribute, so that the target content can be appropriately extracted.

また、本実施形態に係る文書処理装置によれば、抽出部は、テンプレート属性が付加されていると判定したページが存在しない場合、各ページ内の第１領域（タイトル領域Ｅ１）に存在するテキストコンテンツの座標位置及びテキストサイズに基づいて、一のテキストコンテンツをページのタイトルとして抽出する。
従って、本実施形態に係る文書処理装置によれば、文書データ中にテンプレート属性が付加されていると判定したページが存在しない場合であっても、各ページのタイトルを抽出することができるので、より確実にアウトライン出力全体の統一性、視認性を確保した所望のアウトライン出力を得ることができる。 Further, according to the document processing apparatus according to the present embodiment, when the page determined to have the template attribute added does not exist, the extraction unit extracts the text existing in the first area (title area E1) in each page. One text content is extracted as the title of the page based on the coordinate position of the content and the text size.
Therefore, according to the document processing apparatus according to the present embodiment, the title of each page can be extracted even if there is no page determined to have the template attribute added to the document data. It is possible to more reliably obtain the desired outline output that secures the uniformity and visibility of the entire outline output.

また、本実施形態に係る文書処理装置によれば、抽出部は、テンプレート属性が付加されていると判定したページが存在しない場合、各ページ内の第２領域（本文領域Ｅ２）に存在するテキストコンテンツのテキストサイズに基づいて、一又は複数のテキストコンテンツをページの本文として抽出する。
従って、本実施形態に係る文書処理装置によれば、文書データ中にテンプレート属性が付加されていると判定したページが存在しない場合であっても、各ページの本文を抽出することができるので、より確実にアウトライン出力全体の統一性、視認性を確保した所望のアウトライン出力を得ることができる。 Further, according to the document processing apparatus according to the present embodiment, if the page determined to have the template attribute added does not exist in the extraction unit, the text existing in the second area (text area E2) in each page is extracted. Based on the text size of the content, one or more text contents are extracted as the body of the page.
Therefore, according to the document processing apparatus according to the present embodiment, the text of each page can be extracted even when there is no page determined to have the template attribute added in the document data. It is possible to more reliably obtain the desired outline output that secures the uniformity and visibility of the entire outline output.

また、本実施形態に係る文書処理装置によれば、抽出部は、テンプレート属性が付加されていないと判定したページでは、テンプレート属性が付加されていると判定したページで抽出された対象コンテンツと類似するコンテンツを対象コンテンツとして抽出する。
従って、本実施形態に係る文書処理装置によれば、文書データ中にテンプレート属性が付加されていると判定したページとテンプレート属性が付加されていないと判定したページとが混在する場合であっても、全てのページにおいてテンプレート属性に基づく対象コンテンツの抽出を行うことができるので、アウトライン出力全体の統一性、視認性を十分に確保することができる。 Further, according to the document processing apparatus of the present embodiment, the extraction unit determines that a page determined to have no template attribute is similar to the target content extracted on the page determined to have the template attribute. The content to be extracted is extracted as the target content.
Therefore, according to the document processing apparatus according to the present embodiment, even when the page determined to have the template attribute added to the document data and the page determined to have the template attribute not added are mixed. Since the target content can be extracted on all pages based on the template attribute, it is possible to sufficiently secure the uniformity and visibility of the outline output as a whole.

また、本実施形態に係る文書処理装置は、生成部により生成されたアウトライン出力用データを文書データに変換して記憶部２０に記憶させる第１出力部（制御部１０）を備える。
従って、本実施形態に係る文書処理装置によれば、元の文書データとは別にアウトライン出力用の文書データを併せて保管することができるので、文書データの解析時のみならずユーザーが所望するタイミングでアウトライン出力を実施することができる。 The document processing apparatus according to the present embodiment also includes a first output unit (control unit 10) that converts the outline output data generated by the generation unit into document data and stores the document data in the storage unit 20.
Therefore, according to the document processing apparatus according to the present embodiment, the document data for outline output can be stored together with the original document data, so that the timing desired by the user can be obtained not only when the document data is analyzed. Outline output can be performed.

また、本実施形態に係る文書処理装置は、生成部により生成されたアウトライン出力用データを画像データに変換して画像形成部３０により用紙に画像を形成させる第２出力部（制御部１０）を備える。
従って、本実施形態に係る文書処理装置によれば、アウトライン出力全体の統一性、視認性を確保したアウトライン印刷を実施することができるので、ユーザーが所望する印刷物を得ることができる。 The document processing apparatus according to the present embodiment further includes a second output unit (control unit 10) that converts the outline output data generated by the generation unit into image data and causes the image forming unit 30 to form an image on a sheet. Prepare
Therefore, according to the document processing apparatus of the present embodiment, it is possible to carry out outline printing while ensuring uniformity and visibility of the outline output as a whole, and it is possible to obtain a printed matter desired by the user.

また、本実施形態に係る文書処理装置は、第１出力部による文書データの記憶又は第２出力部による画像形成の前に、アウトライン出力用データのプレビュー画面を表示部５１に表示させる表示制御部（制御部１０）を備える。
従って、本実施形態に係る文書処理装置によれば、ユーザーにアウトライン出力のイメージを提示することでアウトライン出力を続行するかキャンセルするかを自由に選択させることができるので、ユーザーにとって所望するアウトライン出力を得ることができる。 Further, the document processing apparatus according to the present embodiment causes the display unit 51 to display the preview screen of the outline output data on the display unit 51 before the document data is stored by the first output unit or the image is formed by the second output unit. (Control unit 10) is provided.
Therefore, according to the document processing apparatus of the present embodiment, it is possible to freely select whether to continue or cancel the outline output by presenting the image of the outline output to the user. Can be obtained.

また、本実施形態に係る文書処理装置によれば、対象コンテンツは、文書データを構成する各ページのタイトル及び本文から構成される。
従って、本実施形態に係る文書処理装置によれば、アウトライン出力のベースとなる各ページのタイトル及び本文を抽出することができるので、ユーザーにとって不足感のないアウトライン出力を得ることができる。 Further, according to the document processing apparatus according to the present embodiment, the target content is composed of the title and the body of each page forming the document data.
Therefore, according to the document processing apparatus according to the present embodiment, the title and the text of each page, which is the basis of outline output, can be extracted, and thus outline output can be obtained that the user does not feel deficient.

以上、本発明に係る実施形態に基づいて具体的に説明したが、本発明は上記実施形態に限定されるものではなく、その要旨を逸脱しない範囲で変更可能である。 The specific description has been given above based on the embodiment according to the present invention, but the present invention is not limited to the above-described embodiment and can be modified without departing from the gist thereof.

例えば、上記実施形態では、図７及び図８に示す本文抽出処理において、“ｂｏｄｙ”プレースホルダーが指定されたコンテンツの位置とコンテンツｎの位置とのユークリッド距離が閾値Ｄ以下であると判定した場合（ステップＳ４０７：ＹＥＳ）、ステップＳ４０８で抽出コンテンツの設定を初期化した後にコンテンツｎを抽出コンテンツに追加する（ステップＳ４１２）ことで、追加された一のコンテンツｎのみを本文として抽出するようにしているが、これに限定されるものではない。例えば、ステップＳ４０８の処理を行わないようにすることで、ステップＳ４１２で追加された全てのコンテンツｎを本文として抽出するようにしてもよい。 For example, in the above-described embodiment, when it is determined that the Euclidean distance between the position of the content for which the “body” placeholder is designated and the position of the content n is equal to or less than the threshold value D in the text extraction process shown in FIGS. (Step S407: YES), after the setting of the extracted content is initialized in step S408, the content n is added to the extracted content (step S412) so that only the added one content n is extracted as the text. However, the present invention is not limited to this. For example, by not performing the process of step S408, all the contents n added in step S412 may be extracted as the body.

また、上記実施形態では、本発明の文書処理装置として、画像形成装置１を例示して説明しているが、これに限定されるものではない。例えば、本発明の抽出部や生成部、その他の機能（表示制御部、第１出力部、第２出力部等）を外部装置２に備えることで、外部装置２を本発明の文書処理装置とすることも可能である。 Further, in the above embodiment, the image forming apparatus 1 is described as an example of the document processing apparatus of the present invention, but the present invention is not limited to this. For example, by providing the external device 2 with the extraction unit and the generation unit of the present invention, and other functions (display control unit, first output unit, second output unit, etc.), the external device 2 becomes the document processing device of the present invention. It is also possible to do so.

その他、文書処理装置を構成する各装置の細部構成及び各装置の細部動作に関しても、本発明の趣旨を逸脱することのない範囲で適宜変更可能である。 In addition, the detailed configuration of each device constituting the document processing device and the detailed operation of each device can be appropriately changed without departing from the scope of the present invention.

１画像形成装置（文書処理装置）
１０制御部（抽出部、生成部、表示制御部、第１出力部、第２出力部）
２０記憶部
３０画像形成部
４０通信部
５０操作表示部
５１表示部
５２操作部
２外部装置 1 Image forming device (document processing device)
10 control unit (extraction unit, generation unit, display control unit, first output unit, second output unit)
20 storage unit 30 image forming unit 40 communication unit 50 operation display unit 51 display unit 52 operation unit 2 external device

Claims

An extraction unit that extracts the content targeted for outline output from the document data described in the markup language as the target content,
A generation unit that generates outline output data based on the target content extracted by the extraction unit;
Equipped with
The extraction unit determines whether or not a template attribute that is a description for designating the target content is added to each page forming the document data, and determines that the template attribute is added. In the page, the target content specified by the template attribute is extracted, and in the page where it is determined that the template attribute is not added, the target content is extracted based on the attribute of each content in the page,
If there is no page determined to have the template attribute added, based on the coordinate position and the text size of the text content existing in the first area set as the area in which the title is input in advance in each page. A document processing apparatus , wherein one text content is extracted as a title of the page .

When there is no page determined to have the template attribute added, the extraction unit extracts one or more text contents based on the text size of the text contents existing in the second area in each page. The document processing apparatus according to claim 1 , wherein the document processing apparatus extracts the text as a body of the document.

An extraction unit that extracts the content targeted for outline output from the document data described in the markup language as the target content,
A generation unit that generates outline output data based on the target content extracted by the extraction unit;
Equipped with
The extraction unit determines whether or not a template attribute that is a description for designating the target content is added to each page forming the document data, and determines that the template attribute is added. In the page, the target content specified by the template attribute is extracted, and in the page where it is determined that the template attribute is not added, the target content is extracted based on the attribute of each content in the page,
If there is no page that is determined to have the template attribute added, one of the pages is determined based on the text size of the text content existing in the second area set as the area in which the text is input in advance in each page. Alternatively, a document processing apparatus, wherein a plurality of text contents are extracted as a body of the page.

In the page which is determined that the template attribute is not added, the extraction unit extracts, as the target content, content similar to the target content extracted in the page where the template attribute is determined to be added. The document processing apparatus according to claim 1, wherein the document processing apparatus is a document processing apparatus.

The document processing apparatus according to any one of claims 1-4, characterized in that it comprises a first output unit to be stored in the storage unit and converts the generated outline output data in the document data by the generating unit ..

To any one of claims 1 to 5, characterized in that a second output unit for forming an image generated outline output data on paper by the image forming unit converts the image data by the generating unit Document processing device described.

Wherein the first output section prior to image formation by stored or the second output portion of the document data by claim 5, characterized in that it comprises a display control unit for displaying a preview screen of the outline output data to the display unit Alternatively, the document processing apparatus according to item 6 .

The target content, the document processing apparatus according to any one of claims 1 to 7, characterized in that they are composed of the title and body of each page constituting the document data.