WO2014168199A1 - 論理演算方法および情報処理装置 - Google Patents
論理演算方法および情報処理装置 Download PDFInfo
- Publication number
- WO2014168199A1 WO2014168199A1 PCT/JP2014/060386 JP2014060386W WO2014168199A1 WO 2014168199 A1 WO2014168199 A1 WO 2014168199A1 JP 2014060386 W JP2014060386 W JP 2014060386W WO 2014168199 A1 WO2014168199 A1 WO 2014168199A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- logical operation
- sets
- record
- records
- basket
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 27
- 230000010365 information processing Effects 0.000 title claims description 19
- 238000007796 conventional method Methods 0.000 description 7
- 238000005192 partition Methods 0.000 description 6
- 238000010586 diagram Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000008030 elimination Effects 0.000 description 2
- 238000003379 elimination reaction Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000001174 ascending effect Effects 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
- G06F16/254—Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
Definitions
- the present invention relates to a logical operation technique for large-scale data (Big Data).
- the present invention has been made in view of the above circumstances, and an object thereof is to provide a technique for efficiently performing a logical operation between a plurality of sets in large-scale data.
- each set to be logically operated is classified into a common section having a size that can be arranged in a memory, and a logical operation is performed on the memory for each section.
- the common category is set so that all records in each set can be classified without duplication.
- the logical operation result is obtained by calculating the direct sum of the logical operation results for each section. Note that the size of the common category is determined so that the classified records can be expanded in the memory.
- each record constituting the set is classified into predetermined categories for each set, and between records belonging to the same category, Perform a logical operation between the sets, obtain an operation result, calculate a direct sum of the operation results for each of the sections, and the section can uniquely classify all the records belonging to the plurality of sets.
- a logical operation method between sets is provided.
- An information processing apparatus that performs a logical operation between a plurality of sets, wherein each record constituting the set belongs to the same category as a classification unit that classifies each record into a predetermined category for each set.
- a logical operation unit that performs a logical operation between the sets and obtains an operation result between records
- a direct sum unit that calculates a direct sum of the operation results for each of the sections, and the section includes the plurality of the plurality of records.
- the computer can classify all records belonging to a plurality of sets into categories that can be uniquely categorized, classifying means for classifying the records for each set, and between records belonging to the same category between the predetermined sets.
- a program for causing a logical operation means for performing a logical operation and obtaining an operation result, and a direct sum means for calculating a direct sum of the operation results for each of the sections.
- FIG. 1 It is a block diagram of an information processor of an embodiment of the present invention. It is explanatory drawing for demonstrating the example of an ordered set of embodiment of this invention. It is explanatory drawing for demonstrating the conventional logic operation process. It is explanatory drawing for demonstrating the conventional logic operation process. It is explanatory drawing for demonstrating the outline
- FIG. 1 is a block diagram of the information processing apparatus 100 of this embodiment.
- the information processing apparatus 100 of this embodiment includes a CPU 110, a memory 120, a storage device 130, an input device 140, and an output device 150. Further, a network interface (NWIF) 170 and an external storage device 160 may be further provided.
- NWIF network interface
- the storage device 130 stores a plurality of ordered sets 131 from which duplicate records are excluded. Each ordered set 131 is obtained by searching records held in one database 300 using predetermined items and extracting the search results.
- the database 300 is held in the external storage device 160 and other information processing devices 180 and other external storage devices 190 connected to the information processing device 100 via the network 171 or the like.
- FIG. 2 shows an example of the database 300 and each ordered set 131.
- three ordered sets 131A, 131B, and 131C extracted from the database 300 having three items are shown.
- the ordered sets are referred to as 131 when there is no need to distinguish them.
- the database 300 includes three items of age (Age), region (Area), and possession point (Point), and is composed of one or more records having at least one item value.
- the items in the database 300 are not limited to these, and can take various item types.
- the item value may be any numerical value, character string, full text, or the like that can be searched.
- the number assigned to the left side of each record constituting the database 300 is a record number (recNo.) Uniquely assigned to each record.
- the record number is information representing a position where each record is stored in the database 300 represented as tabular data. This record number is given, for example, when the database 300 is created. Each record can be accessed by specifying a record number.
- the record number is an address that does not consume the storage area.
- the database 300 may not be held in one storage area. It may be distributed and stored in a plurality of storage devices. For example, in the example of the database 300, records with record numbers 0 to 3 are stored in the storage device 130 of the information processing apparatus 100, records 4 to 6 are stored in the external storage device 160, and records 7 to 10 are stored. May be stored in the storage device of the information processing apparatus 180. Alternatively, the age database may be stored in the storage device 130, the regional database may be stored in the external storage device 160, and the possession point database may be stored in the external storage device 190.
- the ordered set 131 is a set of information for searching records satisfying a predetermined condition using a predetermined item as a key in the database 300 and specifying a record obtained as a result.
- a record number is used as a set of information for specifying a record.
- search results are often obtained so that item values are in ascending order or descending order as shown in FIG. For this reason, the record numbers stored in the ordered set 131 are randomly arranged.
- the order of elements for example, (1, 2, 3) or (3, 2, 1) is not distinguished.
- the order of the set of this embodiment is not required as a result of the set operation, it often has an order when it is generated.
- the order of the elements, (1, 2, 3) and (3, 2, 1) are distinguished from each other.
- the set extracted from the database 300 is a set including an order, and is called an ordered set (Ordered Set).
- the ordered set 131 ⁇ / b> A is a record in which a record having an item “Age” value of 10 or more is extracted from the database 300 and the record number is stored.
- the ordered set 131A holds record numbers of 3, 2, 5, 8, 4, and 10 in this order.
- the ordered set 131B is a record in which the record having the item “area” value “South” or “West” is extracted from the database 300 and the record number is stored.
- the ordered set 131B holds record numbers 10, 2, 8, and 9 in this order.
- the ordered set 131C is a record in which a record having a value of “owned point (Point)” of 10 or more is extracted from the database 300 and the record number is stored.
- the ordered set 131C holds record numbers 7, 1, 4, 3, 0, and 8 in this order.
- an ID for uniquely identifying each record in the database 300 may be assigned, and the ID may be stored in the ordered set 131 instead of the record number.
- the ID requires a storage area.
- the CPU 110 realizes a function as an arithmetic unit 210 (see FIG. 6 described later) that executes a logical operation between the ordered sets 131 according to a program stored in the storage device 130 in advance.
- the arithmetic unit 210 loads the ordered set 131 into the memory 120 and performs the logical operation. Note that data necessary when the arithmetic unit 210 executes a logical operation, data generated during execution of the logical operation, and the like are stored in the memory 120 and / or the storage device 130.
- each ordered set 131A, 131B, and 131C will be described only by the ordered set A, B, and C and the last alphabetic character, respectively.
- the above logical operation expression only English letters are used.
- the logical operation is described as A ⁇ (B + C). The same applies to other sets.
- the size of each record constituting the ordered sets A, B, and C is expanded (loaded).
- the possible size of the memory 120 is 6 (each 2 records of each ordered set A, B, C).
- each ordered set A, B, and C in which record values are randomly arranged is mechanically divided from the top to generate a divided ordered set 132 of two records. Then, the logical operation is executed between the respective divided order sets 132, the sum of the results is generated, the duplicate value elimination operation for eliminating duplicate values is performed, and the result is output. At this time, the logical operation needs to be performed for all combinations of the respective divided order sets 132.
- the ordered set A is divided into three divided ordered sets 132 (Aa, Ab, Ac) every two records.
- the ordered set B is divided into two divided ordered sets 132 (Ba, Bb).
- the ordered set C is divided into three divided ordered sets 132 (Ca, Cb, Cc).
- the same division order set 132 is used repeatedly.
- the partition order set Aa is 6 times
- the partition order set Ab is 6 times
- the partition order set Ac is 6 times
- the partition order set Ba is 9 times
- the partition order set Bb is 9 times
- the partition order set Ca is used six times
- the division order set Cb is used six times
- the division order set Cc is used six times.
- the total number of ordered sets is K (K is an integer equal to or greater than 1)
- the number of records in the kth ordered set is Nk (Nk is an integer equal to or greater than 1)
- Nk is an integer equal to or greater than 1
- Mk is an integer equal to or greater than 1
- the number of readings is the product of the number of records in each divided order set 132 and the number of operations. Therefore, in the above example, 6 ⁇ 18 is 108 times. Further, the number of times of writing is the sum of the number of records of the operation result. Therefore, in the above example, in order, 1, 2, 1, 0, 1, 0, 0, 1, 1, 1, 1, 1, 0, 1, 0, 1, 0 times, Times. As described above, in the conventional method, a total of 120 reading and writing processes are performed only by calculation.
- the integration of the obtained results is not a direct sum. That is, a total of 18 calculation results is obtained to obtain a set (2,2,3,2,3,2,8,8,8,4,4), and duplicates for eliminating duplicate values from this set A removal process is performed to obtain (2, 3, 4, 8) as the calculation result.
- FIG. 5 shows an outline of processing by the calculation unit 210 of the present embodiment.
- the records constituting each ordered set 131 are classified (categorized) into common categories.
- a classification for classifying each record is referred to as a basket.
- a logical operation is executed for each basket, and finally a direct sum of the logical operation results of all baskets is calculated.
- the records of each ordered set 131 are divided into sizes that can be arranged in the memory 120, as in the past. However, at this time, it is not mechanically divided by the number of records, but according to the record value, it is classified and classified into one or more predetermined baskets so that records to be distributed do not overlap.
- the arithmetic unit 210 distributes and classifies all records belonging to a plurality of ordered sets 131 to each basket 400, and for each basket 400.
- a logic operation unit 212 that performs a logical operation
- a direct sum unit 213 that calculates a direct sum of the logical operation results of each basket 400.
- the basket 400 is provided on the storage device 130.
- each basket 400 only records satisfying a predetermined condition (sorting condition) are sorted.
- the distribution condition of each basket 400 is set so that all records in the total ordered set 131 can be uniquely distributed (categorized). That is, it is set so as to cover all the records of the total ordered set 131 and classify them without duplication.
- the distribution condition for example, a range of record values, a remainder (remainder) obtained by dividing the record value by a predetermined integer of 2 or more, and the like can be used.
- the distribution conditions and the size and number of baskets 400 are determined in advance. Among them, the size and number of baskets 400 are set according to the values of the records in the total ordered set 131 and the size of the memory 120 used for logical operations. For example, the size of the basket 400 is determined such that the total size of all records classified in the basket 400 does not exceed the size of the memory 120.
- the size of the basket 400 can be up to M / N, where M is the amount of available memory and N is the number of sets.
- the width of the range is determined according to the size of the memory 120.
- the divisor is determined according to the size of the memory 120.
- the number of baskets 400 is small because the number of I / O switching operations is reduced. Note that the minimum number of baskets 400 is N ⁇ T / M when T is the size of the entire set and T is divided by the size M / N of the basket 400.
- the classification process by the classification unit 211 of the present embodiment will be described with a specific example using three ordered sets A, B, and C shown in FIG. 2 as shown in FIG.
- the logical operation executed here is A ⁇ (B + C) as in the conventional method description.
- the distribution condition of each basket 400 is a range of record values. That is, a record whose record value matches the range of record values specified by the distribution condition is distributed to the basket 400.
- the distribution condition of the first basket 401 is that the record value is in the range [0.2], that is, the record value is (0, 1, 2).
- the distribution condition is the same [3..6], that is, the same (3,4,5,6), and the distribution condition of the third basket 403 is the same [7..10], that is, the same (7 , 8, 9, 10).
- the classification unit 211 classifies the records into the baskets 401, 402, and 403 in order of record numbers for each of the ordered sets A, B, and C.
- the first basket 401 [0..2] contains a record with record number 1 and a record value of 2 (hereinafter simply referred to as record 2).
- ..6] include record 3 with record number 0, record 5 with record number 2, and record 4 with record number 5, and third basket 403 [7..10] includes record 8 with record number 3 and The records 10 with the record number 5 are classified.
- the records are classified into the respective baskets 401, 402, and 403.
- FIG. 7C shows a partial ordered set 133 of each ordered set 131 after classification.
- the ordered set A includes a partial ordered set 133 (A401) classified into the first basket 401, a partial ordered set 133 (A402) classified into the second basket 402, and a third Are divided (segmented) into partial order sets 133 (A403) classified into baskets 403.
- the ordered set B is the partial ordered set 133 (B401) and the partial ordered set 133 (B403)
- the ordered set C is the partial ordered set 133 (C401), the partial ordered set 133 (C402), and the partial ordered set 133. (C403).
- the logical operation unit 212 of this embodiment performs a logical operation for each basket. That is, in the example of FIG. 7B, a logical operation is performed between records in the first basket 401, the second basket 402, and the third basket 404.
- the categories of the baskets 400 (401, 402, 403) do not overlap. For this reason, the result of the logical operation in one basket 400 is always separate and independent from that of the other basket 400. For this reason, the direct sum unit 21 of the present embodiment calculates the direct sum of the logical operation results of the baskets 400 (401, 402, 403), and obtains the operation results. In the above example, ((2) + (3,4) + (8,10)) is calculated, and the operation result (2,3,4,8,10) is obtained.
- each record is read from the storage device 130 to the memory 120, and it is determined to which basket 400 the assignment is made, and written in the basket area of the storage device 130. For this reason, it is necessary to read from the storage area 130 to the memory 120 and write from the memory 120 to the storage device 130 several times. In the above example, 16 times are required, 32 times in total.
- the number of times each partial ordered set is used for a logical operation is “proportional” to the size of the total ordered set, and the prior art polynomial order shown in Equation (1) Are fundamentally different, each time.
- the total number of logical operations is three as described above. In these three operations, the number of reads is the product of the number of records in each partial ordered set and the number of logical operations, and the number of writes is 16, and the number of writes is the sum of the number of records of the operation results. The total of 2 times is 5 times. Therefore, the number of reads and writes during the logical operation is 21.
- FIG. 8 illustrates a flow of logical operation processing between the ordered sets 131 by the operation unit 210 of the present embodiment.
- the classification unit 211 scans and classifies the records in each ordered set 131 in order from the top, and distributes them to each basket 400 (step S1101).
- the calculation unit 212 loads records into the memory 120 in units of baskets 400 and performs logical operations (step S1102).
- the logical operation result is stored in the storage device 130 or the like.
- the direct sum unit 213 loads the logical operation result into the memory 120 and calculates the direct sum (step S1103).
- the information processing apparatus 100 is an information processing apparatus 100 that performs a logical operation between a plurality of ordered sets 131, and records each of the ordered sets 131 as the ordered set 131.
- the logical operation is performed between records belonging to the same category (basket) 400 of the classification unit 211 that classifies the category (basket) 400 and each ordered set 131, respectively, and obtains an operation result.
- a logical operation unit 212; and a direct sum unit 213 that calculates a direct sum of the operation results of each of the sections (baskets) 400.
- the section (basket) 400 includes all records belonging to the plurality of ordered sets 131. Are uniquely categorized.
- the records of each ordered set 131 are classified in advance into sections that do not overlap with each other, and logical operations are performed between the sections.
- the size of the section is a size that can be expanded in the memory 120.
- the writing process to the basket 400 and the reading process from the basket 400 at the time of the logical operation are added, but each arithmetic process can be executed only by loading it once onto the memory 120. .
- the number of calculations is also the number of baskets 400. For this reason, unlike the prior art, it is not necessary to perform the calculation for every combination for each division unit. Thus, according to this embodiment, the number of calculations can be reduced. Furthermore, since the number of operations is reduced, the number of accesses to the memory 120 for each operation is also reduced. In addition, deduplication calculation when obtaining the final result is not necessary.
- a logical operation on an ordered set 131 that is created from large-scale data and cannot be expanded on the memory 120 can be efficiently executed at high speed.
- all the ordered sets 131 are classified into the basket 400, but the present invention is not limited to this.
- the ordered set 131 having a predetermined size or less (less than a predetermined number of records) may be configured to be operated as it is without being classified.
- each basket 400 may be constructed on a different information processing apparatus connected by the network 171 or the like.
- each information processing apparatus in which the basket 400 is constructed includes a logical operation unit 212 and performs logical operation on data in the basket 400.
- the classification unit 211 may be configured to scan each ordered set 131 and extract a record to be distributed to the logical operation target basket 400 during the logical operation. In this case, the classification unit 211 scans the ordered set 131 by the number of baskets 400.
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Algebra (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
1)Aa×(Ba+Ca)=(3,2)×(1,2,7,10)=(2)
2)Aa×(Ba+Cb)=(3,2)×(2,3,4,10)=(2,3)
3)Aa×(Ba+Cc)=(3,2)×(0,2,8,10)=(2)
4)Aa×(Bb+Ca)=(3,2)×(1,7,8,9)=()
5)Aa×(Bb+Cb)=(3,2)×(3,4,8,9)=(3)
6)Aa×(Bb+Cc)=(3,2)×(0,8,8,9)=()
7)Ab×(Ba+Ca)=(5,8)×(1,2,7,10)=()
8)Ab×(Ba+Cb)=(5,8)×(2,3,4,10)=(2)
9)Ab×(Ba+Cc)=(5,8)×(0,2,8,10)=(8)
10)Ab×(Bb+Ca)=(5,8)×(1,7,8,9)=(8)
11)Ab×(Bb+Cb)=(5,8)×(3,4,8,9)=(8)
12)Ab×(Bb+Cc)=(5,8)×(0,8,8,9)=(8)
13)Ac×(Ba+Ca)=(4,6)×(1,2,7,10)=()
14)Ac×(Ba+Cb)=(4,6)×(2,3,4,10)=(4)
15)Ac×(Ba+Cc)=(4,6)×(0,2,8,10)=()
16)Ac×(Bb+Ca)=(4,6)×(1,7,8,9)=()
17)Ac×(Bb+Cb)=(4,6)×(3,4,8,9)=(4)
18)Ac×(Bb+Cc)=(4,6)×(0,8,8,9)=()
1)A401×(B401+C401)=(2)×(2,1,0)=(2)
2)A402×(B402+C402)=(3,5,4)×(3,4)=(3,4)
3)A403×(B403+C403)=(8,10)×(7,8,9,10)
=(8,10)
Claims (7)
- 複数の集合間の論理演算方法であって、
前記集合を構成する各レコードを、前記集合毎に、それぞれ予め定めた区分に分類し、
同一の前記区分に属するレコード間で、前記集合間の論理演算を行い、演算結果を得、
前記区分毎の前記演算結果の直和を計算し、
前記区分は、前記複数の集合に属する全レコードを一意に類別可能なものであること
を特徴とする集合間の論理演算方法。 - 請求項1記載の論理演算方法であって、
前記区分のサイズは、前記論理演算を行う際、展開するメモリのサイズに応じて決定されること
を特徴とする論理演算方法。 - 請求項2記載の論理演算方法であって、
前記区分のサイズは、当該区分に分類される全レコードの総サイズが、前記メモリのサイズを超えないよう決定されること
を特徴とする論理演算方法。 - 請求項1から3いずれか1項記載の論理演算方法であって、
前記区分は、前記レコードの値の範囲で定められること
を特徴とする論理演算方法。 - 請求項1から3いずれか1項記載の論理演算方法であって、
前記区分は、予め定めた2以上の整数の剰余で定められること
を特徴とする論理演算方法。 - 複数の集合間の論理演算を行う情報処理装置であって、
前記集合を構成する各レコードを、前記集合毎に、それぞれ予め定めた区分に分類する分類部と、
同一の前記区分に属するレコード間で、前記集合間の論理演算を行い、演算結果を得る論理演算部と、
前記区分毎の前記演算結果の直和を計算する直和部と、を備え、
前記区分は、前記複数の集合に属する全レコードを一意に類別可能なものであること
を特徴とする集合間の情報処理装置。 - コンピュータを、
複数の集合に属する全レコードを一意に類別可能な区分に、当該レコードを前記集合毎に分類する分類手段、
同一の前記区分に属するレコード間で、予め定めた前記集合間の論理演算を行い、演算結果を得る論理演算手段、
前記区分毎の前記演算結果の直和を計算する直和手段、として機能させるためのプログラム。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2015511295A JPWO2014168199A1 (ja) | 2013-04-12 | 2014-04-10 | 論理演算方法および情報処理装置 |
US14/784,202 US20160070776A1 (en) | 2013-04-12 | 2014-04-10 | Logical operation method and information processing device |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2013083879 | 2013-04-12 | ||
JP2013-083879 | 2013-04-12 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014168199A1 true WO2014168199A1 (ja) | 2014-10-16 |
Family
ID=51689603
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2014/060386 WO2014168199A1 (ja) | 2013-04-12 | 2014-04-10 | 論理演算方法および情報処理装置 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20160070776A1 (ja) |
JP (1) | JPWO2014168199A1 (ja) |
WO (1) | WO2014168199A1 (ja) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108256054A (zh) * | 2018-01-15 | 2018-07-06 | 腾讯科技(深圳)有限公司 | 确定目标号码集合的方法和装置 |
WO2020235015A1 (ja) * | 2019-05-21 | 2020-11-26 | 日本電信電話株式会社 | 情報処理装置、情報処理方法及びプログラム |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0462668A (ja) * | 1990-06-29 | 1992-02-27 | Casio Comput Co Ltd | レコード検索方法 |
JP2003036269A (ja) * | 2001-07-23 | 2003-02-07 | Sony Corp | 情報処理装置および情報処理方法並びにこの情報処理のプログラムが記録された記録媒体 |
JP2012108635A (ja) * | 2010-11-16 | 2012-06-07 | Nec Corp | 分散メモリデータベースシステム、フロントデータベースサーバ、データ処理方法およびプログラム |
WO2012164738A1 (ja) * | 2011-06-03 | 2012-12-06 | 株式会社日立製作所 | データベース管理システム、装置及び方法 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9087055B2 (en) * | 2013-01-28 | 2015-07-21 | International Business Machines Corporation | Segmenting documents within a full text index |
-
2014
- 2014-04-10 JP JP2015511295A patent/JPWO2014168199A1/ja active Pending
- 2014-04-10 WO PCT/JP2014/060386 patent/WO2014168199A1/ja active Application Filing
- 2014-04-10 US US14/784,202 patent/US20160070776A1/en not_active Abandoned
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0462668A (ja) * | 1990-06-29 | 1992-02-27 | Casio Comput Co Ltd | レコード検索方法 |
JP2003036269A (ja) * | 2001-07-23 | 2003-02-07 | Sony Corp | 情報処理装置および情報処理方法並びにこの情報処理のプログラムが記録された記録媒体 |
JP2012108635A (ja) * | 2010-11-16 | 2012-06-07 | Nec Corp | 分散メモリデータベースシステム、フロントデータベースサーバ、データ処理方法およびプログラム |
WO2012164738A1 (ja) * | 2011-06-03 | 2012-12-06 | 株式会社日立製作所 | データベース管理システム、装置及び方法 |
Also Published As
Publication number | Publication date |
---|---|
JPWO2014168199A1 (ja) | 2017-02-16 |
US20160070776A1 (en) | 2016-03-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US12056583B2 (en) | Target variable distribution-based acceptance of machine learning test data sets | |
JP6605573B2 (ja) | 並列ディシジョン・ツリー・プロセッサー・アーキテクチャ | |
JP6051212B2 (ja) | 反復データの処理 | |
US10831747B2 (en) | Multi stage aggregation using digest order after a first stage of aggregation | |
CN111258966A (zh) | 一种数据去重方法、装置、设备及存储介质 | |
US11288266B2 (en) | Candidate projection enumeration based query response generation | |
US20070239663A1 (en) | Parallel processing of count distinct values | |
US20240152498A1 (en) | Data storage using vectors of vectors | |
JP2019057082A (ja) | データ検索システム、データ検索方法、及びプログラム | |
US9639073B2 (en) | Information processing apparatus for discriminating between combined results of plurality of elements, program product and method for same | |
CN105830160A (zh) | 用于将经屏蔽数据写入到缓冲器的设备及方法 | |
WO2014168199A1 (ja) | 論理演算方法および情報処理装置 | |
Yin et al. | Content‐Based Image Retrial Based on Hadoop | |
US11734244B2 (en) | Search method and search device | |
US20220066988A1 (en) | Hash suppression | |
CN105205058A (zh) | 数据处理系统和方法 | |
WO2015143708A1 (zh) | 后缀数组的构造方法及装置 | |
Song et al. | Nslpa: A node similarity based label propagation algorithm for real-time community detection | |
JP6577922B2 (ja) | 検索装置、方法、及びプログラム | |
JP6631139B2 (ja) | 検索制御プログラム、検索制御方法および検索サーバ装置 | |
KR101441000B1 (ko) | 병렬처리 기반 트리플 데이터에 대한 변경 탐지 방법 | |
Khan et al. | Scalable compression of a weighted graph | |
US8819086B2 (en) | Naming methodologies for a hierarchical system | |
JP4870732B2 (ja) | 情報処理装置、名寄せ方法及びプログラム | |
JP7106924B2 (ja) | クラスタ分析システム、クラスタ分析方法およびクラスタ分析プログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14783131 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2015511295 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14784202 Country of ref document: US |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 14783131 Country of ref document: EP Kind code of ref document: A1 |