
CN104102475B - Method, apparatus and system for distributed parallel task processing - Google Patents

Method, apparatus and system for distributed parallel task processing

Info

Publication number
CN104102475B
Authority
CN
China
Prior art keywords
data
processing
node
fragmentation
sub
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310125254.1A
Other languages
Chinese (zh)
Other versions
CN104102475A (en)
Inventor
廖龙
秦晓强
答治茜
罗建国
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Tencent Cloud Computing Beijing Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd
Priority to CN201310125254.1A
Publication of CN104102475A
Application granted
Publication of CN104102475B
Active
Anticipated expiration

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

Embodiments of the invention disclose a method, apparatus and system for distributed parallel task processing. The invention relates to the field of computer technology and addresses the problem that prior-art distributed parallel task processing systems are overly complex and therefore slow. The method includes: receiving data to be processed; splitting the data to be processed into multiple data fragments; allocating the data fragments to multiple processing nodes for processing; receiving the sub-result data produced by each processing node; and merging the sub-result data to form result data. The invention is suitable for parallel processing of large volumes of data.

Description

Method, apparatus and system for distributed parallel task processing
Technical field
The present invention relates to the field of computer technology, and in particular to a method, apparatus and system for distributed parallel task processing.
Background
With the development of computer technology, the amount of data that computers and similar devices must process keeps growing. Large volumes of data can now be processed in parallel by multiple computers. Processing such data quickly generally calls for a distributed parallel task processing system: a computer system in which multiple computers that are in different locations, have different functions, or hold different data are connected by a communication network and, under unified management and control, cooperate to complete information processing tasks.
Current distributed parallel task processing systems generally have a control node and multiple processing nodes. The control node receives the data to be processed, first groups and sorts it, and then hands the grouped and sorted data to the processing nodes. Because prior-art distributed parallel task processing always groups and sorts the data to be processed, the overall system is more complex and task processing is slower.
Summary of the invention
Embodiments of the present invention provide a method, apparatus and system for distributed parallel task processing that can solve the problem that prior-art distributed parallel task processing systems are overly complex and therefore slow.
In a first aspect, an embodiment of the present invention provides a method of distributed parallel task processing, including:
Receiving data to be processed;
Splitting the data to be processed into multiple data fragments;
Allocating the data fragments to multiple processing nodes for processing;
Receiving the sub-result data produced by each processing node;
Merging the sub-result data to form result data.
In a second aspect, an embodiment of the present invention provides a method of distributed parallel task processing, including:
Receiving a data fragment sent by a control node; wherein the data fragment is obtained by the control node splitting the data to be processed, and the data to be processed has not been grouped or sorted;
Processing the data in the data fragment to form sub-result data;
Sending the sub-result data to the control node.
In a third aspect, an embodiment of the present invention provides a control node, including:
A receiving unit, configured to receive data to be processed;
A splitting unit, configured to split the data to be processed received by the receiving unit into multiple data fragments;
An allocation unit, configured to allocate the data fragments to multiple processing nodes for processing;
The receiving unit being further configured to receive the sub-result data produced by each processing node;
A merging unit, configured to merge the sub-result data received by the receiving unit to form result data.
In a fourth aspect, an embodiment of the present invention provides a processing node, including:
A receiving unit, configured to receive a data fragment sent by a control node; wherein the data fragment is obtained by the control node splitting the data to be processed, and the data to be processed has not been grouped or sorted;
A processing unit, configured to process the data in the data fragment received by the receiving unit to form sub-result data;
A sending unit, configured to send the sub-result data formed by the processing unit to the control node.
In a fifth aspect, an embodiment of the present invention provides a system for distributed parallel task processing, including a control node and multiple processing nodes, wherein
The control node is configured to receive data to be processed, split the data to be processed into multiple data fragments, and allocate the data fragments to the multiple processing nodes for processing;
Each processing node is configured to receive the data fragment sent by the control node, process the data in the data fragment to form sub-result data, and send the sub-result data to the control node;
The control node is further configured to receive the sub-result data produced by each processing node and merge the sub-result data to form result data.
In the method, apparatus and system for distributed parallel task processing provided by the present invention, the control node receives data to be processed, splits it into multiple data fragments, allocates the fragments to multiple processing nodes for processing, receives the sub-result data produced by each processing node, and merges the sub-result data to form result data. In the prior art, by contrast, the control node must first group and sort the data to be processed after receiving it; in scenarios that need no grouping or sorting, this adds complexity to the whole distributed parallel task processing system and slows task processing. Because the approach provided by the present invention does not group or sort the data to be processed, it can reduce the complexity of the whole system and increase the speed of distributed parallel task processing.
Description of the drawings
To explain the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed to describe them are briefly introduced below. The drawings described below show only some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a first flowchart of the method of distributed parallel task processing provided by an embodiment of the present invention;
Fig. 2 is a second flowchart of the method of distributed parallel task processing provided by an embodiment of the present invention;
Fig. 3 is a flowchart of the method of distributed parallel task processing provided by another embodiment of the present invention;
Fig. 4 is a first structural diagram of the control node provided by an embodiment of the present invention;
Fig. 5 is a second structural diagram of the control node provided by an embodiment of the present invention;
Fig. 6 is a structural diagram of the processing node provided by an embodiment of the present invention;
Fig. 7 is a structural diagram of the system for distributed parallel task processing provided by an embodiment of the present invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in those embodiments. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments that a person of ordinary skill in the art obtains from the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
To make the advantages of the technical solutions of the present invention clearer, the present invention is described in detail below with reference to the drawings and embodiments.
As shown in Fig. 1, the method of distributed parallel task processing provided by an embodiment of the present invention is described from the control node side. The method includes:
101. Receive data to be processed.
In distributed parallel tasks, the volume of data to be processed is generally large, typically more than 1 terabyte (TB), but is not limited to this.
102. Split the data to be processed into multiple data fragments.
The data to be processed may be split according to the number of processing nodes, so that the number of data fragments equals the number of processing nodes. The fragments may each hold the same amount of data, but this is not required.
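A minimal Python sketch of this splitting step, assuming the data to be processed is a list of lines held in memory. The function name `split_into_fragments` is hypothetical, and near-equal contiguous chunks are only one possible splitting rule; the text does not prescribe a particular one.

```python
from typing import List, Sequence


def split_into_fragments(rows: Sequence[str], node_count: int) -> List[List[str]]:
    """Split rows into exactly node_count contiguous fragments of near-equal size."""
    if node_count <= 0:
        raise ValueError("node_count must be positive")
    base, extra = divmod(len(rows), node_count)
    fragments: List[List[str]] = []
    start = 0
    for i in range(node_count):
        # The first `extra` fragments take one additional row each.
        size = base + (1 if i < extra else 0)
        fragments.append(list(rows[start:start + size]))
        start += size
    return fragments


# No grouping or sorting is applied: rows keep their original order.
fragments = split_into_fragments([f"row-{i}" for i in range(10)], node_count=3)
print([len(f) for f in fragments])  # -> [4, 3, 3]
```

The number of fragments always equals `node_count`, which matches the one-fragment-per-node arrangement described above.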
103. Allocate the data fragments to multiple processing nodes for processing.
The data fragments may be allocated according to the load information of each processing node: in each allocation, one of the data fragments is assigned to the processing node with the lowest load. Alternatively, each data fragment may be randomly assigned to a processing node that has not yet obtained a data fragment. The allocation is not limited to these two ways; other ways are possible and are not enumerated here.
104. Receive the sub-result data produced by each processing node.
The sub-result data is formed by the processing node's processing. A processing node may read and process the data fragment it obtained line by line. The lines are independent of one another, so the operation logic on the processing node can be executed on multiple lines at the same time.
105. Merge the sub-result data to form result data.
The control node may merge the sub-result data returned by each processing node to form the result data. The result data may be stored in a database for subsequent data analysis applications.
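The merge-and-store step might look like the following sketch. It assumes each sub-result is a list of strings and that merging is plain concatenation; an in-memory SQLite database stands in for the unspecified database. The names `merge_sub_results`, `store_result` and the `result` table are hypothetical.

```python
import sqlite3
from itertools import chain
from typing import Iterable, List


def merge_sub_results(sub_results: Iterable[List[str]]) -> List[str]:
    """Concatenate the sub-results returned by the processing nodes."""
    return list(chain.from_iterable(sub_results))


def store_result(rows: List[str], conn: sqlite3.Connection) -> None:
    """Persist the merged result so later analysis can query it."""
    conn.execute("CREATE TABLE IF NOT EXISTS result (value TEXT)")
    conn.executemany("INSERT INTO result (value) VALUES (?)", [(r,) for r in rows])
    conn.commit()


conn = sqlite3.connect(":memory:")  # assumption: any database could be used here
merged = merge_sub_results([["alice", "bob"], ["carol"]])
store_result(merged, conn)
count = conn.execute("SELECT COUNT(*) FROM result").fetchone()[0]
print(merged, count)  # -> ['alice', 'bob', 'carol'] 3
```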
In the method of distributed parallel task processing provided by this embodiment of the present invention, the control node receives data to be processed, splits it into multiple data fragments, allocates the fragments to multiple processing nodes for processing, receives the sub-result data produced by each processing node, and merges the sub-result data to form result data. In the prior art, by contrast, the control node must first group and sort the data to be processed after receiving it; in scenarios that need no grouping or sorting, this adds complexity to the whole distributed parallel task processing system and slows task processing. Because the approach provided by the present invention does not group or sort the data to be processed, it reduces the complexity of the whole system and increases the speed of distributed parallel task processing.
The counterpart of the control node side is the processing node side. As shown in Fig. 2, the method of distributed parallel task processing provided by an embodiment of the present invention is described from the processing node side and includes:
201. Receive the data fragment sent by the control node.
The data fragment originates from the data to be processed that the control node received. That data is not grouped or sorted by the control node; the control node splits it directly to form the data fragments.
202. Process the data in the data fragment to form sub-result data.
The processing node may read and process the data fragment it obtained line by line. The lines are independent of one another, so the operation logic on the processing node can be executed on multiple lines at the same time.
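Because the lines are independent, a processing node can apply the same operation to many lines concurrently. The sketch below illustrates this with a thread pool; `process_fragment`, the `handle_row` callback and the worker count are hypothetical choices, not details taken from the text.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List


def process_fragment(rows: List[str],
                     handle_row: Callable[[str], str],
                     workers: int = 4) -> List[str]:
    """Apply handle_row to every line of a fragment; lines do not depend on each other."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in input order even though calls may overlap.
        return list(pool.map(handle_row, rows))


sub_result = process_fragment(["a,1", "b,2", "c,3"], lambda row: row.upper())
print(sub_result)  # -> ['A,1', 'B,2', 'C,3']
```

For CPU-bound work in Python a process pool would be the more usual choice; threads are used here only to keep the sketch short.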
203. Send the sub-result data to the control node.
The purpose of step 203 is that, once the sub-result data produced by each processing node reaches the control node, the control node merges it to form the result data.
In the method of distributed parallel task processing provided by this embodiment of the present invention, the processing node receives a data fragment that the control node obtained by splitting the data to be processed, which has not been grouped or sorted; the processing node processes the data fragment to form sub-result data and then sends the sub-result data to the control node. In the prior art, by contrast, the control node must first group and sort the data to be processed after receiving it; in scenarios that need no grouping or sorting, this adds complexity to the whole distributed parallel task processing system and slows task processing. Because the approach provided by the present invention does not group or sort the data to be processed, it can reduce the complexity of the whole system and increase the speed of distributed parallel task processing.
The methods shown in Fig. 1 and Fig. 2 are described in more detail and further extended below:
As shown in Fig. 3, the method of distributed parallel task processing provided by another embodiment of the present invention includes:
301. The control node receives data to be processed.
In distributed parallel tasks, the volume of data to be processed is generally large, typically more than 1 terabyte (TB), but is not limited to this. For example, the data to be processed may be the login information of an application within one day, including the time each account under the application came online and the time it went offline, but is not limited to this.
302. The control node splits the data to be processed into multiple data fragments according to the number of processing nodes. After step 302, either step 303 or step 304 may be executed.
The data to be processed may be split according to the number of processing nodes, so that the number of data fragments equals the number of processing nodes. The fragments may each hold the same amount of data, but this is not required.
303. The control node randomly assigns one of the data fragments to a processing node that has not yet obtained a data fragment, until all of the data fragments have been assigned. Step 308 is then executed.
To keep the load of any processing node from becoming excessive, the data fragments need to be allocated reasonably. Specifically, the data fragments may be assigned at random, and once a processing node has received a data fragment, it does not receive another fragment of the same data to be processed.
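The random allocation of step 303 can be sketched as drawing nodes without replacement, so that no node receives a second fragment. The function name `assign_randomly` and the node names are hypothetical, and the sketch assumes there are no more fragments than processing nodes.

```python
import random
from typing import Dict, List, Optional


def assign_randomly(fragment_ids: List[int],
                    node_names: List[str],
                    rng: Optional[random.Random] = None) -> Dict[int, str]:
    """Give each fragment to a random node that has not yet obtained a fragment."""
    rng = rng or random.Random()
    if len(fragment_ids) > len(node_names):
        raise ValueError("more fragments than processing nodes")
    free_nodes = list(node_names)  # nodes that have not yet obtained a fragment
    assignment: Dict[int, str] = {}
    for fragment_id in fragment_ids:
        # Removing the chosen node guarantees it is never picked twice.
        node = free_nodes.pop(rng.randrange(len(free_nodes)))
        assignment[fragment_id] = node
    return assignment


assignment = assign_randomly([0, 1, 2], ["node-a", "node-b", "node-c"])
print(assignment)
```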
304. Each processing node sends its own load information to the control node. Steps 305-306 are then executed.
Likewise, to allocate the data fragments reasonably, allocation may instead be based on the load of each processing node. The load information carries the load of the processing node.
305. The control node determines the processing node with the lowest load according to the load information received from each processing node.
Specifically, once the control node has obtained the load information of each processing node, it can identify the processing node with the lowest load, because the load information carries each node's load.
306. The control node assigns one of the data fragments to the processing node with the lowest load. Step 307 is then executed.
In this way, each of the data fragments is assigned to the processing node with the lowest load at the time of its allocation, which distributes the fragments more evenly and keeps the load of the processing nodes balanced.
307. The control node determines whether all of the data fragments have been allocated. If so, step 308 is executed; otherwise, the method returns to step 304.
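Steps 304 to 307 form a loop that can be sketched as follows. The callbacks `report_loads` and `on_assign`, the integer load values, and the rule that each fragment adds 3 units of load are all assumptions made for this toy simulation; the text does not say how load is measured.

```python
from typing import Callable, Dict, List


def assign_by_load(fragment_ids: List[int],
                   report_loads: Callable[[], Dict[str, int]],
                   on_assign: Callable[[str, int], None]) -> Dict[int, str]:
    """Assign fragments one at a time to whichever node currently reports the lowest load."""
    assignment: Dict[int, str] = {}
    for fragment_id in fragment_ids:       # step 307: repeat until all are allocated
        loads = report_loads()             # step 304: nodes report their load
        node = min(loads, key=loads.get)   # step 305: find the lowest load
        on_assign(node, fragment_id)       # step 306: hand over the fragment
        assignment[fragment_id] = node
    return assignment


# Toy simulation: each assigned fragment adds 3 units of load to its node.
current_loads = {"node-a": 2, "node-b": 1, "node-c": 5}


def report_loads() -> Dict[str, int]:
    return dict(current_loads)


def on_assign(node: str, fragment_id: int) -> None:
    current_loads[node] += 3


result = assign_by_load([0, 1, 2], report_loads, on_assign)
print(result)  # -> {0: 'node-b', 1: 'node-a', 2: 'node-b'}
```

Unlike the random allocation of step 303, a node may receive more than one fragment here if it remains the least loaded.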
308. The processing node processes the lines of data in the data fragment line by line to form sub-result data.
The processing node may read and process the data fragment it obtained line by line. The lines are independent of one another, so the operation logic on the processing node can be executed on multiple lines at the same time.
Take as an example the case where the data to be processed is the login information of an application within one day. If the accounts online at a certain moment need to be found, the control node can split the login information into data fragments, and each processing node then filters its fragment using the online time and offline time of each account in the login information to find the accounts online at that moment. Because multiple processing nodes filter at the same time, the accounts online at that moment are found quickly.
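The login example can be sketched as below. The line format `account,online_time,offline_time` with integer timestamps is an assumption, as is treating the session as the half-open interval from the online time to the offline time; the function name `online_accounts` is hypothetical.

```python
from typing import Iterable, List


def online_accounts(rows: Iterable[str], moment: int) -> List[str]:
    """Return the accounts in one fragment whose session covers the given moment."""
    result = []
    for row in rows:
        # Each line is self-contained, so no other line is consulted.
        account, online, offline = row.split(",")
        if int(online) <= moment < int(offline):
            result.append(account)
    return result


# Two fragments, as they might arrive at two processing nodes.
fragment_1 = ["alice,100,200", "bob,150,160"]
fragment_2 = ["carol,90,155", "dave,300,400"]

sub_results = [online_accounts(f, moment=152) for f in (fragment_1, fragment_2)]
merged = [account for sub in sub_results for account in sub]
print(merged)  # -> ['alice', 'bob', 'carol']
```

No grouping or sorting of the login records is needed before splitting, because the filter looks at one line at a time.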
309. The processing node sends the sub-result data to the control node.
310. The control node merges the sub-result data to form result data.
It is worth noting that the control node and the processing nodes in the embodiments of the present invention may each be a computer or other electronic device with computing capability.
In the method of distributed parallel task processing provided by this further embodiment of the present invention, the control node receives data to be processed, splits it into multiple data fragments, allocates the fragments to multiple processing nodes for processing, receives the sub-result data produced by each processing node, and merges the sub-result data to form result data. In the prior art, by contrast, the control node must first group and sort the data to be processed after receiving it; in scenarios that need no grouping or sorting, this adds complexity to the whole distributed parallel task processing system and slows task processing. Because the approach provided by the present invention does not group or sort the data to be processed, it can reduce the complexity of the whole system and increase the speed of distributed parallel task processing.
Corresponding to the methods shown in Fig. 1 and Fig. 3, and as shown in Fig. 4, the control node provided by an embodiment of the present invention includes:
A receiving unit 41, configured to receive data to be processed.
A splitting unit 42, configured to split the data to be processed received by the receiving unit 41 into multiple data fragments.
An allocation unit 43, configured to allocate the data fragments to multiple processing nodes for processing.
The receiving unit 41 is further configured to receive the sub-result data produced by each processing node.
A merging unit 44, configured to merge the sub-result data received by the receiving unit 41 to form result data.
Specifically, as shown in Fig. 5, the splitting unit 42 is configured to:
Split the data to be processed received by the receiving unit 41 into multiple data fragments according to the number of processing nodes.
The number of data fragments equals the number of processing nodes.
Further, as shown in Fig. 5, the allocation unit 43 is further configured to:
Randomly assign one of the data fragments produced by the splitting unit 42 to a processing node that has not yet obtained a data fragment.
Further, as shown in Fig. 5, the control node also includes a determination unit 45.
The receiving unit 41 is further configured to receive the load information of each processing node.
The determination unit 45 is configured to determine the processing node with the lowest load according to the load information received by the receiving unit 41.
The allocation unit 43 is further configured to assign one of the data fragments produced by the splitting unit 42 to the processing node with the lowest load.
It is worth noting that, for the specific implementation of the control node provided by this embodiment of the present invention, reference may be made to the implementation of the method of distributed parallel task processing in Fig. 3; the details are not repeated here. The control node may be a computer or other electronic device with computing capability.
The control node provided by this embodiment of the present invention receives data to be processed, splits it into multiple data fragments, allocates the fragments to multiple processing nodes for processing, receives the sub-result data produced by each processing node, and merges the sub-result data to form result data. In the prior art, by contrast, the control node must first group and sort the data to be processed after receiving it; in scenarios that need no grouping or sorting, this adds complexity to the whole distributed parallel task processing system and slows task processing. Because the approach provided by the present invention does not group or sort the data to be processed, it can reduce the complexity of the whole system and increase the speed of distributed parallel task processing.
Corresponding to the methods shown in Fig. 2 and Fig. 3, and as shown in Fig. 6, the processing node provided by an embodiment of the present invention includes:
A receiving unit 51, configured to receive the data fragment sent by the control node.
The data fragment is obtained by the control node splitting the data to be processed, and the data to be processed has not been grouped or sorted.
A processing unit 52, configured to process the data in the data fragment received by the receiving unit 51 to form sub-result data.
A sending unit 53, configured to send the sub-result data formed by the processing unit 52 to the control node.
It is worth noting that the data fragment includes multiple lines of data.
As shown in Fig. 6, the processing unit 52 is specifically configured to:
Process the lines of data in the data fragment line by line.
Specifically, as shown in Fig. 6, the sending unit 53 is further configured to:
Send load information to the control node. The load information carries the load of the processing node.
It is worth noting that, for the specific implementation of the processing node provided by this embodiment of the present invention, reference may be made to the implementation of the method of distributed parallel task processing in Fig. 3; the details are not repeated here. The processing node may be a computer or other electronic device with computing capability.
The processing node provided by this embodiment of the present invention receives a data fragment that the control node obtained by splitting the data to be processed, which has not been grouped or sorted; the processing node processes the data fragment to form sub-result data and then sends the sub-result data to the control node. In the prior art, by contrast, the control node must first group and sort the data to be processed after receiving it; in scenarios that need no grouping or sorting, this adds complexity to the whole distributed parallel task processing system and slows task processing. Because the approach provided by the present invention does not group or sort the data to be processed, it can reduce the complexity of the whole system and increase the speed of distributed parallel task processing.
As shown in Fig. 7, the system for distributed parallel task processing provided by an embodiment of the present invention includes a control node 61 and multiple processing nodes 62, wherein
The control node 61 is configured to receive data to be processed, split the data to be processed into multiple data fragments, and allocate the data fragments to the multiple processing nodes 62 for processing;
Each processing node 62 is configured to receive the data fragment sent by the control node 61, process the data in the data fragment to form sub-result data, and send the sub-result data to the control node 61;
The control node 61 is further configured to receive the sub-result data produced by each processing node 62 and merge the sub-result data to form result data.
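The interaction between the control node and the processing nodes can be illustrated end to end with a small in-process sketch. Here the "nodes" are plain Python objects and threads stand in for separate machines; the class names `ControlNode` and `ProcessingNode`, the method names, and the chunking rule are all hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List


class ProcessingNode:
    """Processes one data fragment line by line and returns sub-result data."""

    def __init__(self, handle_row: Callable[[str], str]) -> None:
        self.handle_row = handle_row

    def process(self, fragment: List[str]) -> List[str]:
        return [self.handle_row(row) for row in fragment]


class ControlNode:
    """Splits the data, hands one fragment to each node, and merges the sub-results."""

    def __init__(self, nodes: List[ProcessingNode]) -> None:
        self.nodes = nodes

    def run(self, rows: List[str]) -> List[str]:
        n = len(self.nodes)
        size = -(-len(rows) // n)  # ceiling division: rows per fragment
        fragments = [rows[i * size:(i + 1) * size] for i in range(n)]
        with ThreadPoolExecutor(max_workers=n) as pool:
            futures = [pool.submit(node.process, fragment)
                       for node, fragment in zip(self.nodes, fragments)]
            sub_results = [future.result() for future in futures]
        # Merge by concatenation; no grouping or sorting happens at any stage.
        return [row for sub in sub_results for row in sub]


control = ControlNode([ProcessingNode(str.upper) for _ in range(2)])
result = control.run(["a", "b", "c", "d", "e"])
print(result)  # -> ['A', 'B', 'C', 'D', 'E']
```

In a real deployment the fragments and sub-results would travel over a network, which this sketch leaves out.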
It is worth noting that, for the specific implementation of the system for distributed parallel task processing provided by this embodiment of the present invention, reference may be made to the implementation of the method of distributed parallel task processing in Fig. 3; the details are not repeated here.
In the system for distributed parallel task processing provided by this embodiment of the present invention, the control node receives data to be processed, splits it into multiple data fragments, allocates the fragments to multiple processing nodes for processing, receives the sub-result data produced by each processing node, and merges the sub-result data to form result data. In the prior art, by contrast, the control node must first group and sort the data to be processed after receiving it; in scenarios that need no grouping or sorting, this adds complexity to the whole distributed parallel task processing system and slows task processing. Because the approach provided by the present invention does not group or sort the data to be processed, it can reduce the complexity of the whole system and increase the speed of distributed parallel task processing.
From the description of the embodiments above, a person skilled in the art can clearly understand that the present invention may be implemented by software plus the necessary general-purpose hardware, or by hardware alone, although the former is the better implementation in many cases. On this understanding, the essence of the technical solution of the present invention, or the part that contributes over the prior art, may be embodied as a software product. The computer software product is stored in a readable storage medium, such as a computer floppy disk, hard disk or optical disc, and includes instructions that cause a computer device (which may be a personal computer, a server, a network device or the like) to execute the methods described in the embodiments of the present invention.
The above are merely specific embodiments of the present invention, but the protection scope of the present invention is not limited to them. Any change or replacement that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention shall fall within the protection scope of the present invention. The protection scope of the present invention shall therefore be determined by the protection scope of the claims.

Claims (5)

1. A method of distributed parallel task processing, characterized by comprising:
Receiving data to be processed;
Splitting the data to be processed into multiple data fragments according to the number of processing nodes, wherein the number of data fragments is the same as the number of processing nodes, and the data to be processed is not grouped or sorted by the control node but is split directly by the control node to form the data fragments;
Distributing the multiple data fragments to multiple processing nodes respectively for processing, including:
Receiving load information of each processing node;
Determining, according to the load information, the processing node with the lowest load;
Distributing one of the multiple data fragments to the processing node with the lowest load;
Receiving the sub-result data produced by each processing node, wherein the sub-result data is formed after processing by the processing node, the processing node can read and process the data fragment it obtains row by row, and the rows of data are independent of one another, so that the operation logic carried out on the processing node can be executed on multiple rows of data simultaneously;
Merging the sub-result data to form result data.
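The splitting and load-based distribution steps of claim 1 could be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names `split_into_fragments` and `assign_by_load`, the representation of load as one number per node, and the policy of giving each node exactly one fragment in ascending order of load are all hypothetical. The claim itself only states that the fragment count equals the node count and that a fragment goes to the node with the lowest load.

```python
from typing import Dict, List, Sequence


def split_into_fragments(rows: Sequence[str], node_count: int) -> List[List[str]]:
    """Cut rows into node_count contiguous fragments, without grouping or sorting."""
    if node_count <= 0:
        raise ValueError("node_count must be positive")
    base, extra = divmod(len(rows), node_count)
    fragments: List[List[str]] = []
    start = 0
    for i in range(node_count):
        # The first `extra` fragments take one additional row each.
        size = base + (1 if i < extra else 0)
        fragments.append(list(rows[start:start + size]))
        start += size
    return fragments


def assign_by_load(fragments: List[List[str]],
                   loads: Dict[str, float]) -> Dict[str, List[str]]:
    """Hand out one fragment per node, always picking the least loaded free node."""
    if len(fragments) > len(loads):
        raise ValueError("more fragments than processing nodes")
    remaining = dict(loads)  # nodes that have not received a fragment yet
    assignment: Dict[str, List[str]] = {}
    for fragment in fragments:
        # Assumption: load is a single comparable number reported by each node.
        node = min(remaining, key=remaining.get)
        assignment[node] = fragment
        del remaining[node]
    return assignment
```

With seven rows and three nodes, the fragments hold three, two, and two rows, and the first fragment goes to whichever node reported the lowest load.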
2. A method of distributed parallel task processing, characterized by comprising:
Sending its own load information to a control node, wherein the load information carries the load of the processing node;
Receiving a data fragment sent by the control node, wherein the data fragment is obtained by the control node splitting data to be processed, the data to be processed is not grouped or sorted, the data fragment includes multiple rows of data, and the rows of data are independent of one another;
Processing the data in the data fragment to form sub-result data;
Sending the sub-result data to the control node;
The step of processing the data in the data fragment to form sub-result data includes:
Processing the multiple rows of data in the data fragment row by row, the rows of data being independent of one another, so that the operation logic carried out on the processing node can be executed on multiple rows of data simultaneously.
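Because the rows of a fragment are independent of one another, the per-row operation of claim 2 can be applied to several rows at once. The sketch below illustrates this with a thread pool from Python's standard library. The per-row operation `process_row` (counting comma-separated fields) and the use of threads are assumptions chosen for illustration; the claim does not specify the operation or the concurrency mechanism.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List


def process_row(row: str) -> int:
    """Hypothetical per-row operation: count the comma-separated fields."""
    return len(row.split(","))


def process_fragment(rows: Iterable[str],
                     operation: Callable[[str], int] = process_row,
                     max_workers: int = 4) -> List[int]:
    """Apply `operation` to every row of a fragment and return the sub-result.

    No row depends on another, so the rows can be handled concurrently.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Executor.map yields results in the order of the input rows.
        return list(pool.map(operation, rows))
```

Because `Executor.map` returns results in input order, the sub-result lines up row for row with the fragment even though rows may finish at different times.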
3. A control node, characterized by comprising:
A receiving unit, configured to receive data to be processed;
A splitting unit, configured to split the data to be processed received by the receiving unit into multiple data fragments according to the number of processing nodes, wherein the number of data fragments is the same as the number of processing nodes, and the data to be processed is not grouped or sorted by the control node but is split directly by the control node to form the data fragments;
The receiving unit is further configured to receive load information of each processing node;
A determining unit, configured to determine, according to the load information received by the receiving unit, the processing node with the lowest load;
A distributing unit, configured to distribute one of the multiple data fragments produced by the splitting unit to the processing node with the lowest load;
The receiving unit is further configured to receive the sub-result data produced by each processing node, wherein the sub-result data is formed after processing by the processing node, the processing node can read and process the data fragment it obtains row by row, and the rows of data are independent of one another, so that the operation logic carried out on the processing node can be executed on multiple rows of data simultaneously;
A merging unit, configured to merge the sub-result data received by the receiving unit to form result data.
4. A processing node, characterized by comprising:
A sending unit, configured to send its own load information to a control node, wherein the load information carries the load of the processing node;
A receiving unit, configured to receive a data fragment sent by the control node, wherein the data fragment is obtained by the control node splitting data to be processed, the data to be processed is not grouped or sorted, the data fragment includes multiple rows of data, and the rows of data are independent of one another;
A processing unit, configured to process the data in the data fragment received by the receiving unit to form sub-result data;
The sending unit is configured to send the sub-result data formed by the processing unit to the control node;
The processing unit is configured to process the multiple rows of data in the data fragment row by row, the rows of data being independent of one another, so that the operation logic carried out on the processing node can be executed on multiple rows of data simultaneously.
5. A system of distributed parallel task processing, characterized by comprising a control node and multiple processing nodes, wherein:
The control node is configured to receive data to be processed, split the data to be processed into multiple data fragments, and distribute the multiple data fragments to the multiple processing nodes respectively for processing, wherein the data to be processed is not grouped or sorted by the control node but is split directly by the control node to form the data fragments, each data fragment includes multiple rows of data, and the rows of data are independent of one another;
The processing node is configured to receive the data fragment sent by the control node, process the data in the data fragment to form sub-result data, and send the sub-result data to the control node; the processing node can read and process the data fragment it obtains row by row, and the rows of data are independent of one another, so that the operation logic carried out on the processing node can be executed on multiple rows of data simultaneously;
The control node is further configured to receive the sub-result data produced by each processing node and merge the sub-result data to form result data.
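The interaction between the control node and the processing nodes in claim 5 can be simulated in a single process as a rough sketch. Everything here is an assumption made for illustration: the names `run_job` and `processing_node`, the use of threads to stand in for separate machines, and the merge policy of concatenating sub-results in fragment order. The claim states only that sub-result data is merged into result data and does not fix the merge rule.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Sequence


def processing_node(fragment: List[str], operation: Callable[[str], str]) -> List[str]:
    """Stand-in for one processing node: handle its fragment row by row."""
    return [operation(row) for row in fragment]


def run_job(rows: Sequence[str], node_count: int,
            operation: Callable[[str], str]) -> List[str]:
    """Stand-in for the control node: split, dispatch, collect, and merge."""
    # Split directly into node_count fragments, with no grouping or sorting.
    size = -(-len(rows) // node_count)  # ceiling division
    fragments = [list(rows[i * size:(i + 1) * size]) for i in range(node_count)]

    # Dispatch one fragment per simulated node and wait for each sub-result.
    with ThreadPoolExecutor(max_workers=node_count) as pool:
        futures = [pool.submit(processing_node, f, operation) for f in fragments]
        sub_results = [future.result() for future in futures]

    # Merge (assumed policy): concatenate sub-results in fragment order.
    result: List[str] = []
    for sub in sub_results:
        result.extend(sub)
    return result
```

Concatenating in fragment order keeps the merged result aligned with the original row order, which is convenient for checking but is not required by the claim.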
CN201310125254.1A 2013-04-11 2013-04-11 The method, apparatus and system of distributed parallel task processing Active CN104102475B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310125254.1A CN104102475B (en) 2013-04-11 2013-04-11 The method, apparatus and system of distributed parallel task processing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310125254.1A CN104102475B (en) 2013-04-11 2013-04-11 The method, apparatus and system of distributed parallel task processing

Publications (2)

Publication Number Publication Date
CN104102475A CN104102475A (en) 2014-10-15
CN104102475B true CN104102475B (en) 2018-10-02

Family

ID=51670655

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310125254.1A Active CN104102475B (en) 2013-04-11 2013-04-11 The method, apparatus and system of distributed parallel task processing

Country Status (1)

Country Link
CN (1) CN104102475B (en)

Also Published As

Publication number Publication date
CN104102475A (en) 2014-10-15

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20190730

Address after: 518057 Nanshan District science and technology zone, Guangdong, Zhejiang Province, science and technology in the Tencent Building on the 1st floor of the 35 layer

Co-patentee after: Tencent Cloud Computing (Beijing) Co., Ltd.

Patentee after: Tencent Technology (Shenzhen) Co., Ltd.

Address before: Shenzhen Futian District City, Guangdong province 518000 Zhenxing Road, SEG Science Park 2 East Room 403

Patentee before: Tencent Technology (Shenzhen) Co., Ltd.