CN115174586B

CN115174586B - Automatic load balancing system and method based on cloud platform

Info

Publication number: CN115174586B
Application number: CN202211069778.9A
Authority: CN
Inventors: 吴俊逸
Original assignee: Changzhou Zunshang Information Technology Co ltd
Current assignee: Changzhou Zunshang Information Technology Co ltd
Priority date: 2022-09-02
Filing date: 2022-09-02
Publication date: 2022-11-29
Anticipated expiration: 2042-09-02
Also published as: CN115174586A

Abstract

The invention is suitable for the technical field of control or balance adjustment systems, and provides an automatic load balancing system and method based on a cloud platform, which comprises the following steps: receiving service request information sent by a user side, and determining the load required by a server to normally respond to the service request information; acquiring all server information capable of responding to the service according to the service request information, wherein the server information comprises a server load grade and a server load ratio; screening server information according to the server load level, reserving the server information capable of normally responding to the service, and judging whether the reserved server information is zero or not; when the number of the service request messages is zero, the received service request messages are intercepted for a short time, the load level of the server is checked in real time, and the interception is removed until the server messages capable of responding to the service normally exist. The server responds to the service request after processing a part of the executing tasks, and the server can be effectively prevented from being paralyzed.

Description

Automatic load balancing system and method based on cloud platform

Technical Field

The invention relates to the technical field of control or balance adjustment systems, in particular to an automatic load balancing system and method based on a cloud platform.

Background

Under the information age, network users grow exponentially, the requirements of the users on various network services are higher and higher, and the network services are required to have faster response time and higher service quality. When a user obtains a network service through a user terminal, a service request is usually initiated to a specific address through the specific address of a background server, and the background server responds to the service request of the user after receiving the service request. If the load of the background server is too high, the server cannot be normally provided, and even paralysis occurs. Therefore, it is desirable to provide an automatic load balancing system and method based on a cloud platform, which aim to solve the above problems.

Disclosure of Invention

Aiming at the defects in the prior art, the invention aims to provide an automatic load balancing system and method based on a cloud platform to solve the problems in the background technology.

The invention is realized in such a way, and an automatic load balancing method based on a cloud platform comprises the following steps:

receiving service request information sent by a user side, determining the load required by a server to normally respond to the service request information, and recording the service request information;

acquiring all server information capable of responding to the service according to the service request information, wherein the server information comprises a server load grade and a server load ratio;

screening server information according to the server load level, reserving the server information capable of normally responding to the service, and judging whether the reserved server information is zero or not;

when the server load ratio is not zero, selecting the server corresponding to the server information with the lowest reserved server load ratio to respond to the service request, sending service response information to the user side, and clearing the recorded service request information;

when the service request information is zero, the received service request information is temporarily intercepted, the load level of the server is checked in real time until the server information capable of normally responding to the service exists, and the interception is removed.

As a further scheme of the invention: the step of screening the server information according to the server load level and reserving the server information capable of normally responding to the service specifically includes:

determining a server load grade capable of being reserved according to the load quantity, wherein the server load grade is determined according to a residual load value;

and screening the server information according to the determined server load level, and reserving the server information capable of normally responding to the service.

As a further scheme of the invention: the step of performing transient interception on the received service request information further includes:

matching the intercepted service request information with the recorded service request information, and determining whether the same service request information exists or not;

when the same service request information exists, the server responds to the recorded service request information and responds to the intercepted service request information;

and after the server responds, the intercepted service request information and the recorded service request information are simultaneously cleared.

classifying the intercepted service request information, wherein the request contents of the service request information in each class are the same;

after the interception is removed, the server only directly responds to one service request message in each class, and other service request messages are automatically and indirectly responded.

As a further scheme of the invention: the method further comprises the following steps:

recording the times and time for intercepting the service request information;

calculating the interception frequency and the average interception time within the set time every other set time;

and obtaining a calculated value of the capacity increase, wherein the calculated value of the capacity increase = a first coefficient, interception frequency and a second coefficient, average interception time, and when the calculated value of the capacity increase is greater than a critical value of the capacity increase, generating capacity increase reminding information.

Another object of the present invention is to provide an automatic load balancing system based on a cloud platform, the system comprising:

the request information receiving module is used for receiving the service request information sent by the user side, determining the load required by the server for normally responding to the service request information and recording the service request information;

the server information acquisition module is used for acquiring all server information capable of responding to the service according to the service request information, wherein the server information comprises a server load grade and a server load ratio;

the server information screening module is used for screening the server information according to the server load level, reserving the server information capable of normally responding to the service, and judging whether the reserved server information is zero or not;

the server response module is used for selecting a server corresponding to the server information with the lowest reserved server load ratio to respond to the service request when the server response module is not zero, sending service response information to the user side and clearing the recorded service request information;

and the request information interception module is used for temporarily intercepting the received service request information when the service request information is zero, checking the load level of the server in real time until the server information capable of normally responding to the service exists, and removing interception.

As a further scheme of the invention: the server information screening module comprises:

a load grade determining unit, configured to determine a server load grade that can be reserved according to the load quantity, where the server load grade is determined according to a remaining load value;

and the server information screening unit is used for screening the server information according to the determined server load level and reserving the server information capable of normally responding to the service.

As a further scheme of the invention: the request information intercepting module comprises:

the request information matching unit is used for matching the intercepted service request information with the recorded service request information and determining whether the same service request information exists or not;

the synchronous response unit is used for responding the intercepted service request information when the same service request information exists and the server responds to the recorded service request information;

and the synchronous clearing unit is used for clearing the intercepted service request information and the recorded service request information at the same time after the server responds.

As a further scheme of the invention: the request information interception module further comprises:

the request information classification unit is used for classifying the intercepted service request information, and the request contents of the service request information in each class are the same;

and the automatic response unit enables the server to directly respond to only one service request message in each class after the interception is removed, and other service request messages are automatically and indirectly responded.

As a further scheme of the invention: the system also comprises a capacity-increasing reminding module, wherein the capacity-increasing reminding module specifically comprises:

the interception information recording unit is used for recording the times and time for intercepting the service request information;

the interception information calculation unit calculates the interception frequency and the average interception time within the set time at intervals of the set time;

and the reminding information generating unit is used for obtaining a calculated value of the capacity increase, wherein the calculated value of the capacity increase = a first coefficient, interception frequency and a second coefficient, average interception time, and generating capacity increase reminding information when the calculated value of the capacity increase is greater than a critical value of the capacity increase.

Compared with the prior art, the invention has the beneficial effects that:

the invention can obtain all server information capable of responding to the service according to the service request information, screen the server information according to the server load grade, reserve the server information capable of responding to the service normally, and judge whether the reserved server information is zero or not; when the load ratio of the server is not zero, selecting the server corresponding to the server information with the lowest load ratio of the reserved server to respond to the service request, sending service response information to the user side, and clearing the recorded service request information; when the current service request information is zero, the received service request information is intercepted for a short time, the load grade of the server is checked in real time until the server information which can normally respond to the service exists, and the interception is removed. The server responds to the service request after processing a part of the executing tasks, and the server can be effectively prevented from being paralyzed.

Drawings

Fig. 1 is a flowchart of an automatic load balancing method based on a cloud platform.

Fig. 2 is a flowchart of screening server information according to a server load level in an automatic load balancing method based on a cloud platform.

Fig. 3 is a flowchart illustrating matching of intercepted service request information with recorded service request information in an automatic load balancing method based on a cloud platform.

Fig. 4 is a flowchart of classifying intercepted service request information in an automatic load balancing method based on a cloud platform.

Fig. 5 is a flowchart of generating capacity increase reminding information in an automatic load balancing method based on a cloud platform.

Fig. 6 is a schematic structural diagram of an automatic load balancing system based on a cloud platform.

Fig. 7 is a schematic structural diagram of a server information screening module in an automatic load balancing system based on a cloud platform.

Fig. 8 is a schematic structural diagram of a request information intercepting module in an automatic load balancing system based on a cloud platform.

Fig. 9 is a schematic structural diagram of a capacity increase reminding module in an automatic load balancing system based on a cloud platform.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings and specific embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and do not limit the invention.

Specific implementations of the present invention are described in detail below with reference to specific embodiments.

As shown in fig. 1, an embodiment of the present invention provides an automatic load balancing method based on a cloud platform, where the method includes the following steps:

s100, receiving service request information sent by a user side, determining the load required by a server to normally respond to the service request information, and recording the service request information;

s200, acquiring all server information capable of responding to the service according to the service request information, wherein the server information comprises a server load grade and a server load ratio;

s300, screening the server information according to the server load level, reserving the server information capable of normally responding to the service, and judging whether the reserved server information is zero or not;

s400, when the number is not zero, selecting a server corresponding to the server information with the lowest reserved server load ratio to respond to the service request, sending service response information to the user side, and clearing the recorded service request information;

and S500, when the number is zero, temporarily intercepting the received service request information, checking the load level of the server in real time until the server information capable of normally responding to the service exists, and releasing the interception.

It should be noted that, in the information age, network users are exponentially increasing, and users have higher and higher requirements for various network services, which require faster response time and higher quality of service for the network services. When a user obtains a network service through a user terminal, a service request is usually initiated to a specific address through the specific address of a background server, and the background server responds to the service request of the user after receiving the service request. If the load of the background server is too high, the background server cannot provide services normally, and even paralysis occurs.

In the embodiment of the invention, when receiving service request information sent by a user end, load capacity required by a server for normally responding to the service request information is predicted according to the specific service request information, the service request information is recorded, then all server information capable of responding to service is obtained according to the service request information, the server information comprises a server address, a server load grade and a server load ratio, the server load grade can reflect the residual load value of the server, the server load ratio is a load degree, for example, the server load ratio is 72%, then the server information is screened according to the server load grade, only the server information capable of normally responding to service is reserved, and whether the reserved server information is zero or not is judged; when the service request information is not zero, the server corresponding to the server information with the lowest server load occupation ratio is selected to respond to the service request, the lower the server load occupation ratio is, the higher the response speed is, the service response information is sent to the user side, and the recorded service request information is removed, so that the service request information is completed; when the number of the tasks is zero, the server can normally respond to the service request, if the load is imposed on the server, the server can be paralyzed, so that the service request cannot be responded in time, and the task which is being executed by the server can be influenced.

As shown in fig. 2, as a preferred embodiment of the present invention, the step of screening server information according to a server load level and retaining the server information capable of responding to a service normally includes:

s301, determining a server load grade capable of being reserved according to the load quantity, wherein the server load grade is determined according to a residual load value;

s302, screening the server information according to the determined server load level, and reserving the server information capable of normally responding to the service.

In the embodiment of the invention, the server load grades capable of being reserved can be determined according to the load, each load range corresponds to one server load grade, the corresponding rule is set in advance, the server load grades are determined according to the residual load values, the more the residual load values are, the server can process the service request with large load, then the server information is screened according to the determined server load grades, only the server information capable of normally responding to the service is reserved, for example, the server load grades determined according to the load values are three grades, the higher the grades are, the more the residual load values are, and the server load grades in the reserved server information are all greater than or equal to the three grades.

As shown in fig. 3, as a preferred embodiment of the present invention, the step of temporarily intercepting the received service request information further includes:

s501, the intercepted service request information is matched with the recorded service request information, and whether the same service request information exists or not is determined;

s502, when the same service request information exists, the server responds to the recorded service request information and simultaneously responds to the intercepted service request information;

s503, after the server responds, the intercepted service request information and the recorded service request information are simultaneously cleared.

In the embodiment of the invention, after the service request information is intercepted, the intercepted service request information is matched with the recorded service request information to determine whether the same service request information exists, which can be understood that the service processed by a server group has similarity, when the service request information is intercepted, the server group is indicated to respond to a large number of service requests, the intercepted service request information is likely to be the same as the recorded service request information, and when the same service request information exists and the server responds to the recorded service request information, the intercepted service request information is responded at the same time, namely the service response information is sent to two user sides at the same time, so that the intercepted service request can be responded quickly, and the intercepted service request information and the recorded service request information are cleared at the same time after the server responds.

As shown in fig. 4, as a preferred embodiment of the present invention, the step of performing short-term interception on the received service request information further includes:

s504, the intercepted service request information is classified, and the request content of the service request information in each class is the same;

and S505, after the interception is removed, the server only directly responds to one service request message in each class, and other service request messages are automatically and indirectly responded.

In the embodiment of the invention, after the service request information is intercepted, the intercepted service request information is also classified, the request contents of the service request information in each class are the same, for example, a hot topic appears, which causes the load of the server to be overlarge, so that many service request information are related to the hot topic, after the interception is removed, the server only directly responds to one service request information in each class, other service request information is automatically and indirectly responded, namely, the service response information obtained by the direct response is copied into a plurality of parts, and the service response information is sent to the user side corresponding to each service request information in the class.

As shown in fig. 5, as a preferred embodiment of the present invention, the method further includes:

s601, recording times and time for intercepting service request information;

s602, calculating the interception frequency and the average interception time within set time at intervals of the set time;

and S603, obtaining a calculated value of the capacity increase, wherein the calculated value of the capacity increase = a first coefficient, interception frequency and a second coefficient, average interception time, and when the calculated value of the capacity increase is greater than a critical value of the capacity increase, generating capacity increase reminding information.

In the embodiment of the present invention, it should be understood that the number of cases of intercepting the service request information should be very small, and after all, the number of times and the time of intercepting the service request information need to be recorded, and the interception frequency and the average interception time within the set time are recalculated at set intervals to obtain a capacity increase calculated value, when the capacity increase calculated value is greater than the capacity increase critical value, it is indicated that the influence on the user is large, capacity increase reminding information is generated, and related personnel are reminded to upgrade the server, and the set time, the first coefficient, the second coefficient, and the capacity increase critical value are all set values in advance.

As shown in fig. 6, an embodiment of the present invention further provides an automatic load balancing system based on a cloud platform, where the system includes:

a request information receiving module 100, configured to receive service request information sent by a user, determine a load required by a server to normally respond to the service request information, and record the service request information;

a server information obtaining module 200, configured to obtain, according to the service request information, all server information capable of responding to the service, where the server information includes a server load level and a server load ratio;

the server information screening module 300 is configured to screen server information according to a server load level, reserve server information that can normally respond to a service, and determine whether the reserved server information is zero;

the server response module 400, when the time is not zero, selects the server corresponding to the server information with the lowest load ratio of the reserved server to respond to the service request, sends the service response information to the user side, and clears the recorded service request information;

and when the request information interception module 500 is zero, temporarily intercepting the received service request information, checking the load level of the server in real time until the server information capable of normally responding to the service exists, and removing interception.

In the embodiment of the invention, when business request information sent by a user side is received, load capacity required by a server for normally responding the business request information is predicted according to the specific business request information, the business request information is recorded, then all server information capable of responding business services is obtained according to the business request information, the server information comprises a server address, a server load grade and a server load ratio, the server load grade can reflect the residual load value of the server, the server load ratio is a load degree, then the server information is screened according to the server load grade, only the server information capable of normally responding the business is reserved, and whether the reserved server information is zero or not is judged; when the service request information is not zero, the server corresponding to the server information with the lowest server load occupation ratio is selected to respond to the service request, the lower the server load occupation ratio is, the higher the response speed is, the service response information is sent to the user side, and the recorded service request information is removed, so that the service request information is completed; when the number of the tasks is zero, the server can normally respond to the service request, if the load is imposed on the server, the server can be paralyzed, so that the service request cannot be responded in time, and the task which is being executed by the server can be influenced.

As shown in fig. 7, as a preferred embodiment of the present invention, the server information filtering module 300 includes:

a load level determining unit 301, configured to determine, according to the load amount, a server load level that can be reserved, where the server load level is determined according to a remaining load value;

a server information screening unit 302, configured to screen server information according to the determined server load level, and retain server information capable of responding to a service normally.

As shown in fig. 8, as a preferred embodiment of the present invention, the request information intercepting module 500 includes:

a request information matching unit 501, configured to match the intercepted service request information with the recorded service request information, and determine whether the same service request information exists;

a synchronous response unit 502, which responds to the intercepted service request information when the server responds to the recorded service request information when the same service request information exists;

after the server responds, the synchronous clearing unit 503 clears the intercepted service request information and the recorded service request information at the same time.

As shown in fig. 8, as a preferred embodiment of the present invention, the request information intercepting module 500 further includes:

a request information classification unit 504, configured to classify the intercepted service request information, where request contents of the service request information in each class are the same;

after the interception is removed, the automatic response unit 505 enables the server to directly respond to only one service request message in each class, and other service request messages are automatically and indirectly responded.

As shown in fig. 9, as a preferred embodiment of the present invention, the system further includes a capacity-increasing reminding module 600, where the capacity-increasing reminding module 600 specifically includes:

an interception information recording unit 601, configured to record the number of times and time for intercepting the service request information;

an interception information calculation unit 602 that calculates an interception frequency and an average interception time within a set time at intervals of the set time;

the reminding information generating unit 603 is configured to obtain a calculated value of the increase-volume, where the calculated value of the increase-volume = a first coefficient, an interception frequency, and a second coefficient, an average interception time, and generate the reminding information of the increase-volume when the calculated value of the increase-volume is greater than the critical value of the increase-volume.

The present invention has been described in detail with reference to the preferred embodiments thereof, and it should be understood that the present invention is not limited thereto, but includes any modifications, equivalents, and improvements within the spirit and scope of the present invention.

It should be understood that, although the steps in the flowcharts of the embodiments of the present invention are shown in sequence as indicated by the arrows, the steps are not necessarily performed in sequence as indicated by the arrows. The steps are not limited to being performed in the exact order illustrated and, unless explicitly stated herein, may be performed in other orders. Moreover, at least a portion of steps in various embodiments may include multiple sub-steps or multiple stages that are not necessarily performed at the same time, but may be performed at different times, and the order of performance of the sub-steps or stages is not necessarily sequential, but may be performed alternately or alternatingly with other steps or at least a portion of sub-steps or stages of other steps.

It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a non-volatile computer-readable storage medium, and can include the processes of the embodiments of the methods described above when the program is executed. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), programmable ROM (PROM), electrically Programmable ROM (EPROM), electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double Data Rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous Link DRAM (SLDRAM), rambus (Rambus) direct RAM (RDRAM), direct Rambus Dynamic RAM (DRDRAM), and Rambus Dynamic RAM (RDRAM), among others.

Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure herein. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice in the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.

Claims

1. An automatic load balancing method based on a cloud platform is characterized by comprising the following steps:

when the load ratio of the server is not zero, selecting the server corresponding to the server information with the lowest load ratio of the reserved server to respond to the service request, sending service response information to the user side, and clearing the recorded service request information;

when the service request information is zero, carrying out short-time interception on the received service request information, checking the load level of the server in real time until the server information capable of normally responding to the service exists, and removing the interception;

the method further comprises the following steps:

recording the times and time for intercepting the service request information;

recalculating the interception frequency and the average interception time within a set time at intervals of the set time;

2. The automatic load balancing method based on the cloud platform according to claim 1, wherein the step of screening server information according to a server load level and retaining the server information that can normally respond to a service specifically includes:

3. The automatic load balancing method based on the cloud platform according to claim 1, wherein the step of temporarily intercepting the received service request information further comprises:

when the same service request information exists, the server responds to the intercepted service request information when responding to the recorded service request information;

4. The automatic load balancing method based on the cloud platform according to claim 1, wherein the step of temporarily intercepting the received service request information further comprises:

after the interception is removed, the server only directly responds to one service request message in each type, and other service request messages are automatically and indirectly responded.

5. An automatic load balancing system based on a cloud platform, the system comprising:

the request information receiving module is used for receiving the service request information sent by the user side, determining the load required by the server to normally respond to the service request information and recording the service request information;

the server response module is used for selecting a server corresponding to the server information with the lowest reserved load ratio of the server to respond to the service request when the server response module is not zero, sending service response information to the user side and clearing the recorded service request information;

the request information interception module is used for temporarily intercepting the received service request information when the service request information is zero, checking the load level of the server in real time until the server information capable of normally responding to the service exists, and removing interception;

the system also comprises a capacity-increasing reminding module, wherein the capacity-increasing reminding module specifically comprises:

6. The automatic load balancing system based on the cloud platform as claimed in claim 5, wherein the server information filtering module comprises:

a load level determining unit, configured to determine a server load level that can be reserved according to the load amount, where the server load level is determined according to a remaining load value;

7. The automatic load balancing system based on cloud platform of claim 5, wherein the request information intercepting module comprises:

8. The automatic load balancing system based on cloud platform of claim 5, wherein the request information intercepting module further comprises: