
CN107992416B - Method and device for determining webpage time delay - Google Patents


Info

Publication number
CN107992416B
Authority
CN
China
Prior art keywords
webpage
address
designated
target
terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711219239.8A
Other languages
Chinese (zh)
Other versions
CN107992416A (en
Inventor
刘永生
王光全
廖军
左冰
文湘江
刘千仞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China United Network Communications Group Co Ltd
Original Assignee
China United Network Communications Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China United Network Communications Group Co Ltd
Priority to CN201711219239.8A
Publication of CN107992416A
Application granted
Publication of CN107992416B

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/36 Prevention of errors by analysis, debugging or testing of software
    • G06F11/3668 Testing of software
    • G06F11/3672 Test management
    • G06F11/3688 Test management for test execution, e.g. scheduling of test suites
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/36 Prevention of errors by analysis, debugging or testing of software
    • G06F11/3604 Analysis of software for verifying properties of programs
    • G06F11/3612 Analysis of software for verifying properties of programs by runtime analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/957 Browsing optimisation, e.g. caching or content distillation
    • G06F16/9574 Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Quality & Reliability (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The present application discloses a method and a device for determining webpage delay, relating to the field of mobile communication technology. It addresses the problem that operators have difficulty obtaining the real-time webpage delay experienced by all users when they access webpages, because users are unwilling to install probes and probes are difficult to install in some areas. The method includes: acquiring, according to a webpage identifier and a terminal IP address, target traffic data from all the traffic data stored by a one-way DPI device; determining the difference between the response time and the access time as a single round-trip delay; determining all single round-trip delays when the designated terminal accesses the designated webpage within a designated time; and determining, according to all the single round-trip delays, the designated webpage delay of the designated terminal accessing the designated webpage. The application is suitable for determining the webpage delay when a user accesses a webpage.

Figure 201711219239

Description

Method and device for determining webpage time delay
Technical Field
The present invention relates to the field of mobile communications technologies, and in particular, to a method and an apparatus for determining a webpage time delay.
Background
The webpage time delay is the time that elapses, when a user accesses a webpage, between the user sending an access request and the user receiving the requested content. It reflects the user's service perception of the webpage service, and operators can use its magnitude to demarcate and locate network transmission problems. How to determine the webpage time delay has therefore become a matter of considerable concern to operators.
At present, the webpage time delay is generally determined by probe-based dial testing: a probe is installed on the user side to simulate the user's webpage access behavior and thereby obtain the webpage time delay. However, because of geographical constraints, probes can only be installed where manual installation is possible, so the operator can determine the webpage time delay of only some users. Moreover, the probe does not help the user surf the Internet and consumes the user's traffic during dial testing, so users are often reluctant to cooperate with the operator in installing it. As a result, when a user has a problem accessing a webpage, operation and maintenance personnel can only visit the webpage on an ad hoc basis to measure the webpage time delay, and the measured value reflects the service perception at the time of the dial test rather than the user's service perception when the problem occurred.
Disclosure of Invention
The application provides a method and a device for determining webpage time delay, which can solve the problem that an operator has difficulty acquiring the real-time webpage time delay of all users accessing webpages because users are unwilling to install probes and probes are difficult to install in some areas.
To achieve this purpose, the following technical solutions are adopted:
In a first aspect, the present application provides a method for determining a webpage time delay, where the method includes:
acquiring, according to a webpage identifier and a terminal Internet Protocol (IP) address, target traffic data from all traffic data stored by a one-way Deep Packet Inspection (DPI) device, wherein the target traffic data at least comprises the access time at which a specified terminal corresponding to the terminal IP address accesses a specified webpage corresponding to the webpage identifier, and the response time at which the specified terminal responds to the specified webpage in this access;
determining the difference between the response time and the access time as a single round-trip delay;
determining all single round-trip delays when the specified terminal accesses the specified webpage within a specified time;
and determining, according to all the single round-trip delays, the specified webpage time delay of the specified terminal for accessing the specified webpage.
In a second aspect, the present application provides an apparatus for determining a webpage time delay, where the apparatus includes:
an acquisition module, configured to acquire, according to the webpage identifier and the terminal IP address, target traffic data from all traffic data stored by the unidirectional DPI device, wherein the target traffic data at least comprises the access time at which a specified terminal corresponding to the terminal IP address accesses a specified webpage corresponding to the webpage identifier, and the response time at which the specified terminal responds to the specified webpage in this access;
a determining module, configured to determine the difference between the response time and the access time acquired by the acquisition module as a single round-trip delay;
the determining module is further configured to determine all single round-trip delays, acquired by the acquisition module, when the specified terminal accesses the specified webpage within a specified time;
the determining module is further configured to determine, according to all the single round-trip delays, the specified webpage time delay of the specified terminal for accessing the specified webpage.
In a third aspect, the present application provides a network device, including: memory for storing one or more computer programs, the one or more computer programs comprising instructions; when executed by a processor, the instructions cause the network device to perform the method for determining a web page latency according to any one of the first aspect and its various alternative implementations.
In a fourth aspect, the present application provides a computer-readable storage medium, where instructions are stored, and when the instructions are executed on a network device, the network device is caused to execute the method for determining a web page delay according to any one of the first aspect and various optional implementations thereof.
In the prior art, an operator cannot acquire the webpage time delay of all users accessing webpages, because probes cannot be installed in some areas. By contrast, the method and device for determining webpage time delay provided by the present application acquire, from the unidirectional DPI device, the traffic data generated when terminals access webpages, and determine from that traffic data the webpage time delay when a specified terminal accesses a specified webpage. In addition, when the webpage time delay of a certain user at a certain moment is needed, it suffices to screen that user's traffic data at that moment out of the traffic data and calculate the webpage time delay from it, so the calculated webpage time delay truly reflects the user's real-time service perception.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a flowchart of a method for determining a web page delay according to an embodiment of the present invention;
Fig. 2 is a schematic view of an application scenario of a unidirectional DPI device according to an embodiment of the present invention;
Fig. 3 is a flowchart of another method for determining a web page delay according to an embodiment of the present invention;
Fig. 4 is a flowchart of another method for determining a web page delay according to an embodiment of the present invention;
Fig. 5 is a flowchart of another method for determining a web page delay according to an embodiment of the present invention;
Fig. 6 is a schematic structural diagram of a device for determining a web page delay according to an embodiment of the present invention;
Fig. 7 is a schematic structural diagram of a network device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The embodiment of the invention provides a method for determining webpage time delay, which is applied to a device for determining the webpage time delay. The determining device of the web page delay may specifically be a network device such as a terminal or a server, which has data analysis and processing capabilities. As shown in fig. 1, the method includes:
Step 101, acquiring, according to the webpage identifier and the terminal IP address, target traffic data from all traffic data stored in the unidirectional DPI device.
The target traffic data at least comprises the access time at which a specified terminal corresponding to the terminal IP address accesses a specified webpage corresponding to the webpage identifier, and the response time at which the specified terminal responds to the specified webpage in this access.
The unidirectional DPI equipment adopts a DPI technology, and can identify the application layer data of the data packet, so that the network flow is analyzed in detail. Because the DPI device capable of simultaneously acquiring the uplink traffic and the downlink traffic is expensive at present, and the deployment number of the DPI device is in direct proportion to the data volume of the network traffic to be analyzed, in order to save the investment cost, a unidirectional DPI device is generally used to detect the uplink traffic of the network.
As shown in fig. 2, the unidirectional DPI device is located between the designated terminal and the server, and is configured to detect the upstream traffic sent by the terminal to the server. In the embodiment of the present invention, the traffic data sent when the terminal accesses a web page is based on the Transmission Control Protocol (TCP). TCP uses a three-way handshake to establish a data connection between a terminal and a server. In this handshake, the terminal first sends a first Synchronize Sequence Numbers (SYN) message to the server; the first SYN message passes through the unidirectional DPI device on its way to the server, and the terminal waits for the server's confirmation. After receiving the first SYN message, the server acknowledges it by sending a second SYN message and a first Acknowledgement (ACK) message to the terminal. After receiving the second SYN message and the first ACK message, the terminal sends a second ACK message to the server. Once the server receives the second ACK message, the three-way handshake is complete, the connection is established, and the terminal can access the web page content on the server.
It should be noted that, when the unidirectional DPI device receives the first SYN message, it records the receiving time of the first SYN message and determines that time as the access time; when the unidirectional DPI device receives the second ACK message, it records the receiving time of the second ACK message and determines that time as the response time.
If the unidirectional DPI device observes only the first SYN message and not the second ACK message, the possible causes are that the server did not respond to the user's access request (for example, because it was busy), that the first SYN message did not reach the server because of poor network quality between the unidirectional DPI device and the server, or that the second SYN message and the first ACK message sent by the server did not reach the terminal. Such situations can be improved by optimizing the network or by having the terminal re-initiate the access request to the server several times.
And step 102, determining the difference value between the response time and the access time as the single round-trip delay.
In the embodiment of the present invention, with $t_1$ denoting the access time and $t_2$ denoting the response time, the single round-trip delay (Round-Trip Time, RTT) is expressed as:
$$RTT = t_2 - t_1$$
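The calculation in steps 101 and 102 can be sketched as follows. This is a minimal illustration, assuming the uplink records of one connection are available as hypothetical `(timestamp, flag)` pairs; the record layout and the function name `single_rtt` are assumptions made for the example and are not defined by the application.

```python
from typing import Iterable, Optional, Tuple


def single_rtt(uplink_events: Iterable[Tuple[float, str]]) -> Optional[float]:
    """Return t2 - t1 for one TCP handshake observed on the uplink only.

    uplink_events holds (timestamp_seconds, flag) pairs in arrival order,
    where flag is "SYN" for the terminal's first SYN message and "ACK" for
    the terminal's second ACK message (hypothetical, simplified layout).
    """
    t1 = None  # access time: arrival of the first SYN at the DPI device
    for timestamp, flag in uplink_events:
        if flag == "SYN" and t1 is None:
            t1 = timestamp
        elif flag == "ACK" and t1 is not None:
            # response time t2 is the arrival of the second ACK
            return timestamp - t1
    # A SYN without the second ACK yields no RTT for this attempt.
    return None
```

Returning `None` for an incomplete handshake keeps failed attempts out of the later statistics instead of counting them as zero delay.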
Step 103, determining all single round-trip delays when the designated terminal accesses the designated webpage within the designated time.
In the process of accessing a web page, a user also accesses the web page elements in it, such as pictures and script files. When the user accesses a web page containing a plurality of web page elements, those elements need to be downloaded from the corresponding servers, so the web page delay includes the downloading time of the web page elements, and the time consumed in downloading different web page elements from different servers differs. For this reason, all single round-trip delays within a designated time are determined rather than only one. The designated time can be set manually, for example with reference to the average time that users spent accessing the same web page in historical data.
In the embodiment of the invention, the difference between the response time and the access time of one access request initiated by the terminal is used as a single round-trip delay, and the access request may target either the web page itself or one of its web page elements.
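Step 103 can be illustrated with a short sketch that keeps only the accesses whose access time falls inside the designated time. The `RttSample` record and the half-open window `[start, start + duration)` are assumptions chosen for the example.

```python
from typing import Iterable, List, NamedTuple


class RttSample(NamedTuple):
    """One access request as recorded from uplink traffic (hypothetical layout)."""
    access_time: float    # t1, in seconds
    response_time: float  # t2, in seconds
    server_ip: str        # destination IP address of the request


def rtts_in_window(samples: Iterable[RttSample],
                   start: float, duration: float) -> List[float]:
    """Collect every single round-trip delay whose access time lies in the window."""
    end = start + duration
    return [s.response_time - s.access_time
            for s in samples
            if start <= s.access_time < end]
```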
And step 104, determining the specified webpage time delay of the specified terminal for accessing the specified webpage according to all the single round-trip time delays.
The embodiment of the invention obtains the flow data of the webpage accessed by the terminal from the unidirectional DPI equipment, thereby determining the webpage time delay when the appointed terminal accesses the appointed webpage according to the flow data, and because the unidirectional DPI equipment does not need to be deployed at the user side, and the flow data when all users access the webpage are transmitted to the server after passing through the unidirectional DPI equipment, the calculated webpage time delay can reflect the service perception when all users access the webpage without the cooperation of the users; and when the webpage time delay of a certain user at a certain moment needs to be acquired, the flow data of the user at the moment is only needed to be screened from the flow data to calculate the webpage time delay, so that the calculated webpage time delay truly reflects the real-time service perception of the user.
In order to facilitate the screening of the target flow data from all the flow data, in one implementation manner of the embodiment of the present invention, a screening basis needs to be determined. Therefore, on the basis of the implementation shown in fig. 1, the implementation shown in fig. 3 can also be realized. Step 101 is to obtain target traffic data in all traffic data stored in the unidirectional DPI device according to the terminal IP address and the web page identifier, and may perform steps 1011 to 1015:
Step 1011, obtaining the Uniform Resource Identifier (URI) of the specified web page, the web page elements of the specified web page, and the URIs of the web page elements.
It should be noted that the web page elements at least include the pictures, sounds, videos, style sheet files, script files, and the like in the web page.
For a web page whose delay is to be determined, the web page is first accessed and its HyperText Markup Language (HTML) source file is saved. The HTML source file consists of a head part and a body part: the head part provides information about the web page, and the body part provides its specific content. After the HTML source file is obtained, the device for determining web page delay parses the head part of the HTML source file to obtain the URI of the web page and the URIs of its web page elements.
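One possible way to collect element URIs from a saved HTML source file is sketched below with Python's standard `html.parser`. The tag-to-attribute table, the class name `ElementUriCollector`, and the choice to scan the whole document rather than only the head part are assumptions of this sketch, not details given in the application.

```python
from html.parser import HTMLParser
from typing import List
from urllib.parse import urljoin

# Tags that reference external web page elements, and the attribute holding the URI.
_URI_ATTR = {"img": "src", "script": "src", "audio": "src",
             "video": "src", "source": "src", "link": "href"}


class ElementUriCollector(HTMLParser):
    """Collect absolute URIs of pictures, scripts, style sheets and media."""

    def __init__(self, page_uri: str) -> None:
        super().__init__()
        self.page_uri = page_uri
        self.element_uris: List[str] = []

    def handle_starttag(self, tag, attrs):
        wanted = _URI_ATTR.get(tag)
        if wanted is None:
            return
        for name, value in attrs:
            if name == wanted and value:
                # Resolve relative references against the page URI.
                self.element_uris.append(urljoin(self.page_uri, value))


def element_uris(page_uri: str, html_source: str) -> List[str]:
    collector = ElementUriCollector(page_uri)
    collector.feed(html_source)
    return collector.element_uris
```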
Step 1012, obtaining a first reference field and a second reference field from the HyperText Transfer Protocol (HTTP) header of each data packet in all the traffic data.
It should be noted that, after all the traffic data sent by the unidirectional DPI device is received, the Host field and the Request URI field of the HTTP header in each data packet are extracted and concatenated, with the Host field first and the Request URI field second, to obtain the URI of the web page element accessed by that data packet. The concatenated field, that is, the URI of the web page element accessed by the data packet, is used as the first reference field. Through the first reference field, data packets whose URI is the same as that of a web page element in the specified web page can be screened out from all the traffic data.
In addition to the first reference field, the access source (Referer) field of the HTTP header is extracted as the second reference field. When the terminal accesses a web page element, it sends an access request to the server storing the web page file or the web page element file, and the data packet of the access request carries a Referer field that tells the server which web page the request was linked from. Therefore, the web page to which a web page element belongs can be determined by extracting the Referer field from the terminal's data packet, that is, through the second reference field.
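The two reference fields can be derived from a raw HTTP request roughly as follows. This is a simplified sketch: the function name `reference_fields` is hypothetical, and the parsing assumes a well-formed HTTP/1.x request with CRLF line endings.

```python
from typing import Optional, Tuple


def reference_fields(raw_request: bytes) -> Tuple[str, Optional[str]]:
    """Return (first_reference_field, second_reference_field).

    first  = Host field followed by the Request URI from the request line
    second = Referer field, or None when the request carries no Referer
    """
    head = raw_request.split(b"\r\n\r\n", 1)[0].decode("iso-8859-1")
    lines = head.split("\r\n")
    _method, request_uri, _version = lines[0].split(" ", 2)

    headers = {}
    for line in lines[1:]:
        name, _, value = line.partition(":")
        headers[name.strip().lower()] = value.strip()  # header names are case-insensitive

    first = headers.get("host", "") + request_uri  # Host first, Request URI second
    second = headers.get("referer")
    return first, second
```

Note that the concatenated first reference field carries no scheme, so any comparison against URIs taken from the HTML source has to use the same form on both sides.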
Step 1013, if the first reference field is the same as one of the URIs of the web page elements and the second reference field is the same as the URI of the specified web page, determining the data packet in all the traffic data that includes the first reference field and the second reference field as a reference data packet.
The first reference field of each data packet in all the traffic data, that is, the URI of the web page element accessed by the data packet, is compared with the URIs of the web page elements in the specified web page, and the second reference field, that is, the Referer field in the data packet, is compared with the URI of the specified web page. If both the first reference field and the second reference field match the corresponding URIs, the data packet containing the first reference field and the second reference field is determined to be a data packet that accesses the specified web page, that is, a reference data packet.
In the embodiment of the present invention, the webpage identifier may specifically be a first reference field and a second reference field.
In the embodiment of the present invention, after the unidirectional DPI device acquires the first reference field and the second reference field, the reference packet may be screened out according to the first reference field and the second reference field.
And step 1014, acquiring the terminal IP address of the appointed terminal.
After the reference data packets are screened out, the reference data packets need to be filtered according to the terminal IP address of the specified terminal used by the user, so as to screen out the target data packets of the specified terminal accessing the specified webpage.
Step 1015, determining, as target data packets, the reference data packets whose source IP address is the same as the terminal IP address.
The target data packets constitute the target traffic, and the target traffic at least comprises the target traffic data.
It should be noted that the terminal has a fixed and unique IP address, and the data packet sent by the terminal to the server carries the IP address, i.e., the source IP address. Because the source IP has uniqueness, if the source IP address included in the data packet is the same as the terminal IP address of the designated terminal, the data packet is determined to be the data packet sent by the designated terminal.
In the embodiment of the invention, the target data packet is screened from the data packets forming all the flow data, and the target data packet is the data packet sent when the appointed terminal accesses the appointed webpage, so that the webpage time delay of the appointed terminal accessing the appointed webpage can be determined according to the target flow data.
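The two-stage screening of steps 1013 and 1015 can be sketched as below. The `Packet` record and the function name are hypothetical, and the sketch assumes the element URIs are stored in the same form as the first reference field so that plain equality can be used.

```python
from typing import Iterable, List, NamedTuple, Optional, Set


class Packet(NamedTuple):
    """Fields extracted from one uplink data packet (hypothetical layout)."""
    src_ip: str
    dst_ip: str
    first_ref: str             # Host + Request URI
    second_ref: Optional[str]  # Referer


def select_target_packets(packets: Iterable[Packet], page_uri: str,
                          element_uris: Set[str], terminal_ip: str) -> List[Packet]:
    # Step 1013: keep packets that access an element of the specified web page.
    reference = [p for p in packets
                 if p.first_ref in element_uris and p.second_ref == page_uri]
    # Step 1015: keep reference packets sent by the specified terminal.
    return [p for p in reference if p.src_ip == terminal_ip]
```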
In order to determine the web page delay, in an implementation manner of the embodiment of the present invention, the IP address of the server storing each web page element needs to be determined first, so as to distinguish target data packets that access different servers. Therefore, on the basis of the implementation shown in fig. 3, the implementation shown in fig. 4 may be implemented: after step 1015 is executed to determine, as target data packets, the reference data packets whose source IP address is the same as the terminal IP address, step 105 may further be executed:
Step 105, acquiring the destination IP address of the target data packet, and determining the destination IP address as the IP address of the target server.
It should be noted that, because the web page delay is related to the time taken to acquire each web page element from its server, in the embodiment of the present invention the target server to which each web page element belongs needs to be determined before the web page delay is calculated. That is, the destination IP address contained in the screened target data packet, which is the IP address of the server storing the web page element, is determined from the network layer protocol header of that data packet.
In the embodiment of the invention, the destination IP address contained in the target data packet is used as the IP address of the target server. Compared with determining the target server to which a web page element belongs from the web page itself, this saves time in determining the IP address of the target server and reduces the workload of the device for determining web page delay.
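As an illustration of reading the destination IP address from the network layer header, the sketch below assumes an IPv4 header supplied as raw bytes; the application does not restrict the network layer protocol, so IPv4 is an assumption of the example, as is the function name.

```python
import ipaddress


def destination_ip(ipv4_header: bytes) -> str:
    """Return the destination address from a raw IPv4 header.

    In an IPv4 header the source address occupies bytes 12-15 and the
    destination address occupies bytes 16-19.
    """
    if len(ipv4_header) < 20 or ipv4_header[0] >> 4 != 4:
        raise ValueError("not an IPv4 header")
    return str(ipaddress.IPv4Address(ipv4_header[16:20]))
```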
In order to accurately determine the web page delay, in an implementation manner of the embodiment of the present invention, it is necessary to calculate the web page delay by comprehensively considering the single round trip delay of the terminal for accessing the web page and the single round trip delay of the terminal for accessing the web page element, and therefore, on the basis of the implementation manner shown in fig. 4, the implementation manner shown in fig. 5 may also be implemented. Step 104 determines, according to all single round-trip delays, a specified webpage delay of a specified terminal for accessing a specified webpage, which may be specifically executed as steps 1041 to 1043:
Step 1041, dividing, among all the single round-trip delays, those measured when accessing the target server corresponding to the same IP address into one group.
In the embodiment of the invention, all single round-trip time delays counted in the specified time are grouped according to the IP address of the accessed server, and the single round-trip time delays for accessing the servers corresponding to the same IP address are divided into a group. To facilitate the counting and comparison of the number of times a given terminal accesses each server, each set of single round trip delays may be represented using the following form:
$$IP_1:\ RTT_{1,1},\ RTT_{1,2},\ \ldots,\ RTT_{1,n_1}$$
$$IP_2:\ RTT_{2,1},\ RTT_{2,2},\ \ldots,\ RTT_{2,n_2}$$
......
$$IP_k:\ RTT_{k,1},\ RTT_{k,2},\ \ldots,\ RTT_{k,n_k}$$
where $IP_1$, $IP_2$, and $IP_k$ denote the server corresponding to the first IP address, the server corresponding to the second IP address, and the server corresponding to the k-th IP address, respectively; and $RTT_{1,n_1}$, $RTT_{2,n_2}$, and $RTT_{k,n_k}$ denote, respectively, the single round-trip delay of the designated terminal's $n_1$-th access to the server corresponding to the first IP address, of its $n_2$-th access to the server corresponding to the second IP address, and of its $n_k$-th access to the server corresponding to the k-th IP address.
It should be noted that the number of values in each group may differ. One reason is that more of the web page elements accessed by the user may reside on one server than on another, so the user accesses the former server many times and the latter only a few times. Another is that the user may access the same web page element many times and other web page elements only a few times, which also makes the number of accesses to each server unequal.
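The grouping in step 1041 amounts to bucketing delays by the IP address of the accessed server. A minimal sketch, assuming each measurement is a hypothetical `(server_ip, rtt)` pair:

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def group_by_server(measurements: Iterable[Tuple[str, float]]) -> Dict[str, List[float]]:
    """Group single round-trip delays by the IP address of the accessed server."""
    groups: Dict[str, List[float]] = defaultdict(list)
    for server_ip, rtt in measurements:
        groups[server_ip].append(rtt)
    return dict(groups)
```

The length of each list is the access count for that server, so groups of unequal size fall out naturally.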
Step 1042, determining the weight corresponding to each target server according to the formula
$$\alpha_i = \frac{n_i}{\sum_{j=1}^{k} n_j}$$
where $\alpha_i$ denotes the weight of the target server corresponding to the i-th IP address, $n_i$ denotes the number of times the designated terminal accesses the target server corresponding to the i-th IP address within the designated time, and $k$ denotes the total number of target servers.
Step 1043, calculating the specified webpage time delay $d$ according to the formula
$$d = \sum_{i=1}^{k} \alpha_i d_i$$
where $d_i$ denotes the median of the group formed by the single round-trip delays for accessing the target server corresponding to the i-th IP address.
And considering that the single round trip delay in each group has a long tail effect, selecting the median of the group formed by the single round trip delay in each group to calculate the time delay of the specified webpage.
It should be noted that, before the median of each group of single round-trip delays is determined, the delays in the group need to be sorted in descending or ascending order. In this embodiment of the present invention, based on the representation of each group of single round-trip delays in step 1041, and writing $RTT_{i,j}$ for the j-th value of the sorted group corresponding to the i-th IP address, $d_i$ may be determined as follows:
① when $n_i$ is odd,
$$d_i = RTT_{i,\frac{n_i+1}{2}}$$
② when $n_i$ is even,
$$d_i = \frac{1}{2}\left(RTT_{i,\frac{n_i}{2}} + RTT_{i,\frac{n_i}{2}+1}\right)$$
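Steps 1042 and 1043 can be sketched together as follows. The sketch assumes the weight of each server is its share of all accesses, n_i / (n_1 + ... + n_k), and that the specified webpage delay is the weighted sum of the group medians; this is one reading of the definitions above, and the function name is hypothetical. `statistics.median` sorts its input and averages the two middle values when the group size is even, which matches the odd and even cases described.

```python
from statistics import median
from typing import Dict, List


def webpage_delay(groups: Dict[str, List[float]]) -> float:
    """Weighted sum of per-server median RTTs (weights assumed to be access shares)."""
    total = sum(len(rtts) for rtts in groups.values())
    if total == 0:
        raise ValueError("no single round-trip delays in the designated time")
    delay = 0.0
    for rtts in groups.values():
        if not rtts:
            continue  # a server with no accesses contributes weight 0
        alpha_i = len(rtts) / total  # weight of this server
        d_i = median(rtts)           # median is robust to long-tail delays
        delay += alpha_i * d_i
    return delay
```

For example, with three delays of 1.0, 3.0 and 2.0 s to one server and two delays of 4.0 and 8.0 s to another, the weights are 0.6 and 0.4, the medians are 2.0 and 6.0, and the result is 0.6 × 2.0 + 0.4 × 6.0 = 3.6 s.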
It should be noted that, after the specified webpage time delay is calculated, the user's service perception of the specified webpage can be evaluated by comparing the specified webpage time delay with a preset threshold. If the specified webpage time delay is greater than the preset threshold, the user's service perception is poor and the webpage problem needs to be located. For example, the round-trip delay of each server is compared with the preset threshold; if the round-trip delay of one or more servers is greater than the preset threshold, the physical attribution and load state of those servers, the network path by which the user accesses them, the network link utilization, the operating state of the network equipment, and so on are analyzed in detail. The problem with the webpage service is determined through this analysis, and the network is then optimized to reduce the webpage time delay and improve user perception.
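The per-server comparison described above might look like the sketch below. Using the group median as "the round-trip delay of each server" is an assumption of this example, as are the function name and the choice to return the servers in sorted order.

```python
from statistics import median
from typing import Dict, List


def servers_over_threshold(groups: Dict[str, List[float]],
                           threshold: float) -> List[str]:
    """Return the IP addresses whose median RTT exceeds the preset threshold."""
    return sorted(ip for ip, rtts in groups.items()
                  if rtts and median(rtts) > threshold)
```

The returned addresses are the candidates for the detailed analysis of load, network path, and link utilization.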
In the embodiment of the invention, the single round-trip delays obtained within the designated time are grouped by the server accessed, which takes into account the differences in delay when the designated terminal accesses different servers and facilitates the subsequent locating of webpage problems. The median of each group is then used to calculate the specified webpage time delay. In other words, the webpage time delay is not taken from only one or a few accesses by the designated terminal to the specified webpage or its webpage elements; because the user may access the specified webpage and its webpage elements frequently within a certain time, the specified webpage time delay is calculated from all the single round-trip delays of the designated terminal accessing the specified webpage within that time, which makes the result more accurate.
An embodiment of the present invention further provides a device 20 for determining a web page delay, where the device 20 is configured to execute the method flows shown in fig. 1, fig. 3, fig. 4, and fig. 5, and as shown in fig. 6, the device 20 includes:
an obtaining module 21, configured to obtain target traffic data from all traffic data stored in the unidirectional DPI device according to the webpage identifier and the terminal IP address, where the target traffic data includes at least the access time at which the specified terminal corresponding to the terminal IP address accesses the specified webpage corresponding to the webpage identifier, and the response time at which the specified terminal responds to the specified webpage in that access.
A determining module 22, configured to determine the difference between the response time and the access time obtained by the obtaining module 21 as a single round-trip delay.
The determining module 22 is further configured to determine all single round trip delays of the specified terminal, which are obtained by the obtaining module 21, when the specified terminal accesses the specified web page within the specified time.
The determining module 22 is further configured to determine, according to all the single round-trip delays, a specified webpage delay of the specified terminal for accessing the specified webpage.
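The first two steps of the determining module 22 can be sketched as follows. This is a minimal sketch under stated assumptions: the `AccessRecord` type, its field names, and the use of seconds as the time unit are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class AccessRecord:
    """One access extracted from the target traffic data (hypothetical shape)."""
    access_time: float    # time at which the terminal issued the request, in seconds
    response_time: float  # time at which the matching response was observed, in seconds


def single_round_trip_delays(records: Iterable[AccessRecord],
                             window_start: float,
                             window_end: float) -> List[float]:
    """Return response_time - access_time for every access inside the window."""
    return [r.response_time - r.access_time
            for r in records
            if window_start <= r.access_time <= window_end]
```

Here the specified time is modelled as a closed interval on the access time; the patent text does not say whether the boundaries are inclusive.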
In an implementation manner of the embodiment of the present invention, the obtaining module 21 is further configured to obtain a uniform resource identifier URI of the specified web page, a web page element of the specified web page, and a URI of the web page element.
The obtaining module 21 is further configured to obtain a first reference field and a second reference field of the HTTP header in each data packet in all the traffic data.
The determining module 22 is further configured to determine, as the reference data packet, a data packet in all the traffic data that includes the first reference field and the second reference field if the first reference field is the same as one of the URIs of the web page elements and the second reference field is the same as the URI of the specified web page.
The obtaining module 21 is further configured to obtain a terminal IP address of the specified terminal.
The determining module 22 is further configured to determine, as a target data packet, each reference data packet whose source IP address is the same as the terminal IP address, where the target data packets constitute target traffic, and the target traffic includes at least the target traffic data.
In an implementation manner of the embodiment of the present invention, the determining module 22 is further configured to obtain a destination IP address of the target data packet, and determine the destination IP address as an IP address of the target server.
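The packet selection performed by the obtaining module 21 and the determining module 22 can be sketched as follows. Per the claims, the first reference field is the Host field combined with the Request URI field, and the second reference field is the Referer field. Everything else here is an assumption for illustration: the `Packet` type, the function names, and the premise that all URIs have been normalized to the same scheme-less form (for example `example.com/index.html`) before comparison.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set


@dataclass
class Packet:
    """Fields of one HTTP request packet as recorded by the DPI device (hypothetical shape)."""
    src_ip: str
    dst_ip: str
    host: str         # HTTP Host header
    request_uri: str  # request URI from the request line
    referer: str      # HTTP Referer header, assumed normalized like page_uri


def select_target_packets(packets: Iterable[Packet],
                          page_uri: str,
                          element_uris: Set[str],
                          terminal_ip: str) -> List[Packet]:
    """Keep packets that request an element of the page and come from the terminal."""
    targets = []
    for p in packets:
        first_ref = p.host + p.request_uri   # Host combined with Request URI
        second_ref = p.referer               # Referer
        is_reference = first_ref in element_uris and second_ref == page_uri
        if is_reference and p.src_ip == terminal_ip:
            targets.append(p)
    return targets


def target_server_ips(targets: Iterable[Packet]) -> Set[str]:
    """The destination IP of each target packet is taken as a target server IP."""
    return {p.dst_ip for p in targets}
```

In real traffic the Referer header usually carries a scheme and may carry a query string, so some normalization step would be needed before the equality test; that step is not described in the text and is omitted here.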
In an implementation manner of the embodiment of the present invention, the determining module 22 is configured to:
divide, among all the single round-trip delays, the single round-trip delays for accessing the target server corresponding to the same IP address into one group;
determine, according to the formula
Figure BDA0001486128400000131
a weight α corresponding to each target server, where α_i denotes the weight of the target server corresponding to the i-th IP address, n_i denotes the number of times the specified terminal accesses the target server corresponding to the i-th IP address within the specified time, and k denotes the total number of target servers;
calculate, according to the formula
Figure BDA0001486128400000132
the specified webpage delay d, where d_i denotes the median of the group formed by the single round-trip delays for accessing the target server corresponding to the i-th IP address.
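The grouping, weighting, and median steps can be sketched as follows. The two formula figures are not reproduced in this text, so the sketch assumes the forms most consistent with the stated definitions: α_i = n_i / (n_1 + ... + n_k) for the weights, and d = α_1·d_1 + ... + α_k·d_k for the specified webpage delay. These forms are an assumption, not a transcription of the figures, and the function name `page_delay` is hypothetical.

```python
from collections import defaultdict
from statistics import median
from typing import Dict, Iterable, List, Tuple


def page_delay(samples: Iterable[Tuple[str, float]]) -> float:
    """Weighted sum of per-server median delays.

    samples: (target server IP, single round-trip delay) pairs observed
    within the specified time.
    """
    groups: Dict[str, List[float]] = defaultdict(list)
    for server_ip, rtt in samples:
        groups[server_ip].append(rtt)      # one group per target server IP
    if not groups:
        raise ValueError("no single round-trip delays to aggregate")

    total_accesses = sum(len(rtts) for rtts in groups.values())
    delay = 0.0
    for rtts in groups.values():
        alpha_i = len(rtts) / total_accesses   # assumed: n_i / sum of all n_j
        d_i = median(rtts)                     # median of the group
        delay += alpha_i * d_i
    return delay
```

For example, three accesses to one server with delays 10, 30, and 20, plus one access to a second server with delay 40, give medians 20 and 40 with weights 0.75 and 0.25 under the assumed formula.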
In the prior art, an operator cannot obtain the webpage delay for all users accessing a webpage, because probes cannot be installed in some areas. By contrast, the device for determining webpage delay provided by the embodiment of the invention obtains, from the unidirectional DPI device, the traffic data of the terminal accessing the webpage, and determines from that traffic data the webpage delay when the specified terminal accesses the specified webpage. When the webpage delay of a particular user at a particular moment is needed, only that user's traffic data at that moment has to be filtered out of the traffic data to calculate the webpage delay, so the calculated webpage delay reflects the user's real-time service perception.
As shown in fig. 7, an embodiment of the present application provides a schematic structural diagram of a network device. The network device 30 includes a processor 31 and a transceiver 33. The processor 31 is configured to control and manage the actions of the network device 30, for example, to perform the steps performed by the determining module 22 described above, and/or to perform other processes of the techniques described herein. The transceiver 33 is configured to support communication between the network device and other network entities, for example, to perform the steps performed by the obtaining module 21. The network device 30 may also include a memory 32 and a bus 34, where the memory 32 is configured to store program code and data of the network device.
The processor 31 may implement or execute the various illustrative logical blocks, modules, and circuits described in connection with the disclosure herein. The processor 31 may be a central processing unit, a general purpose processor, a digital signal processor, an application specific integrated circuit, a field programmable gate array or other programmable logic device, a transistor logic device, a hardware component, or any combination thereof. The processor may also be a combination that implements a computing function, for example, a combination of one or more microprocessors, or a combination of a digital signal processor (DSP) and a microprocessor.
Memory 32 may include volatile memory, such as random access memory; the memory 32 may also include non-volatile memory, such as read-only memory, flash memory, a hard disk, or a solid state disk; the memory may also comprise a combination of memories of the kind described above.
The bus 34 may be an Extended Industry Standard Architecture (EISA) bus or the like. The bus 34 may be divided into an address bus, a data bus, a control bus, and the like. For ease of illustration, only one thick line is shown in FIG. 7, but this is not intended to represent only one bus or type of bus.
Through the above description of the embodiments, it is clear to those skilled in the art that, for convenience and simplicity of description, the foregoing division of the functional modules is merely used as an example, and in practical applications, the above function distribution may be completed by different functional modules according to needs, that is, the internal structure of the device may be divided into different functional modules to complete all or part of the above described functions. For the specific working processes of the system, the apparatus and the unit described above, reference may be made to the corresponding processes in the foregoing method embodiments, and details are not described here again.
The steps of a method or algorithm described in connection with the disclosure herein may be embodied in hardware or in software instructions executed by a processor. The software instructions may consist of corresponding software modules that may be stored in RAM, flash memory, ROM, Erasable Programmable Read Only Memory (EPROM), Electrically Erasable Programmable Read Only Memory (EEPROM), registers, a hard disk, a removable hard disk, a compact disc read only memory (CD-ROM), or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor such that the processor can read information from, and write information to, the storage medium. Of course, the storage medium may also be integral to the processor. The processor and the storage medium may reside in an Application Specific Integrated Circuit (ASIC). Computer-readable media include both computer storage media and communication media, including any medium that facilitates transfer of a computer program from one place to another. A storage medium may be any available medium that can be accessed by a general purpose or special purpose computer. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a Read-Only Memory (ROM), an Erasable Programmable Read-Only Memory (EPROM), an optical fiber, a portable Compact Disc Read-Only Memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
In embodiments of the present application, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
The above is only an embodiment of the present application, but the scope of the present application is not limited thereto, and any changes or substitutions within the technical scope of the present disclosure should be covered by the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (4)

1. A method for determining webpage delay, wherein the method comprises:
obtaining, according to a webpage identifier and a terminal Internet Protocol (IP) address, target traffic data from all traffic data stored by a unidirectional deep packet inspection (DPI) device, the target traffic data including at least an access time at which a specified terminal corresponding to the terminal IP address accesses a specified webpage corresponding to the webpage identifier, and a response time at which the specified terminal responds to the specified webpage in that access;
determining the difference between the response time and the access time as a single round-trip delay;
determining all single round-trip delays of the specified terminal accessing the specified webpage within a specified time;
determining, according to all the single round-trip delays, a specified webpage delay of the specified terminal accessing the specified webpage;
wherein the obtaining, according to the webpage identifier and the terminal IP address, of the target traffic data from all the traffic data stored by the unidirectional DPI device comprises:
obtaining a uniform resource identifier (URI) of the specified webpage, webpage elements of the specified webpage, and URIs of the webpage elements;
obtaining, from each data packet in all the traffic data, a first reference field and a second reference field of the Hypertext Transfer Protocol (HTTP) header, the first reference field being obtained by combining the Host field and the Request URI field of the HTTP header, and the second reference field being the Referer field of the HTTP header;
if the first reference field is the same as one of the URIs of the webpage elements and the second reference field is the same as the URI of the specified webpage, determining the data packets in all the traffic data that include the first reference field and the second reference field as reference data packets;
obtaining the terminal IP address of the specified terminal;
determining, among the reference data packets, the reference data packets whose source IP address is the same as the terminal IP address as target data packets, wherein the target data packets constitute target traffic, and the target traffic includes at least the target traffic data;
wherein, after the determining of the reference data packets whose source IP address is the same as the terminal IP address as target data packets, the method further comprises:
obtaining a destination IP address of the target data packet, and determining the destination IP address as the IP address of a target server;
wherein the determining, according to all the single round-trip delays, of the specified webpage delay of the specified terminal accessing the specified webpage comprises:
dividing, among all the single round-trip delays, the single round-trip delays for accessing the target server corresponding to the same IP address into one group, wherein all the single round-trip delays are derived from the single round-trip delays of accessing the webpage and the single round-trip delays of accessing the webpage elements;
determining, according to the formula
Figure FDA0002829797170000021
a weight α corresponding to each target server, where α_i denotes the weight of the target server corresponding to the i-th IP address, n_i denotes the number of times the specified terminal accesses the target server corresponding to the i-th IP address within the specified time, and k denotes the total number of target servers;
calculating, according to the formula
Figure FDA0002829797170000022
the specified webpage delay d, where d_i denotes the median of the group formed by the single round-trip delays for accessing the target server corresponding to the i-th IP address.
2. A device for determining webpage delay, wherein the device comprises:
an obtaining module, configured to obtain, according to a webpage identifier and a terminal IP address, target traffic data from all traffic data stored by a unidirectional DPI device, the target traffic data including at least an access time at which a specified terminal corresponding to the terminal IP address accesses a specified webpage corresponding to the webpage identifier, and a response time at which the specified terminal responds to the specified webpage in that access;
a determining module, configured to determine the difference between the response time and the access time obtained by the obtaining module as a single round-trip delay;
the determining module being further configured to determine all single round-trip delays, obtained by the obtaining module, of the specified terminal accessing the specified webpage within a specified time;
the determining module being further configured to determine, according to all the single round-trip delays, a specified webpage delay of the specified terminal accessing the specified webpage;
the obtaining module being further configured to obtain a uniform resource identifier (URI) of the specified webpage, webpage elements of the specified webpage, and URIs of the webpage elements;
the obtaining module being further configured to obtain, from each data packet in all the traffic data, a first reference field and a second reference field of the Hypertext Transfer Protocol (HTTP) header, the first reference field being obtained by combining the Host field and the Request URI field of the HTTP header, and the second reference field being the Referer field of the HTTP header;
the determining module being further configured to, if the first reference field is the same as one of the URIs of the webpage elements and the second reference field is the same as the URI of the specified webpage, determine the data packets in all the traffic data that include the first reference field and the second reference field as reference data packets;
the obtaining module being further configured to obtain the terminal IP address of the specified terminal;
the determining module being further configured to determine, among the reference data packets, the reference data packets whose source IP address is the same as the terminal IP address as target data packets, wherein the target data packets constitute target traffic, and the target traffic includes at least the target traffic data;
the determining module being further configured to obtain a destination IP address of the target data packet, and determine the destination IP address as the IP address of a target server;
the determining module being configured to:
divide, among all the single round-trip delays, the single round-trip delays for accessing the target server corresponding to the same IP address into one group, wherein all the single round-trip delays are derived from the single round-trip delays of accessing the webpage and the single round-trip delays of accessing the webpage elements;
determine, according to the formula
Figure FDA0002829797170000041
a weight α corresponding to each target server, where α_i denotes the weight of the target server corresponding to the i-th IP address, n_i denotes the number of times the specified terminal accesses the target server corresponding to the i-th IP address within the specified time, and k denotes the total number of target servers;
calculate, according to the formula
Figure FDA0002829797170000042
the specified webpage delay d, where d_i denotes the median of the group formed by the single round-trip delays for accessing the target server corresponding to the i-th IP address.
3. A network device, wherein the network device comprises:
a memory, configured to store one or more computer programs, the one or more computer programs comprising instructions;
wherein, when a processor executes the instructions, the network device is caused to perform the method for determining webpage delay according to claim 1.
4. A computer-readable storage medium storing instructions, wherein, when the instructions are run on a network device, the network device is caused to perform the method for determining webpage delay according to claim 1.
CN201711219239.8A 2017-11-28 2017-11-28 Method and device for determining webpage time delay Active CN107992416B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711219239.8A CN107992416B (en) 2017-11-28 2017-11-28 Method and device for determining webpage time delay

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711219239.8A CN107992416B (en) 2017-11-28 2017-11-28 Method and device for determining webpage time delay

Publications (2)

Publication Number Publication Date
CN107992416A CN107992416A (en) 2018-05-04
CN107992416B true CN107992416B (en) 2021-02-23

Family

ID=62033743

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711219239.8A Active CN107992416B (en) 2017-11-28 2017-11-28 Method and device for determining webpage time delay

Country Status (1)

Country Link
CN (1) CN107992416B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112468371B (en) * 2020-12-01 2022-06-10 武汉绿色网络信息服务有限责任公司 A method and device for determining transmission delay information of a DPI device
CN113890828B (en) * 2021-09-30 2024-04-02 北京明略昭辉科技有限公司 Method and device for adjusting bandwidth, electronic equipment and storage medium
CN114186124B (en) * 2021-11-22 2024-12-20 北京达佳互联信息技术有限公司 Information push method, device, electronic device and storage medium
CN113961852B (en) * 2021-12-21 2022-03-11 江苏同步软件技术有限公司 Webpage communication system optimization method and system based on big data

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014151591A2 (en) * 2013-03-15 2014-09-25 Enterasys Networks, Inc. A device, a system and a related method for dynamic traffic mirroring and policy, and the determination of applications running on a network

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070299965A1 (en) * 2006-06-22 2007-12-27 Jason Nieh Management of client perceived page view response time
US8732569B2 (en) * 2011-05-04 2014-05-20 Google Inc. Predicting user navigation events
CN104581753B (en) * 2013-10-09 2018-06-26 中国移动通信集团设计院有限公司 A kind of method, apparatus and terminal for calculating webpage loading time delay
CN105634836B (en) * 2014-10-27 2020-03-17 香港理工大学 Information processing method and device
CN104994133B (en) * 2015-05-22 2018-08-21 华中科技大学 A kind of mobile Web web page access user experience perception evaluating method based on network KPI
CN106470129B (en) * 2015-08-19 2019-08-20 中国电信股份有限公司 Realize method, system, server and the terminal of webpage head screen latency measurement
CN106611010B (en) * 2015-10-26 2020-02-07 北京国双科技有限公司 Method and device for determining webpage loading speed
CN106649384B (en) * 2015-11-03 2019-07-09 中国电信股份有限公司 The method and apparatus classified to URL
CN105373617A (en) * 2015-11-27 2016-03-02 中国联合网络通信集团有限公司 Web page first screen determination method and apparatus
CN106656666B (en) * 2016-12-13 2020-05-22 中国联合网络通信集团有限公司 Method and device for acquiring first screen time of webpage

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Website Response Time; Jakob Nielsen; https://www.nngroup.com/articles/website-response-times/; 2010-06-20; full text *

Also Published As

Publication number Publication date
CN107992416A (en) 2018-05-04

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant