[go: up one dir, main page]

CN115577306B - Self-adaptive density clustering-based method for detecting travel tide area of shared bicycle - Google Patents

Self-adaptive density clustering-based method for detecting travel tide area of shared bicycle Download PDF

Info

Publication number
CN115577306B
CN115577306B CN202211562330.0A CN202211562330A CN115577306B CN 115577306 B CN115577306 B CN 115577306B CN 202211562330 A CN202211562330 A CN 202211562330A CN 115577306 B CN115577306 B CN 115577306B
Authority
CN
China
Prior art keywords
neighborhood
spatial
points
space
density
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202211562330.0A
Other languages
Chinese (zh)
Other versions
CN115577306A (en
Inventor
石岩
王达
汪晓龙
陈炳蓉
邓敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Central South University
Original Assignee
Central South University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Central South University filed Critical Central South University
Priority to CN202211562330.0A priority Critical patent/CN115577306B/en
Publication of CN115577306A publication Critical patent/CN115577306A/en
Application granted granted Critical
Publication of CN115577306B publication Critical patent/CN115577306B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0645Rental transactions; Leasing transactions

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Traffic Control Systems (AREA)

Abstract

本发明实施例中提供了一种基于自适应密度聚类的共享单车出行潮汐区域探测方法,属于计算技术领域,具体包括:步骤1,对目标区域内共享单车出行订单产生的轨迹点进行数据清洗后提取共享单车的潜在借还区域;步骤2,设计核密度估计函数构建轨迹点空间密度场,通过度量空间密度场的不稳定性自适应地构建轨迹点空间邻域,并据此构建空间密度显著性检验函数提取空间密集邻域;步骤3,根据空间密集邻域和轨迹点密度特征显著性检验自适应识别轨迹点潮汐邻域并基于密度可达扩展提取共享单车出行潮汐区域。通过本发明的方案,对共享单车起止点聚集区域自适应识别,以使得共享单车出行潮汐区域自适应判别与提取,提高了探测过程的适应性和精准度。

Figure 202211562330

The embodiment of the present invention provides a tidal area detection method for shared bicycle travel based on adaptive density clustering, which belongs to the field of computing technology, and specifically includes: Step 1, performing data cleaning on the trajectory points generated by shared bicycle travel orders in the target area Then extract the potential borrowing and returning areas of shared bicycles; step 2, design the kernel density estimation function to construct the spatial density field of trajectory points, and adaptively construct the spatial neighborhood of trajectory points by measuring the instability of the spatial density field, and construct the spatial density accordingly The significance test function extracts the spatially dense neighborhood; step 3, according to the significance test of the spatially dense neighborhood and the trajectory point density feature, adaptively identifies the tidal neighborhood of the trajectory point and extracts the tidal area of shared bicycle travel based on the density reachable extension. Through the solution of the present invention, the starting and ending points of the shared bicycles are adaptively identified to gather areas, so that the tidal regions of the shared bicycles are adaptively distinguished and extracted, and the adaptability and accuracy of the detection process are improved.

Figure 202211562330

Description

基于自适应密度聚类的共享单车出行潮汐区域探测方法A method for detecting tidal areas of shared bicycle travel based on adaptive density clustering

技术领域Technical Field

本发明实施例涉及计算技术领域,尤其涉及一种基于自适应密度聚类的共享单车出行潮汐区域探测方法。The embodiments of the present invention relate to the field of computing technology, and in particular to a method for detecting tidal areas for shared bicycle travel based on adaptive density clustering.

背景技术Background Art

当前共享单车潮汐区域探测方法主要分为基于空间单元特征度量和基于空间轨迹聚类探测两种。其中,基于空间单元特征度量的方法依赖电子围栏或空间网格等硬性空间划分,通过设置参数阈值提取净流量或留存密度过大的空间单元。然而由于空间定位设备自身误差和停车落锁系统容差,共享单车并不总是严格地停放在电子围栏范围内,因此,空间单元划分极易低估共享单车潮汐范围和失衡程度。同时空间单元特征阈值设置依赖于专家经验知识,极大限制了此类方法的结果可靠性和模型迁移性。基于空间轨迹聚类探测的方法避免了空间单元划分问题,通过共享单车起止定位点聚类直接反映真实的单车借还聚集区域。然而,现有这类方法聚类过程中割裂了共享单车起止定位点的分布特征,对于潮汐区域的判别仍未摆脱人工设置阈值方式,极易将共享单车流动量大而借还动态均衡的区域错误地识别为潮汐区域。At present, the detection methods of shared bicycle tidal areas are mainly divided into two types: based on spatial unit feature measurement and based on spatial trajectory clustering detection. Among them, the method based on spatial unit feature measurement relies on hard spatial divisions such as electronic fences or spatial grids, and extracts spatial units with excessive net flow or retention density by setting parameter thresholds. However, due to the errors of the spatial positioning equipment itself and the tolerance of the parking and locking system, shared bicycles are not always strictly parked within the scope of the electronic fence. Therefore, spatial unit division is very likely to underestimate the tidal range and imbalance of shared bicycles. At the same time, the setting of spatial unit feature thresholds depends on expert experience and knowledge, which greatly limits the reliability of the results and model transferability of such methods. The method based on spatial trajectory clustering detection avoids the problem of spatial unit division and directly reflects the real bicycle borrowing and returning aggregation area through the clustering of shared bicycle start and end positioning points. However, the existing methods of this type split the distribution characteristics of the shared bicycle start and end positioning points in the clustering process, and the identification of tidal areas has not yet gotten rid of the manual setting of thresholds, which can easily mistakenly identify areas with large shared bicycle flow and dynamic balance of borrowing and returning as tidal areas.

可见,亟需一种充分顾及出行空间特征和共享单车轨迹特征的基于自适应密度聚类的共享单车出行潮汐区域探测方法。It can be seen that there is an urgent need for a shared bicycle travel tidal area detection method based on adaptive density clustering that fully takes into account the travel space characteristics and shared bicycle trajectory characteristics.

发明内容Summary of the invention

有鉴于此,本发明实施例提供一种基于自适应密度聚类的共享单车出行潮汐区域探测方法,至少部分解决现有技术中存在适应性和精准度较差的问题。In view of this, an embodiment of the present invention provides a method for detecting tidal areas for shared bicycle travel based on adaptive density clustering, which at least partially solves the problems of poor adaptability and accuracy in the prior art.

本发明实施例提供了一种基于自适应密度聚类的共享单车出行潮汐区域探测方法,包括:The embodiment of the present invention provides a method for detecting tidal areas of shared bicycle travel based on adaptive density clustering, comprising:

步骤1,对目标区域内共享单车出行订单产生的轨迹点进行数据清洗后提取共享单车的潜在借还区域;Step 1: Clean the data of the trajectory points generated by the shared bicycle travel orders in the target area and extract the potential borrowing and returning areas of the shared bicycles;

步骤2,设计核密度估计函数构建轨迹点空间密度场,通过度量空间密度场的不稳定性自适应地构建轨迹点空间邻域,并据此构建空间密度显著性检验函数提取空间密集邻域;Step 2: Design a kernel density estimation function to construct the spatial density field of trajectory points, adaptively construct the spatial neighborhood of trajectory points by measuring the instability of the spatial density field, and construct a spatial density significance test function to extract the spatial dense neighborhood based on it;

步骤3,根据空间密集邻域和轨迹点密度特征显著性检验自适应识别轨迹点潮汐邻域并基于密度可达扩展提取共享单车出行潮汐区域。Step 3: Adaptively identify the tidal neighborhood of the trajectory points based on the significance test of spatial dense neighborhood and trajectory point density features, and extract the tidal area of shared bicycle travel based on density reachable expansion.

根据本发明实施例的一种具体实现方式,所述步骤1具体包括:According to a specific implementation of the embodiment of the present invention, step 1 specifically includes:

步骤1.1,删除目标区域范围外、水系和建筑物内的轨迹数据;Step 1.1, delete the trajectory data outside the target area, within the water system and buildings;

步骤1.2,删除时间戳格式异常的轨迹数据,将每辆共享单车订单轨迹点分别按时间戳升序排列,对于开关锁状态连续的异常情况仅保留一条记录;Step 1.2, delete the trajectory data with abnormal timestamp format, sort the trajectory points of each shared bicycle order in ascending order by timestamp, and only keep one record for the abnormal situation of continuous switch lock status;

步骤1.3,对于位置坐标重复的同一辆共享单车轨迹点,仅保留时间戳最早的一条记录;Step 1.3: For the same shared bicycle track points with repeated location coordinates, only the record with the earliest timestamp is retained;

步骤1.4,删除轨迹点时间戳间隔在预设范围之外的一对轨迹点;Step 1.4, deleting a pair of trajectory points whose timestamp interval is outside the preset range;

步骤1.5,将每条轨迹点按预设半径生成空间缓冲区,将所有轨迹点生成的空间缓冲区叠置合并构建单车潜在借还区域。Step 1.5, generate a spatial buffer zone for each trajectory point according to a preset radius, and overlap and merge the spatial buffer zones generated by all trajectory points to construct a potential bicycle borrowing and returning area.

根据本发明实施例的一种具体实现方式,所述步骤2具体包括:According to a specific implementation of the embodiment of the present invention, step 2 specifically includes:

步骤2.1,设计核密度估计函数,并给定单车潜在借还区域内任一轨迹点得到其空间位置处的核密度相对值近似表达式作为轨迹点空间密度场;Step 2.1, design a kernel density estimation function, and given any trajectory point in the potential borrowing and returning area of a bicycle, obtain an approximate expression of the relative value of the kernel density at its spatial position as the spatial density field of the trajectory point;

步骤2.2,根据利用熵函数度量轨迹点空间密度场所有轨迹点所在位置的空间核密度不稳定性,构建轨迹点空间邻域;Step 2.2, constructing the spatial neighborhood of the trajectory point based on the instability of the spatial kernel density of all trajectory points at the locations of the trajectory point spatial density field using the entropy function;

步骤2.3,在空间随机分布假设下,计算任一空间邻域内出现k个其他轨迹点的理论概率分布,并给定任一轨迹点及其对应的空间邻域并计算概率分布值,将概率分布值大于显著性水平值的空间邻域作为空间密集邻域。Step 2.3, under the assumption of spatial random distribution, calculate the theoretical probability distribution of k other trajectory points appearing in any spatial neighborhood, and given any trajectory point and its corresponding spatial neighborhood and calculate the probability distribution value, the spatial neighborhood with a probability distribution value greater than the significance level value is regarded as a spatial dense neighborhood.

根据本发明实施例的一种具体实现方式,所述核密度相对值近似表达式为

Figure 391613DEST_PATH_IMAGE001
,式中,
Figure 715278DEST_PATH_IMAGE002
表示点
Figure 26174DEST_PATH_IMAGE003
和点
Figure 359066DEST_PATH_IMAGE004
间的欧氏空间距离,
Figure 78498DEST_PATH_IMAGE004
表示研究区域内所有满足
Figure 432119DEST_PATH_IMAGE005
的轨迹点,h表示核密度函数带宽;According to a specific implementation of an embodiment of the present invention, the approximate expression of the relative value of the kernel density is:
Figure 391613DEST_PATH_IMAGE001
, where
Figure 715278DEST_PATH_IMAGE002
Indicate point
Figure 26174DEST_PATH_IMAGE003
and Point
Figure 359066DEST_PATH_IMAGE004
The Euclidean distance between
Figure 78498DEST_PATH_IMAGE004
Indicates that all the
Figure 432119DEST_PATH_IMAGE005
The trajectory points, h represents the bandwidth of the kernel density function;

所述熵函数的表达式为

Figure 636836DEST_PATH_IMAGE006
The expression of the entropy function is:
Figure 636836DEST_PATH_IMAGE006

Figure 101315DEST_PATH_IMAGE007
,式中,N表示目标区域内所有轨迹点数量;
Figure 101315DEST_PATH_IMAGE007
, where N represents the number of all trajectory points in the target area;

所述任一空间邻域内出现k个其他轨迹点的理论概率分布的表达式为The theoretical probability distribution of k other trajectory points appearing in any spatial neighborhood is expressed as

Figure 379981DEST_PATH_IMAGE008
Figure 379981DEST_PATH_IMAGE008

Figure 481667DEST_PATH_IMAGE009
,式中,N表示研究区域内所有OD点数量,Area()表示面积计算函数,SNODA分别表示任一轨迹点空间邻域和单车潜在借还区域;
Figure 481667DEST_PATH_IMAGE009
, where N represents the number of all OD points in the study area, Area () represents the area calculation function, SN and ODA represent the spatial neighborhood of any trajectory point and the potential borrowing and returning area of a single vehicle, respectively;

所述概率分布值的表达式为

Figure 908100DEST_PATH_IMAGE010
The expression of the probability distribution value is:
Figure 908100DEST_PATH_IMAGE010

式中,

Figure 910691DEST_PATH_IMAGE011
表示点
Figure 807978DEST_PATH_IMAGE003
空间邻域
Figure 503401DEST_PATH_IMAGE012
内其他轨迹点的数量。In the formula,
Figure 910691DEST_PATH_IMAGE011
Indicate point
Figure 807978DEST_PATH_IMAGE003
Spatial neighborhood
Figure 503401DEST_PATH_IMAGE012
The number of other trajectory points within.

根据本发明实施例的一种具体实现方式,所述步骤2.2具体包括:According to a specific implementation of the embodiment of the present invention, step 2.2 specifically includes:

利用熵函数度量研究区域内所有轨迹点所在位置的空间核密度不稳定性,给定

Figure 682710DEST_PATH_IMAGE013
取值范围
Figure 629937DEST_PATH_IMAGE014
和步长
Figure 804567DEST_PATH_IMAGE015
,迭代计算
Figure 811837DEST_PATH_IMAGE016
,将
Figure 717257DEST_PATH_IMAGE016
最小值对应的
Figure 327230DEST_PATH_IMAGE013
设置为目标区域轨迹点的空间邻域长度;The entropy function is used to measure the spatial kernel density instability of the locations of all trajectory points in the study area.
Figure 682710DEST_PATH_IMAGE013
Value range
Figure 629937DEST_PATH_IMAGE014
and step length
Figure 804567DEST_PATH_IMAGE015
, iterative calculation
Figure 811837DEST_PATH_IMAGE016
,Will
Figure 717257DEST_PATH_IMAGE016
The minimum value corresponds to
Figure 327230DEST_PATH_IMAGE013
Set to the spatial neighborhood length of the trajectory points in the target area;

以任一轨迹点为圆心、空间邻域长度为半径构建轨迹点空间邻域。The spatial neighborhood of a trajectory point is constructed with any trajectory point as the center and the length of the spatial neighborhood as the radius.

根据本发明实施例的一种具体实现方式,所述步骤3具体包括:According to a specific implementation of the embodiment of the present invention, step 3 specifically includes:

步骤3.1,计算任一个空间密集邻域内出现k个共享单车出行起点的理论概率;Step 3.1, calculate the theoretical probability of k shared bicycle trip starting points appearing in any spatially dense neighborhood;

步骤3.2,根据步骤3.1的计算结果计算任一轨迹点对应的空间密集邻域成为目标区域内空间潮汐源邻域的概率值,并将概率值大于显著性水平值的空间密集邻域作为空间潮汐源邻域,将概率值小于显著性水平值的空间密集邻域作为空间潮汐汇邻域;Step 3.2, according to the calculation results of step 3.1, calculate the probability value of the spatial dense neighborhood corresponding to any trajectory point becoming the spatial tidal source neighborhood in the target area, and take the spatial dense neighborhood with a probability value greater than the significance level value as the spatial tidal source neighborhood, and take the spatial dense neighborhood with a probability value less than the significance level value as the spatial tidal sink neighborhood;

步骤3.3,以任一空间潮汐源邻域

Figure 497311DEST_PATH_IMAGE017
为种子点,定义
Figure 941062DEST_PATH_IMAGE017
内其他起点
Figure 422859DEST_PATH_IMAGE018
Figure 243047DEST_PATH_IMAGE003
密度可达,将所有与
Figure 766170DEST_PATH_IMAGE003
密度可达点聚合为空间潮汐源区域
Figure 974298DEST_PATH_IMAGE019
,对于
Figure 349915DEST_PATH_IMAGE019
内满足空间潮汐源邻域判别的其他点
Figure 442636DEST_PATH_IMAGE018
,继续执行聚合操作更新
Figure 180785DEST_PATH_IMAGE019
,直到
Figure 231918DEST_PATH_IMAGE019
内所有满足空间潮汐源邻域判别的其他点全部访问完毕,重新选取目标区域内未被访问的其他空间潮汐源邻域重复上述过程,最终得到目标区域内提取的全部空间潮汐源区域表示为
Figure 327787DEST_PATH_IMAGE020
;Step 3.3, take any spatial tidal source neighborhood
Figure 497311DEST_PATH_IMAGE017
is the seed point, and defines
Figure 941062DEST_PATH_IMAGE017
Other starting points
Figure 422859DEST_PATH_IMAGE018
and
Figure 243047DEST_PATH_IMAGE003
The density can be reached, all
Figure 766170DEST_PATH_IMAGE003
Density-reachable points are aggregated into spatial tidal source regions
Figure 974298DEST_PATH_IMAGE019
,for
Figure 349915DEST_PATH_IMAGE019
Other points that meet the spatial tidal source neighborhood judgment
Figure 442636DEST_PATH_IMAGE018
, continue to perform aggregate operation updates
Figure 180785DEST_PATH_IMAGE019
,until
Figure 231918DEST_PATH_IMAGE019
After all other points in the target area that meet the spatial tidal source neighborhood judgment are visited, other spatial tidal source neighborhoods that have not been visited in the target area are reselected to repeat the above process. Finally, all spatial tidal source areas extracted in the target area are expressed as
Figure 327787DEST_PATH_IMAGE020
;

步骤3.4,以任一空间潮汐汇邻域

Figure 817675DEST_PATH_IMAGE021
为种子点,定义
Figure 816855DEST_PATH_IMAGE021
内其他终点
Figure 773309DEST_PATH_IMAGE018
Figure 592361DEST_PATH_IMAGE003
密度可达,将所有与
Figure 417097DEST_PATH_IMAGE003
密度可达点聚合为空间潮汐汇区域
Figure 769319DEST_PATH_IMAGE022
,对于
Figure 224571DEST_PATH_IMAGE022
内满足空间潮汐汇邻域判别的其他点
Figure 62077DEST_PATH_IMAGE018
,继续执行聚合操作更新
Figure 300292DEST_PATH_IMAGE022
,直到
Figure 867539DEST_PATH_IMAGE022
内所有满足空间潮汐汇邻域判别的其他点全部访问完毕,重新选取目标区域内未被访问的其他空间潮汐汇邻域重复上述过程,最终得到目标区域内提取的全部空间潮汐汇区域表示为
Figure 900217DEST_PATH_IMAGE023
,目标区域内提取的全部空间潮汐区域表示为潮汐源区域和潮汐汇区域的并集Step 3.4, take any spatial tidal sink neighborhood
Figure 817675DEST_PATH_IMAGE021
is the seed point, and defines
Figure 816855DEST_PATH_IMAGE021
Other endpoints
Figure 773309DEST_PATH_IMAGE018
and
Figure 592361DEST_PATH_IMAGE003
The density can be reached, all
Figure 417097DEST_PATH_IMAGE003
Density-reachable points are aggregated into spatial tidal sink areas
Figure 769319DEST_PATH_IMAGE022
,for
Figure 224571DEST_PATH_IMAGE022
Other points that meet the spatial tidal sink neighborhood judgment
Figure 62077DEST_PATH_IMAGE018
, continue to perform aggregate operation updates
Figure 300292DEST_PATH_IMAGE022
,until
Figure 867539DEST_PATH_IMAGE022
After all other points in the target area that meet the spatial tidal sink neighborhood judgment are visited, other spatial tidal sink neighborhoods that have not been visited in the target area are reselected to repeat the above process. Finally, all spatial tidal sink areas extracted in the target area are expressed as
Figure 900217DEST_PATH_IMAGE023
, all spatial tidal areas extracted within the target area are represented as the union of tidal source areas and tidal sink areas

Figure 209974DEST_PATH_IMAGE024
Figure 209974DEST_PATH_IMAGE024
.

根据本发明实施例的一种具体实现方式,所述任一个空间密集邻域内出现k个共享单车出行起点的理论概率的表达式为According to a specific implementation of an embodiment of the present invention, the expression for the theoretical probability of k shared bicycle travel starting points appearing in any spatially dense neighborhood is:

Figure 376513DEST_PATH_IMAGE025
Figure 376513DEST_PATH_IMAGE025

式中,

Figure 939213DEST_PATH_IMAGE026
表示空间密集邻域内起点数量,
Figure 142792DEST_PATH_IMAGE027
表示空间密集邻域内轨迹点总数量,p表示目标区域内出行起点比例;In the formula,
Figure 939213DEST_PATH_IMAGE026
represents the number of starting points in a spatially dense neighborhood,
Figure 142792DEST_PATH_IMAGE027
represents the total number of trajectory points in the spatially dense neighborhood, and p represents the proportion of travel starting points in the target area;

所述概率值的表达式为The expression of the probability value is:

Figure 689311DEST_PATH_IMAGE028
Figure 689311DEST_PATH_IMAGE028

式中,

Figure 659541DEST_PATH_IMAGE029
表示
Figure 309703DEST_PATH_IMAGE030
内起点数量,
Figure 215342DEST_PATH_IMAGE031
表示
Figure 108212DEST_PATH_IMAGE032
内轨迹点总数量。In the formula,
Figure 659541DEST_PATH_IMAGE029
express
Figure 309703DEST_PATH_IMAGE030
The number of internal starting points,
Figure 215342DEST_PATH_IMAGE031
express
Figure 108212DEST_PATH_IMAGE032
The total number of inner track points.

本发明实施例中的基于自适应密度聚类的共享单车出行潮汐区域探测方案,包括:步骤1,对目标区域内共享单车出行订单产生的轨迹点进行数据清洗后提取共享单车的潜在借还区域;步骤2,设计核密度估计函数构建轨迹点空间密度场,通过度量空间密度场的不稳定性自适应地构建轨迹点空间邻域,并据此构建空间密度显著性检验函数提取空间密集邻域;步骤3,根据空间密集邻域和轨迹点密度特征显著性检验自适应识别轨迹点潮汐邻域并基于密度可达扩展提取共享单车出行潮汐区域。The shared bicycle travel tidal area detection scheme based on adaptive density clustering in the embodiment of the present invention includes: step 1, extracting the potential borrowing and returning areas of shared bicycles after data cleaning of the trajectory points generated by shared bicycle travel orders in the target area; step 2, designing a kernel density estimation function to construct the spatial density field of the trajectory points, adaptively constructing the spatial neighborhood of the trajectory points by measuring the instability of the spatial density field, and constructing a spatial density significance test function based on this to extract the spatial dense neighborhood; step 3, adaptively identifying the tidal neighborhood of the trajectory points according to the significance test of the spatial dense neighborhood and the trajectory point density features, and extracting the shared bicycle travel tidal area based on the density reachable expansion.

本发明实施例的有益效果为:通过本发明的方案,充分顾及出行空间特征和共享单车轨迹特征,对共享单车起止点聚集区域自适应识别,以使得共享单车出行潮汐区域自适应判别与提取,提高了探测过程的适应性和精准度。The beneficial effects of the embodiments of the present invention are as follows: through the scheme of the present invention, the travel space characteristics and the shared bicycle trajectory characteristics are fully taken into account, and the shared bicycle starting and ending point aggregation areas are adaptively identified, so that the shared bicycle travel tidal areas can be adaptively identified and extracted, thereby improving the adaptability and accuracy of the detection process.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本发明实施例的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其它的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for use in the embodiments will be briefly introduced below. Obviously, the drawings described below are only some embodiments of the present invention. For ordinary technicians in this field, other drawings can be obtained based on these drawings without creative work.

图1为本发明实施例提供的一种基于自适应密度聚类的共享单车出行潮汐区域探测方法的流程示意图;FIG1 is a schematic flow chart of a method for detecting tidal areas for shared bicycle travel based on adaptive density clustering according to an embodiment of the present invention;

图2为本发明实施例提供的一种基于自适应密度聚类的共享单车出行潮汐区域探测方法的具体实施流程示意图;FIG2 is a schematic diagram of a specific implementation process of a method for detecting tidal areas for shared bicycle travel based on adaptive density clustering provided by an embodiment of the present invention;

图3为本发明实施例提供的一种以某地作为目标区域示意图;FIG3 is a schematic diagram of a certain place as a target area provided by an embodiment of the present invention;

图4为本发明实施例提供的一种潮汐区域整体探测结果示意图;FIG4 is a schematic diagram of an overall detection result of a tidal area provided by an embodiment of the present invention;

图5为本发明实施例提供的一种某地区地铁站周边潮汐区域探测结果示意图。FIG5 is a schematic diagram of tidal area detection results around a subway station in a certain area provided by an embodiment of the present invention.

具体实施方式DETAILED DESCRIPTION

下面结合附图对本发明实施例进行详细描述。The embodiments of the present invention are described in detail below with reference to the accompanying drawings.

以下通过特定的具体实例说明本发明的实施方式,本领域技术人员可由本说明书所揭露的内容轻易地了解本发明的其他优点与功效。显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。本发明还可以通过另外不同的具体实施方式加以实施或应用,本说明书中的各项细节也可以基于不同观点与应用,在没有背离本发明的精神下进行各种修饰或改变。需说明的是,在不冲突的情况下,以下实施例及实施例中的特征可以相互组合。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The following describes the embodiments of the present invention through specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the contents disclosed in this specification. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. The present invention can also be implemented or applied through other different specific embodiments, and the details in this specification can also be modified or changed in various ways based on different viewpoints and applications without departing from the spirit of the present invention. It should be noted that the following embodiments and features in the embodiments can be combined with each other without conflict. Based on the embodiments in the present invention, all other embodiments obtained by ordinary technicians in this field without making creative work belong to the scope of protection of the present invention.

需要说明的是,下文描述在所附权利要求书的范围内的实施例的各种方面。应显而易见,本文中所描述的方面可体现于广泛多种形式中,且本文中所描述的任何特定结构及/或功能仅为说明性的。基于本发明,所属领域的技术人员应了解,本文中所描述的一个方面可与任何其它方面独立地实施,且可以各种方式组合这些方面中的两者或两者以上。举例来说,可使用本文中所阐述的任何数目个方面来实施设备及/或实践方法。另外,可使用除了本文中所阐述的方面中的一或多者之外的其它结构及/或功能性实施此设备及/或实践此方法。It should be noted that various aspects of the embodiments within the scope of the appended claims are described below. It should be apparent that the aspects described herein can be embodied in a wide variety of forms, and any specific structure and/or function described herein is merely illustrative. Based on the present invention, it should be understood by those skilled in the art that an aspect described herein can be implemented independently of any other aspect, and two or more of these aspects can be combined in various ways. For example, any number of aspects described herein can be used to implement the device and/or practice the method. In addition, other structures and/or functionalities other than one or more of the aspects described herein can be used to implement this device and/or practice this method.

还需要说明的是,以下实施例中所提供的图示仅以示意方式说明本发明的基本构想,图式中仅显示与本发明中有关的组件而非按照实际实施时的组件数目、形状及尺寸绘制,其实际实施时各组件的型态、数量及比例可为一种随意的改变,且其组件布局型态也可能更为复杂。It should also be noted that the illustrations provided in the following embodiments are only schematic illustrations of the basic concept of the present invention. The drawings only show components related to the present invention rather than being drawn according to the number, shape and size of components in actual implementation. In actual implementation, the type, quantity and proportion of each component may be changed arbitrarily, and the component layout may also be more complicated.

另外,在以下描述中,提供具体细节是为了便于透彻理解实例。然而,所属领域的技术人员将理解,可在没有这些特定细节的情况下实践所述方面。Additionally, in the following description, specific details are provided to facilitate a thorough understanding of the examples. However, one skilled in the art will appreciate that the aspects described may be practiced without these specific details.

本发明实施例提供一种基于自适应密度聚类的共享单车出行潮汐区域探测方法,所述方法可以应用于城市交通规划场景的公共交通优化过程中。An embodiment of the present invention provides a method for detecting tidal areas for shared bicycle travel based on adaptive density clustering, which can be applied to the public transportation optimization process in urban traffic planning scenarios.

参见图1,为本发明实施例提供的一种基于自适应密度聚类的共享单车出行潮汐区域探测方法的流程示意图。如图1和图2所示,所述方法主要包括以下步骤:Referring to FIG1 , a flow chart of a method for detecting tidal areas for shared bicycle travel based on adaptive density clustering is provided in an embodiment of the present invention. As shown in FIG1 and FIG2 , the method mainly includes the following steps:

步骤1,对目标区域内共享单车出行订单产生的轨迹点进行数据清洗后提取共享单车的潜在借还区域;Step 1: Clean the data of the trajectory points generated by the shared bicycle travel orders in the target area and extract the potential borrowing and returning areas of the shared bicycles;

进一步的,所述步骤1具体包括:Furthermore, the step 1 specifically includes:

步骤1.1,删除目标区域范围外、水系和建筑物内的轨迹数据;Step 1.1, delete the trajectory data outside the target area, within the water system and buildings;

步骤1.2,删除时间戳格式异常的轨迹数据,将每辆共享单车订单轨迹点分别按时间戳升序排列,对于开关锁状态连续的异常情况仅保留一条记录;Step 1.2, delete the trajectory data with abnormal timestamp format, sort the trajectory points of each shared bicycle order in ascending order by timestamp, and only keep one record for the abnormal situation of continuous switch lock status;

步骤1.3,对于位置坐标重复的同一辆共享单车轨迹点,仅保留时间戳最早的一条记录;Step 1.3: For the same shared bicycle track points with repeated location coordinates, only the record with the earliest timestamp is retained;

步骤1.4,删除轨迹点时间戳间隔在预设范围之外的一对轨迹点;Step 1.4, deleting a pair of trajectory points whose timestamp interval is outside the preset range;

步骤1.5,将每条轨迹点按预设半径生成空间缓冲区,将所有轨迹点生成的空间缓冲区叠置合并构建单车潜在借还区域。Step 1.5, generate a spatial buffer zone for each trajectory point according to a preset radius, and overlap and merge the spatial buffer zones generated by all trajectory points to construct a potential bicycle borrowing and returning area.

具体实施时,对共享单车出行订单产生的OD点(即轨迹点)轨迹进行数据清洗和单车潜在借还区域提取。具体包括:In the specific implementation, the OD point (i.e., track point) trajectory generated by the shared bicycle travel order is cleaned and the potential bicycle borrowing and returning area is extracted. Specifically, it includes:

1.1 数据清洗1.1 Data Cleansing

首先,删除研究区域范围外、水系和建筑物内的轨迹数据。然后,删除时间戳格式异常的轨迹数据。进一步,将每辆共享单车订单OD点分别按时间戳升序排列,对于开关锁状态连续的异常情况仅保留一条记录,同时,对于位置坐标重复的同一辆共享单车轨迹点,仅保留时间戳最早的一条记录。最后,删除OD点时间戳间隔小于1分钟或大于1小时的一对OD点。First, delete the trajectory data outside the study area, in the water system, and in the building. Then, delete the trajectory data with abnormal timestamp format. Furthermore, sort the OD points of each shared bicycle order in ascending order by timestamp. For the abnormal situation of continuous switch lock status, only one record is retained. At the same time, for the same shared bicycle trajectory points with repeated location coordinates, only the record with the earliest timestamp is retained. Finally, delete a pair of OD points with an OD point timestamp interval of less than 1 minute or greater than 1 hour.

1.2 单车潜在借还区域提取1.2 Extraction of potential bicycle borrowing and returning areas

将每条OD点按50米半径生成空间缓冲区,将所有OD点生成的空间缓冲区叠置合并构建单车潜在借还区域ODA,进一步精细约束单车可能借还的所有空间范围。A spatial buffer with a radius of 50 meters is generated for each OD point. The spatial buffers generated by all OD points are superimposed and merged to construct the bicycle potential borrowing and returning area ODA , which further refines the constraints on all spatial ranges where bicycles may be borrowed and returned.

步骤2,设计核密度估计函数构建轨迹点空间密度场,通过度量空间密度场的不稳定性自适应地构建轨迹点空间邻域,并据此构建空间密度显著性检验函数提取空间密集邻域;Step 2: Design a kernel density estimation function to construct the spatial density field of trajectory points, adaptively construct the spatial neighborhood of trajectory points by measuring the instability of the spatial density field, and construct a spatial density significance test function to extract the spatial dense neighborhood based on it;

在上述实施例的基础上,所述步骤2具体包括:Based on the above embodiment, step 2 specifically includes:

步骤2.1,设计核密度估计函数,并给定单车潜在借还区域内任一轨迹点得到其空间位置处的核密度相对值近似表达式作为轨迹点空间密度场;Step 2.1, design a kernel density estimation function, and given any trajectory point in the potential borrowing and returning area of a bicycle, obtain an approximate expression of the relative value of the kernel density at its spatial position as the spatial density field of the trajectory point;

步骤2.2,根据利用熵函数度量轨迹点空间密度场所有轨迹点所在位置的空间核密度不稳定性,构建轨迹点空间邻域;Step 2.2, constructing the spatial neighborhood of the trajectory point based on the instability of the spatial kernel density of all trajectory points at the locations of the trajectory point spatial density field using the entropy function;

步骤2.3,在空间随机分布假设下,计算任一空间邻域内出现k个其他轨迹点的理论概率分布,并给定任一轨迹点及其对应的空间邻域并计算概率分布值,将概率分布值大于显著性水平值的空间邻域作为空间密集邻域。Step 2.3, under the assumption of spatial random distribution, calculate the theoretical probability distribution of k other trajectory points appearing in any spatial neighborhood, and given any trajectory point and its corresponding spatial neighborhood and calculate the probability distribution value, the spatial neighborhood with a probability distribution value greater than the significance level value is regarded as a spatial dense neighborhood.

可选的,所述核密度相对值近似表达式为

Figure 226341DEST_PATH_IMAGE033
,式中,
Figure 763632DEST_PATH_IMAGE034
表示点
Figure 699227DEST_PATH_IMAGE035
和点
Figure 187715DEST_PATH_IMAGE036
间的欧氏空间距离,
Figure 109535DEST_PATH_IMAGE036
表示研究区域内所有满足
Figure 360388DEST_PATH_IMAGE037
的轨迹点,h表示核密度函数带宽;Optionally, the relative value of the kernel density is approximated as:
Figure 226341DEST_PATH_IMAGE033
, where
Figure 763632DEST_PATH_IMAGE034
Indicate point
Figure 699227DEST_PATH_IMAGE035
and Point
Figure 187715DEST_PATH_IMAGE036
The Euclidean distance between
Figure 109535DEST_PATH_IMAGE036
Indicates that all the
Figure 360388DEST_PATH_IMAGE037
The trajectory points, h represents the bandwidth of the kernel density function;

所述熵函数的表达式为

Figure 342250DEST_PATH_IMAGE038
The expression of the entropy function is:
Figure 342250DEST_PATH_IMAGE038

Figure 350657DEST_PATH_IMAGE039
,式中,N表示目标区域内所有轨迹点数量;
Figure 350657DEST_PATH_IMAGE039
, where N represents the number of all trajectory points in the target area;

所述任一空间邻域内出现k个其他轨迹点的理论概率分布的表达式为The theoretical probability distribution of k other trajectory points appearing in any spatial neighborhood is expressed as

Figure 669643DEST_PATH_IMAGE040
Figure 669643DEST_PATH_IMAGE040

Figure 680062DEST_PATH_IMAGE041
,式中,N表示研究区域内所有OD点数量,Area()表示面积计算函数,SNODA分别表示任一轨迹点空间邻域和单车潜在借还区域;
Figure 680062DEST_PATH_IMAGE041
, where N represents the number of all OD points in the study area, Area () represents the area calculation function, SN and ODA represent the spatial neighborhood of any trajectory point and the potential borrowing and returning area of a single vehicle, respectively;

所述概率分布值的表达式为

Figure 691881DEST_PATH_IMAGE042
式中,
Figure 656426DEST_PATH_IMAGE043
表示点
Figure 185627DEST_PATH_IMAGE035
空间邻域
Figure 411072DEST_PATH_IMAGE044
内其他轨迹点的数量。The expression of the probability distribution value is:
Figure 691881DEST_PATH_IMAGE042
In the formula,
Figure 656426DEST_PATH_IMAGE043
Indicate point
Figure 185627DEST_PATH_IMAGE035
Spatial neighborhood
Figure 411072DEST_PATH_IMAGE044
The number of other trajectory points within.

进一步的,所述步骤2.2具体包括:Furthermore, the step 2.2 specifically includes:

利用熵函数度量研究区域内所有轨迹点所在位置的空间核密度不稳定性,给定

Figure 734737DEST_PATH_IMAGE045
取值范围
Figure 222131DEST_PATH_IMAGE046
和步长
Figure 882920DEST_PATH_IMAGE047
,迭代计算
Figure 103816DEST_PATH_IMAGE048
,将
Figure 457437DEST_PATH_IMAGE048
最小值对应的
Figure 662154DEST_PATH_IMAGE045
设置为目标区域轨迹点的空间邻域长度;The entropy function is used to measure the spatial kernel density instability of the locations of all trajectory points in the study area.
Figure 734737DEST_PATH_IMAGE045
Value range
Figure 222131DEST_PATH_IMAGE046
and step length
Figure 882920DEST_PATH_IMAGE047
, iterative calculation
Figure 103816DEST_PATH_IMAGE048
,Will
Figure 457437DEST_PATH_IMAGE048
The minimum value corresponds to
Figure 662154DEST_PATH_IMAGE045
Set to the spatial neighborhood length of the trajectory points in the target area;

以任一轨迹点为圆心、空间邻域长度为半径构建轨迹点空间邻域。The spatial neighborhood of a trajectory point is constructed with any trajectory point as the center and the length of the spatial neighborhood as the radius.

具体实施时,本发明采用一种自适应密度检验的方法识别OD点密集聚集邻域作为共享单车出行潮汐探测的候选子区域,首先设计核密度估计函数构建OD点空间密度场,然后通过度量空间密度的不稳定性自适应地构建OD点空间邻域,进一步构建空间密度显著性检验函数提取OD点密集邻域。具体包括:In specific implementation, the present invention adopts an adaptive density test method to identify the densely clustered neighborhood of OD points as the candidate sub-area for shared bicycle travel tidal detection. First, a kernel density estimation function is designed to construct the spatial density field of OD points, and then the spatial neighborhood of OD points is adaptively constructed by measuring the instability of spatial density, and then a spatial density significance test function is constructed to extract the dense neighborhood of OD points. Specifically, it includes:

2.1,OD点空间邻域构建2.1, OD point spatial neighborhood construction

给定任一轨迹点p i ,其空间位置处的核密度相对值近似表达为:Given any trajectory point p i , the relative value of the kernel density at its spatial position is approximately expressed as:

Figure 126633DEST_PATH_IMAGE049
Figure 126633DEST_PATH_IMAGE049

式中,

Figure 202037DEST_PATH_IMAGE050
表示点
Figure 897198DEST_PATH_IMAGE051
和点
Figure 917106DEST_PATH_IMAGE052
间的欧氏空间距离,
Figure 326222DEST_PATH_IMAGE052
表示研究区域内所有满足
Figure 521711DEST_PATH_IMAGE053
的轨迹点,h表示核密度函数带宽。在此基础上,利用熵函数度量研究区域内所有OD点所在位置的空间核密度不稳定性:In the formula,
Figure 202037DEST_PATH_IMAGE050
Indicate point
Figure 897198DEST_PATH_IMAGE051
and Point
Figure 917106DEST_PATH_IMAGE052
The Euclidean distance between
Figure 326222DEST_PATH_IMAGE052
Indicates that all the
Figure 521711DEST_PATH_IMAGE053
The trajectory points are represented by h, and h represents the bandwidth of the kernel density function. On this basis, the entropy function is used to measure the spatial kernel density instability of all OD points in the study area:

Figure 482714DEST_PATH_IMAGE054
Figure 482714DEST_PATH_IMAGE054

式中,N表示研究区域内所有OD点数量。给定

Figure 130864DEST_PATH_IMAGE055
宽松的取值范围
Figure 937146DEST_PATH_IMAGE056
和步长
Figure 16835DEST_PATH_IMAGE057
,迭代计算
Figure 24106DEST_PATH_IMAGE058
,将
Figure 18606DEST_PATH_IMAGE058
最小值对应的
Figure 769525DEST_PATH_IMAGE055
设置为研究区域轨迹点空间邻域eps。基于此,以任一轨迹点
Figure 205185DEST_PATH_IMAGE051
为圆心、eps的长度为半径构建的圆形空间区域表示
Figure 976832DEST_PATH_IMAGE051
的空间邻域,记为
Figure 363689DEST_PATH_IMAGE059
。Where N represents the number of all OD points in the study area.
Figure 130864DEST_PATH_IMAGE055
Loose value range
Figure 937146DEST_PATH_IMAGE056
and step length
Figure 16835DEST_PATH_IMAGE057
, iterative calculation
Figure 24106DEST_PATH_IMAGE058
,Will
Figure 18606DEST_PATH_IMAGE058
The minimum value corresponds to
Figure 769525DEST_PATH_IMAGE055
Set to the spatial neighborhood eps of the trajectory points in the study area. Based on this, any trajectory point
Figure 205185DEST_PATH_IMAGE051
The circular space area is represented by the center and the length of eps as the radius.
Figure 976832DEST_PATH_IMAGE051
The spatial neighborhood of
Figure 363689DEST_PATH_IMAGE059
.

2.2,OD点密集邻域识别2.2, OD point dense neighborhood identification

在空间随机分布假设下,任一空间邻域SN内出现k个其他OD点的理论概率为泊松分布密度函数:Under the assumption of random spatial distribution, the theoretical probability of k other OD points appearing in any spatial neighborhood SN is the Poisson distribution density function:

Figure 511774DEST_PATH_IMAGE060
Figure 511774DEST_PATH_IMAGE060

Figure 801941DEST_PATH_IMAGE061
Figure 801941DEST_PATH_IMAGE061

式中,N表示研究区域内所有OD点数量;Area()表示面积计算函数;SNODA分别表示任一轨迹点空间邻域和单车潜在借还区域。给定任一OD点pi及其空间邻域

Figure 416593DEST_PATH_IMAGE059
Figure 120107DEST_PATH_IMAGE059
成为研究区域内空间密集邻域的概率计算为泊松分布的概率分布函数:Where N represents the number of all OD points in the study area; Area () represents the area calculation function; SN and ODA represent the spatial neighborhood of any trajectory point and the potential borrowing and returning area of a single vehicle, respectively. Given any OD point pi and its spatial neighborhood
Figure 416593DEST_PATH_IMAGE059
,
Figure 120107DEST_PATH_IMAGE059
The probability of being a spatially dense neighbor within the study area is calculated as the probability distribution function of the Poisson distribution:

Figure 478407DEST_PATH_IMAGE062
Figure 478407DEST_PATH_IMAGE062

式中,

Figure 584598DEST_PATH_IMAGE063
表示点
Figure 370151DEST_PATH_IMAGE051
空间邻域
Figure 92119DEST_PATH_IMAGE064
内其他OD点的数量。若
Figure 457373DEST_PATH_IMAGE065
大于给定具有统计意义的显著性水平α(如0.95),则
Figure 722132DEST_PATH_IMAGE059
标记为空间密集邻域
Figure 272062DEST_PATH_IMAGE066
。In the formula,
Figure 584598DEST_PATH_IMAGE063
Indicate point
Figure 370151DEST_PATH_IMAGE051
Spatial neighborhood
Figure 92119DEST_PATH_IMAGE064
The number of other OD points within.
Figure 457373DEST_PATH_IMAGE065
is greater than a given statistically significant level α (such as 0.95), then
Figure 722132DEST_PATH_IMAGE059
Marked as a spatially dense neighborhood
Figure 272062DEST_PATH_IMAGE066
.

步骤3,根据空间密集邻域和轨迹点密度特征显著性检验自适应识别轨迹点潮汐邻域并基于密度可达扩展提取共享单车出行潮汐区域。Step 3: Adaptively identify the tidal neighborhood of the trajectory points based on the significance test of spatial dense neighborhood and trajectory point density features, and extract the tidal area of shared bicycle travel based on density reachable expansion.

在上述实施例的基础上,所述步骤3具体包括:Based on the above embodiment, step 3 specifically includes:

步骤3.1,计算任一个空间密集邻域内出现k个共享单车出行起点的理论概率;Step 3.1, calculate the theoretical probability of k shared bicycle trip starting points appearing in any spatially dense neighborhood;

步骤3.2,根据步骤3.1的计算结果计算任一轨迹点对应的空间密集邻域成为目标区域内空间潮汐源邻域的概率值,并将概率值大于显著性水平值的空间密集邻域作为空间潮汐源邻域,将概率值小于显著性水平值的空间密集邻域作为空间潮汐汇邻域;Step 3.2, according to the calculation results of step 3.1, calculate the probability value of the spatial dense neighborhood corresponding to any trajectory point becoming the spatial tidal source neighborhood in the target area, and take the spatial dense neighborhood with a probability value greater than the significance level value as the spatial tidal source neighborhood, and take the spatial dense neighborhood with a probability value less than the significance level value as the spatial tidal sink neighborhood;

步骤3.3,以任一空间潮汐源邻域

Figure 855228DEST_PATH_IMAGE067
为种子点,定义
Figure 414385DEST_PATH_IMAGE068
内其他起点
Figure 2492DEST_PATH_IMAGE069
Figure 129848DEST_PATH_IMAGE051
密度可达,将所有与
Figure 826409DEST_PATH_IMAGE051
密度可达点聚合为空间潮汐源区域
Figure 799044DEST_PATH_IMAGE070
,对于
Figure 5772DEST_PATH_IMAGE070
内满足空间潮汐源邻域判别的其他点
Figure 897505DEST_PATH_IMAGE069
,继续执行聚合操作更新
Figure 222307DEST_PATH_IMAGE070
,直到
Figure 998633DEST_PATH_IMAGE070
内所有满足空间潮汐源邻域判别的其他点全部访问完毕,重新选取目标区域内未被访问的其他空间潮汐源邻域重复上述过程,最终得到目标区域内提取的全部空间潮汐源区域表示为
Figure 420387DEST_PATH_IMAGE071
;Step 3.3, take any spatial tidal source neighborhood
Figure 855228DEST_PATH_IMAGE067
is the seed point, and defines
Figure 414385DEST_PATH_IMAGE068
Other starting points
Figure 2492DEST_PATH_IMAGE069
and
Figure 129848DEST_PATH_IMAGE051
The density can be reached, all
Figure 826409DEST_PATH_IMAGE051
Density-reachable points are aggregated into spatial tidal source regions
Figure 799044DEST_PATH_IMAGE070
,for
Figure 5772DEST_PATH_IMAGE070
Other points that meet the spatial tidal source neighborhood judgment
Figure 897505DEST_PATH_IMAGE069
, continue to perform aggregate operation updates
Figure 222307DEST_PATH_IMAGE070
,until
Figure 998633DEST_PATH_IMAGE070
After all other points in the target area that meet the spatial tidal source neighborhood judgment are visited, other spatial tidal source neighborhoods that have not been visited in the target area are reselected to repeat the above process. Finally, all spatial tidal source areas extracted in the target area are expressed as
Figure 420387DEST_PATH_IMAGE071
;

步骤3.4,以任一空间潮汐汇邻域

Figure 325764DEST_PATH_IMAGE072
为种子点,定义
Figure 465758DEST_PATH_IMAGE072
内其他终点
Figure 576934DEST_PATH_IMAGE069
Figure 259719DEST_PATH_IMAGE051
密度可达,将所有与
Figure 899779DEST_PATH_IMAGE051
密度可达点聚合为空间潮汐汇区域
Figure 527069DEST_PATH_IMAGE073
,对于
Figure 149593DEST_PATH_IMAGE073
内满足空间潮汐汇邻域判别的其他点
Figure 811518DEST_PATH_IMAGE069
,继续执行聚合操作更新
Figure 622479DEST_PATH_IMAGE073
,直到
Figure 878011DEST_PATH_IMAGE073
内所有满足空间潮汐汇邻域判别的其他点全部访问完毕,重新选取目标区域内未被访问的其他空间潮汐汇邻域重复上述过程,最终得到目标区域内提取的全部空间潮汐汇区域表示为
Figure 393306DEST_PATH_IMAGE074
,目标区域内提取的全部空间潮汐区域表示为潮汐源区域和潮汐汇区域的并集Step 3.4, take any spatial tidal sink neighborhood
Figure 325764DEST_PATH_IMAGE072
is the seed point, and defines
Figure 465758DEST_PATH_IMAGE072
Other endpoints
Figure 576934DEST_PATH_IMAGE069
and
Figure 259719DEST_PATH_IMAGE051
The density can be reached, all
Figure 899779DEST_PATH_IMAGE051
Density-reachable points are aggregated into spatial tidal sink areas
Figure 527069DEST_PATH_IMAGE073
,for
Figure 149593DEST_PATH_IMAGE073
Other points that meet the spatial tidal sink neighborhood judgment
Figure 811518DEST_PATH_IMAGE069
, continue to perform aggregate operation updates
Figure 622479DEST_PATH_IMAGE073
,until
Figure 878011DEST_PATH_IMAGE073
After all other points in the target area that meet the spatial tidal sink neighborhood judgment are visited, other spatial tidal sink neighborhoods that have not been visited in the target area are reselected to repeat the above process. Finally, all spatial tidal sink areas extracted in the target area are expressed as
Figure 393306DEST_PATH_IMAGE074
, all spatial tidal areas extracted within the target area are represented as the union of tidal source areas and tidal sink areas

Figure 316263DEST_PATH_IMAGE024
Figure 316263DEST_PATH_IMAGE024
.

进一步的,所述任一个空间密集邻域内出现k个共享单车出行起点的理论概率的表达式为Furthermore, the theoretical probability of k shared bicycle trip starting points appearing in any spatially dense neighborhood is expressed as

Figure 796661DEST_PATH_IMAGE075
Figure 796661DEST_PATH_IMAGE075

式中,

Figure 132964DEST_PATH_IMAGE076
表示空间密集邻域内起点数量,
Figure 124054DEST_PATH_IMAGE077
表示空间密集邻域内轨迹点总数量,p表示目标区域内出行起点比例;In the formula,
Figure 132964DEST_PATH_IMAGE076
represents the number of starting points in a spatially dense neighborhood,
Figure 124054DEST_PATH_IMAGE077
represents the total number of trajectory points in the spatially dense neighborhood, and p represents the proportion of travel starting points in the target area;

所述概率值的表达式为The expression of the probability value is:

Figure 229413DEST_PATH_IMAGE079
Figure 229413DEST_PATH_IMAGE079

式中,

Figure 647756DEST_PATH_IMAGE080
表示
Figure 612301DEST_PATH_IMAGE081
内起点数量,
Figure 734978DEST_PATH_IMAGE082
表示
Figure 865483DEST_PATH_IMAGE083
内轨迹点总数量。In the formula,
Figure 647756DEST_PATH_IMAGE080
express
Figure 612301DEST_PATH_IMAGE081
The number of internal starting points,
Figure 734978DEST_PATH_IMAGE082
express
Figure 865483DEST_PATH_IMAGE083
The total number of inner track points.

具体实施时,在上述OD点密集聚集邻域识别基础上,本发明采用一种基于OD点密度特征聚类的方法探测共享单车出行潮汐区域,首先基于OD点密度特征显著性检验自适应地识别OD点潮汐邻域,然后基于密度可达扩展提取共享单车出行潮汐区域。具体包括:In specific implementation, based on the above-mentioned identification of densely clustered neighborhoods of OD points, the present invention adopts a method based on OD point density feature clustering to detect the shared bicycle travel tidal area. First, the OD point tidal neighborhood is adaptively identified based on the OD point density feature significance test, and then the shared bicycle travel tidal area is extracted based on density reachable expansion. Specifically including:

3.1,OD点潮汐邻域识别3.1, OD point tidal neighborhood identification

给定任一空间密集邻域SDNSDN内出现k个单车出行起点的理论概率为二项分布密度函数:Given any spatially dense neighborhood SDN , the theoretical probability of k bicycle trip starting points appearing in the SDN is the binomial distribution density function:

Figure 657989DEST_PATH_IMAGE084
Figure 657989DEST_PATH_IMAGE084

式中,

Figure 500043DEST_PATH_IMAGE085
表示
Figure 832936DEST_PATH_IMAGE086
内起点数量,
Figure 522674DEST_PATH_IMAGE087
表示空间密集邻域内轨迹点总数量,p表示目标区域内出行起点比例。给定任一OD点
Figure 407453DEST_PATH_IMAGE088
及其空间密集邻域
Figure 579547DEST_PATH_IMAGE089
Figure 184971DEST_PATH_IMAGE089
成为研究区域内空间潮汐源邻域的可能性计算为:In the formula,
Figure 500043DEST_PATH_IMAGE085
express
Figure 832936DEST_PATH_IMAGE086
The number of internal starting points,
Figure 522674DEST_PATH_IMAGE087
represents the total number of trajectory points in the spatially dense neighborhood, and p represents the proportion of travel starting points in the target area.
Figure 407453DEST_PATH_IMAGE088
and its spatially dense neighborhood
Figure 579547DEST_PATH_IMAGE089
,
Figure 184971DEST_PATH_IMAGE089
The probability of becoming a neighbor of a spatial tidal source in the study area is calculated as:

Figure 650588DEST_PATH_IMAGE091
Figure 650588DEST_PATH_IMAGE091

式中,

Figure 316055DEST_PATH_IMAGE092
表示
Figure 476909DEST_PATH_IMAGE093
内起点数量,
Figure 276238DEST_PATH_IMAGE094
表示
Figure 722261DEST_PATH_IMAGE095
内轨迹点总数量。若
Figure 558630DEST_PATH_IMAGE096
大于给定具有统计意义的显著性水平α(如0.95),则
Figure 737939DEST_PATH_IMAGE097
标记为空间潮汐源邻域
Figure 544221DEST_PATH_IMAGE098
;若
Figure 594217DEST_PATH_IMAGE099
小于给定具有统计意义的显著性水平1-α,则
Figure 631180DEST_PATH_IMAGE100
标记为空间潮汐汇邻域
Figure 360102DEST_PATH_IMAGE101
。In the formula,
Figure 316055DEST_PATH_IMAGE092
express
Figure 476909DEST_PATH_IMAGE093
The number of internal starting points,
Figure 276238DEST_PATH_IMAGE094
express
Figure 722261DEST_PATH_IMAGE095
The total number of inner track points.
Figure 558630DEST_PATH_IMAGE096
is greater than a given statistically significant level α (such as 0.95), then
Figure 737939DEST_PATH_IMAGE097
Marked as spatial tidal source neighborhood
Figure 544221DEST_PATH_IMAGE098
;like
Figure 594217DEST_PATH_IMAGE099
is less than the given statistically significant level 1- α , then
Figure 631180DEST_PATH_IMAGE100
Marked as a spatial tidal sink neighborhood
Figure 360102DEST_PATH_IMAGE101
.

3.2,OD点潮汐区域提取3.2, OD point tidal area extraction

以任一空间潮汐源邻域

Figure 642179DEST_PATH_IMAGE098
为种子点,定义
Figure 77839DEST_PATH_IMAGE098
内其他起点
Figure 833175DEST_PATH_IMAGE102
Figure 580551DEST_PATH_IMAGE103
密度可达,将所有与
Figure 400739DEST_PATH_IMAGE103
密度可达点聚合为空间潮汐源区域
Figure 923862DEST_PATH_IMAGE104
,对于
Figure 538514DEST_PATH_IMAGE104
内满足空间潮汐源邻域判别的其他点
Figure 648553DEST_PATH_IMAGE102
,继续执行聚合操作更新
Figure 865907DEST_PATH_IMAGE104
,直到
Figure 10581DEST_PATH_IMAGE104
内所有满足空间潮汐源邻域判别的其他点全部访问完毕,重新选取目标区域内未被访问的其他空间潮汐源邻域重复上述过程,最终得到目标区域内提取的全部空间潮汐源区域表示为
Figure 300529DEST_PATH_IMAGE105
。Take any spatial tidal source neighborhood
Figure 642179DEST_PATH_IMAGE098
is the seed point, and defines
Figure 77839DEST_PATH_IMAGE098
Other starting points
Figure 833175DEST_PATH_IMAGE102
and
Figure 580551DEST_PATH_IMAGE103
The density can be reached, all
Figure 400739DEST_PATH_IMAGE103
Density-reachable points are aggregated into spatial tidal source regions
Figure 923862DEST_PATH_IMAGE104
,for
Figure 538514DEST_PATH_IMAGE104
Other points that meet the spatial tidal source neighborhood judgment
Figure 648553DEST_PATH_IMAGE102
, continue to perform aggregate operation updates
Figure 865907DEST_PATH_IMAGE104
,until
Figure 10581DEST_PATH_IMAGE104
After all other points in the target area that meet the spatial tidal source neighborhood judgment are visited, other spatial tidal source neighborhoods that have not been visited in the target area are reselected to repeat the above process. Finally, all spatial tidal source areas extracted in the target area are expressed as
Figure 300529DEST_PATH_IMAGE105
.

同理,以任一空间潮汐汇邻域

Figure 163443DEST_PATH_IMAGE106
为种子点,定义
Figure 794275DEST_PATH_IMAGE106
内其他终点
Figure 918089DEST_PATH_IMAGE102
Figure 779604DEST_PATH_IMAGE103
密度可达,将所有与
Figure 864234DEST_PATH_IMAGE103
密度可达点聚合为空间潮汐汇区域
Figure 95495DEST_PATH_IMAGE107
,基于以上空间扩展策略,最终,研究区域内提取的全部空间潮汐汇区域表示为
Figure 214761DEST_PATH_IMAGE108
。研究区域内提取的全部空间潮汐区域表示为潮汐源区域和潮汐汇区域的并集Similarly, any spatial tidal sink neighborhood
Figure 163443DEST_PATH_IMAGE106
is the seed point, and defines
Figure 794275DEST_PATH_IMAGE106
Other endpoints
Figure 918089DEST_PATH_IMAGE102
and
Figure 779604DEST_PATH_IMAGE103
The density can be reached, all
Figure 864234DEST_PATH_IMAGE103
Density-reachable points are aggregated into spatial tidal sink areas
Figure 95495DEST_PATH_IMAGE107
, based on the above spatial expansion strategy, finally, all the spatial tidal sink areas extracted in the study area are expressed as
Figure 214761DEST_PATH_IMAGE108
The total spatial tidal area extracted in the study area is represented as the union of the tidal source area and the tidal sink area.

Figure 106232DEST_PATH_IMAGE024
Figure 106232DEST_PATH_IMAGE024
.

本实施例提供的基于自适应密度聚类的共享单车出行潮汐区域探测方法,通过充分顾及出行空间特征和共享单车轨迹特征,对共享单车起止点聚集区域自适应识别,以使得共享单车出行潮汐区域自适应判别与提取,提高了探测过程的适应性和精准度。The shared bicycle travel tidal area detection method based on adaptive density clustering provided in this embodiment fully considers the travel space characteristics and shared bicycle trajectory characteristics, and adaptively identifies the shared bicycle starting and ending point aggregation areas, so as to adaptively distinguish and extract the shared bicycle travel tidal area, thereby improving the adaptability and accuracy of the detection process.

下面将结合一个具体实施例对本方案进行说明,采用某市2020年12月21日共享单车OD点数据对本发明的具体实施过程进行说明:The present solution will be described below in conjunction with a specific embodiment, and the specific implementation process of the present invention will be described using the OD point data of shared bicycles in a certain city on December 21, 2020:

(1)实施例中选择某市A区和B区构成目标区域,采用共享单车OD点数据,该数据时间为2020年12月21日上午6:00:00~10:00:00,轨迹数据与研究区域如图3所示。(1) In this embodiment, District A and District B of a city are selected to constitute the target area, and the OD point data of shared bicycles is used. The data time is 6:00:00-10:00:00 on December 21, 2020. The trajectory data and the study area are shown in Figure 3.

(2)删除研究区域范围外和水系、建筑物内的轨迹数据;删除时间戳格式异常的轨迹数据;将每辆共享单车订单OD点分别按时间戳升序排列,对于开关锁状态连续的异常情况仅保留一条记录;对于位置坐标重复的同一个共享单车轨迹点,仅保留时间戳最早的一条记录;删除OD点时间戳间隔小于1分钟或大于1小时的一对OD点。(2) Delete trajectory data outside the study area and within water systems and buildings; delete trajectory data with abnormal timestamp format; arrange the OD points of each shared bicycle order in ascending order by timestamp, and only retain one record for abnormal situations where the switch lock status is continuous; for the same shared bicycle trajectory point with repeated location coordinates, only retain the record with the earliest timestamp; delete a pair of OD points with an OD point timestamp interval of less than 1 minute or greater than 1 hour.

(3)将每条OD点按50米半径生成空间缓冲区,将所有OD点生成的空间缓冲区叠置合并构建单车潜在借还区域ODA,进一步精细约束单车可能借还的所有空间范围。(3) Generate a spatial buffer with a radius of 50 meters for each OD point. Overlay and merge the spatial buffers generated by all OD points to construct the bicycle potential borrowing and returning area (ODA) , further refining the constraints on all spatial ranges where bicycles may be borrowed and returned.

(4)OD点空间邻域构建。给定任一轨迹点

Figure 943738DEST_PATH_IMAGE103
,其空间位置处的核密度相对值近似表达为:(4) Construction of OD point spatial neighborhood. Given any trajectory point
Figure 943738DEST_PATH_IMAGE103
, the relative value of the kernel density at its spatial position is approximately expressed as:

Figure 447531DEST_PATH_IMAGE109
Figure 447531DEST_PATH_IMAGE109

式中,

Figure 421303DEST_PATH_IMAGE110
表示点
Figure 483675DEST_PATH_IMAGE103
和点
Figure 542898DEST_PATH_IMAGE102
间的欧氏空间距离,
Figure 850383DEST_PATH_IMAGE102
表示研究区域内所有满足
Figure 944241DEST_PATH_IMAGE111
的轨迹点,h表示核密度函数带宽。在此基础上,利用熵函数度量研究区域内所有OD点所在位置的空间核密度不稳定性:In the formula,
Figure 421303DEST_PATH_IMAGE110
Indicate point
Figure 483675DEST_PATH_IMAGE103
and Point
Figure 542898DEST_PATH_IMAGE102
The Euclidean distance between
Figure 850383DEST_PATH_IMAGE102
Indicates that all the
Figure 944241DEST_PATH_IMAGE111
The trajectory points are represented by h, and h represents the bandwidth of the kernel density function. On this basis, the entropy function is used to measure the spatial kernel density instability of all OD points in the study area:

Figure 171654DEST_PATH_IMAGE112
Figure 171654DEST_PATH_IMAGE112

Figure 983752DEST_PATH_IMAGE113
Figure 983752DEST_PATH_IMAGE113

式中,N表示研究区域内所有OD点数量。给定

Figure 563769DEST_PATH_IMAGE114
宽松的取值范围
Figure 777713DEST_PATH_IMAGE115
和步长
Figure 181887DEST_PATH_IMAGE116
,迭代计算
Figure 950123DEST_PATH_IMAGE117
,将
Figure 333831DEST_PATH_IMAGE117
最小值对应的
Figure 995757DEST_PATH_IMAGE114
设置为研究区域轨迹点空间邻域eps。基于此,以任一轨迹点
Figure 806718DEST_PATH_IMAGE103
为圆心、eps为半径构建的圆形空间区域表示
Figure 295206DEST_PATH_IMAGE103
的空间邻域,记为
Figure 341659DEST_PATH_IMAGE118
。其中,设置
Figure 999036DEST_PATH_IMAGE119
Figure 980899DEST_PATH_IMAGE120
分别为1 m、200m和1 m。Where N represents the number of all OD points in the study area.
Figure 563769DEST_PATH_IMAGE114
Loose value range
Figure 777713DEST_PATH_IMAGE115
and step length
Figure 181887DEST_PATH_IMAGE116
, iterative calculation
Figure 950123DEST_PATH_IMAGE117
,Will
Figure 333831DEST_PATH_IMAGE117
The minimum value corresponds to
Figure 995757DEST_PATH_IMAGE114
Set to the spatial neighborhood eps of the trajectory points in the study area. Based on this, any trajectory point
Figure 806718DEST_PATH_IMAGE103
The circular space area is represented by the center and eps is the radius.
Figure 295206DEST_PATH_IMAGE103
The spatial neighborhood of
Figure 341659DEST_PATH_IMAGE118
Among them, setting
Figure 999036DEST_PATH_IMAGE119
and
Figure 980899DEST_PATH_IMAGE120
They are 1 m , 200 m and 1 m respectively.

(5)OD点密集邻域识别。在空间随机分布假设下,任一空间邻域SN内出现k个其他OD点的理论概率为泊松分布密度函数:(5) Identification of dense neighborhoods of OD points. Under the assumption of spatial random distribution, the theoretical probability of k other OD points appearing in any spatial neighborhood SN is the Poisson distribution density function:

Figure 317202DEST_PATH_IMAGE121
Figure 317202DEST_PATH_IMAGE121

式中,N表示研究区域内所有OD点数量;Area()表示面积计算函数;SNODA分别表示任一轨迹点空间邻域和单车潜在借还区域。给定任一OD点p i 及其空间邻域

Figure 308292DEST_PATH_IMAGE122
Figure 413651DEST_PATH_IMAGE122
成为研究区域内空间密集邻域的概率计算为泊松分布的概率分布函数:Where N represents the number of all OD points in the study area; Area () represents the area calculation function; SN and ODA represent the spatial neighborhood of any trajectory point and the potential borrowing and returning area of a single vehicle, respectively. Given any OD point p i and its spatial neighborhood
Figure 308292DEST_PATH_IMAGE122
,
Figure 413651DEST_PATH_IMAGE122
The probability of being a spatially dense neighbor within the study area is calculated as the probability distribution function of the Poisson distribution:

Figure 64950DEST_PATH_IMAGE123
Figure 64950DEST_PATH_IMAGE123

式中,

Figure 29495DEST_PATH_IMAGE124
表示点p i 空间邻域
Figure 558697DEST_PATH_IMAGE125
内其他OD点的数量。若
Figure 456246DEST_PATH_IMAGE126
大于给定具有统计意义的显著性水平α,则
Figure 373386DEST_PATH_IMAGE125
标记为空间密集邻域
Figure 595201DEST_PATH_IMAGE127
。其中,设置α=0.95。In the formula,
Figure 29495DEST_PATH_IMAGE124
Represents the spatial neighborhood of point p i
Figure 558697DEST_PATH_IMAGE125
The number of other OD points within.
Figure 456246DEST_PATH_IMAGE126
is greater than a given statistically significant level α , then
Figure 373386DEST_PATH_IMAGE125
Marked as a spatially dense neighborhood
Figure 595201DEST_PATH_IMAGE127
. Among them, α is set to 0.95.

(6)OD点潮汐邻域识别。给定任一空间密集邻域SDNSDN内出现k个单车出行起点的理论概率为二项分布密度函数:(6) Identification of OD point tidal neighborhood. Given any spatially dense neighborhood SDN , the theoretical probability of k bicycle trip starting points appearing in the SDN is the binomial distribution density function:

Figure 662514DEST_PATH_IMAGE128
Figure 662514DEST_PATH_IMAGE128

式中,

Figure 8045DEST_PATH_IMAGE129
表示
Figure 502611DEST_PATH_IMAGE130
内起点数量,
Figure 707327DEST_PATH_IMAGE131
表示空间密集邻域内轨迹点总数量,p表示目标区域内出行起点比例。给定任一OD点
Figure 76867DEST_PATH_IMAGE132
及其空间密集邻域
Figure 276904DEST_PATH_IMAGE133
Figure 207951DEST_PATH_IMAGE133
成为研究区域内空间潮汐源邻域的可能性计算为:In the formula,
Figure 8045DEST_PATH_IMAGE129
express
Figure 502611DEST_PATH_IMAGE130
The number of internal starting points,
Figure 707327DEST_PATH_IMAGE131
represents the total number of trajectory points in the spatially dense neighborhood, and p represents the proportion of travel starting points in the target area.
Figure 76867DEST_PATH_IMAGE132
and its spatially dense neighborhood
Figure 276904DEST_PATH_IMAGE133
,
Figure 207951DEST_PATH_IMAGE133
The probability of becoming a neighbor of a spatial tidal source in the study area is calculated as:

Figure 634384DEST_PATH_IMAGE135
Figure 634384DEST_PATH_IMAGE135

式中,

Figure 902554DEST_PATH_IMAGE136
表示
Figure 363623DEST_PATH_IMAGE137
内起点数量,
Figure 698527DEST_PATH_IMAGE138
表示
Figure 471311DEST_PATH_IMAGE137
内轨迹点总数量。若
Figure 684117DEST_PATH_IMAGE139
大于给定具有统计意义的显著性水平α,则
Figure 734113DEST_PATH_IMAGE137
标记为空间潮汐源邻域
Figure 600438DEST_PATH_IMAGE140
;若
Figure 1463DEST_PATH_IMAGE141
小于给定具有统计意义的显著性水平1-α,则
Figure 516496DEST_PATH_IMAGE137
标记为空间潮汐汇邻域
Figure 14473DEST_PATH_IMAGE142
。其中,设置α=0.95。In the formula,
Figure 902554DEST_PATH_IMAGE136
express
Figure 363623DEST_PATH_IMAGE137
The number of internal starting points,
Figure 698527DEST_PATH_IMAGE138
express
Figure 471311DEST_PATH_IMAGE137
The total number of inner track points.
Figure 684117DEST_PATH_IMAGE139
is greater than a given statistically significant level α , then
Figure 734113DEST_PATH_IMAGE137
Marked as spatial tidal source neighborhood
Figure 600438DEST_PATH_IMAGE140
;like
Figure 1463DEST_PATH_IMAGE141
is less than the given statistically significant level 1- α , then
Figure 516496DEST_PATH_IMAGE137
Marked as a spatial tidal sink neighborhood
Figure 14473DEST_PATH_IMAGE142
. Among them, α is set to 0.95.

(7)OD点潮汐区域提取。以任一空间潮汐源邻域

Figure 723803DEST_PATH_IMAGE143
为种子点,定义
Figure 346546DEST_PATH_IMAGE140
内其他起点
Figure 760210DEST_PATH_IMAGE144
Figure 784797DEST_PATH_IMAGE103
密度可达,将所有与
Figure 118825DEST_PATH_IMAGE103
密度可达点聚合为空间潮汐源区域
Figure 494443DEST_PATH_IMAGE145
,对于
Figure 587164DEST_PATH_IMAGE145
内满足空间潮汐源邻域判别的其他点
Figure 590892DEST_PATH_IMAGE144
,继续执行聚合操作更新
Figure 281505DEST_PATH_IMAGE145
,直到
Figure 144419DEST_PATH_IMAGE145
内所有满足空间潮汐源邻域判别的其他点全部访问完毕,重新选取目标区域内未被访问的其他空间潮汐源邻域重复上述过程,最终得到目标区域内提取的全部空间潮汐源区域表示为
Figure 306410DEST_PATH_IMAGE105
。(7) Extraction of tidal area at OD point. Take any spatial tidal source neighborhood as
Figure 723803DEST_PATH_IMAGE143
is the seed point, and defines
Figure 346546DEST_PATH_IMAGE140
Other starting points
Figure 760210DEST_PATH_IMAGE144
and
Figure 784797DEST_PATH_IMAGE103
The density can be reached, all
Figure 118825DEST_PATH_IMAGE103
Density-reachable points are aggregated into spatial tidal source regions
Figure 494443DEST_PATH_IMAGE145
,for
Figure 587164DEST_PATH_IMAGE145
Other points that meet the spatial tidal source neighborhood judgment
Figure 590892DEST_PATH_IMAGE144
, continue to perform aggregate operation updates
Figure 281505DEST_PATH_IMAGE145
,until
Figure 144419DEST_PATH_IMAGE145
After all other points in the target area that meet the spatial tidal source neighborhood judgment are visited, other spatial tidal source neighborhoods that have not been visited in the target area are reselected to repeat the above process. Finally, all spatial tidal source areas extracted in the target area are expressed as
Figure 306410DEST_PATH_IMAGE105
.

同理,以任一空间潮汐汇邻域

Figure 69704DEST_PATH_IMAGE142
为种子点,定义
Figure 963842DEST_PATH_IMAGE142
内其他终点
Figure 579631DEST_PATH_IMAGE144
Figure 715952DEST_PATH_IMAGE103
密度可达,将所有与
Figure 100797DEST_PATH_IMAGE103
密度可达点聚合为空间潮汐汇区域
Figure 493732DEST_PATH_IMAGE146
,基于以上空间扩展策略,如图4和图5所示,最终,研究区域内提取的全部空间潮汐汇区域表示为
Figure 101212DEST_PATH_IMAGE147
。研究区域内提取的全部空间潮汐区域表示为潮汐源区域和潮汐汇区域的并集Similarly, any spatial tidal sink neighborhood
Figure 69704DEST_PATH_IMAGE142
is the seed point, and defines
Figure 963842DEST_PATH_IMAGE142
Other endpoints
Figure 579631DEST_PATH_IMAGE144
and
Figure 715952DEST_PATH_IMAGE103
The density can be reached, all
Figure 100797DEST_PATH_IMAGE103
Density-reachable points are aggregated into spatial tidal sink areas
Figure 493732DEST_PATH_IMAGE146
,Based on the above spatial expansion strategy, as shown in Figures 4 and 5, finally, all the spatial tidal sink areas extracted in the study area are expressed as
Figure 101212DEST_PATH_IMAGE147
The total spatial tidal area extracted in the study area is represented as the union of the tidal source area and the tidal sink area.

Figure 11530DEST_PATH_IMAGE024
Figure 11530DEST_PATH_IMAGE024
.

由此可以看出,本发明的方法相比现有方案的优点在于:It can be seen from this that the method of the present invention has the following advantages over the existing solutions:

(1)现有基于空间单元特征度量的方法需要进行空间单元划分,同时潮汐程度判断依赖人工经验设置,难以精细可靠地探测共享单车出行潮汐区域,本发明有效避免了空间划分操作,基于具有统计学意义的显著性检验自适应地提取共享单车出行潮汐现象的精细空间范围。(1) Existing methods based on spatial unit feature measurement require spatial unit division. At the same time, the tidal degree judgment relies on manual experience settings, which makes it difficult to accurately and reliably detect the tidal area of shared bicycle travel. The present invention effectively avoids the spatial division operation and adaptively extracts the fine spatial range of the shared bicycle travel tidal phenomenon based on a statistically significant significance test.

(2)现有基于空间轨迹聚类探测的方法忽略了共享单车起止定位点的分布特征,仍利用共享单车密度或留存量等特征设置阈值判别潮汐现象,难以真实体现现实城市空间中潮汐失衡程度,极易产生借还动态均衡区域的误判,本发明基于共享单车OD点真实空间分布和实际相对数量特征进行潮汐显著性检验,具有严密的数学与地理学可解释性,能够可靠、稳定地提取共享单车出行潮汐区域。(2) The existing methods based on spatial trajectory clustering detection ignore the distribution characteristics of the starting and ending points of shared bicycles, and still use characteristics such as shared bicycle density or retention to set thresholds to distinguish tidal phenomena. This makes it difficult to truly reflect the degree of tidal imbalance in real urban space and is prone to misjudgment of the dynamic equilibrium area of borrowing and returning. The present invention performs tidal significance testing based on the real spatial distribution and actual relative quantity characteristics of shared bicycle OD points. It has rigorous mathematical and geographical interpretability and can reliably and stably extract shared bicycle travel tidal areas.

描述于本发明实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。The units involved in the embodiments of the present invention may be implemented in software or hardware.

应当理解,本发明的各部分可以用硬件、软件、固件或它们的组合来实现。It should be understood that various parts of the present invention can be implemented by hardware, software, firmware or a combination thereof.

以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到的变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以权利要求的保护范围为准。The above is only a specific embodiment of the present invention, but the protection scope of the present invention is not limited thereto. Any changes or substitutions that can be easily thought of by a person skilled in the art within the technical scope disclosed by the present invention should be included in the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (3)

1. The utility model provides a sharing bicycle trip tide area detection method based on self-adaptation density clustering which is characterized by comprising the following steps:
step 1, data cleaning is carried out on track points generated by a travel order of a shared bicycle in a target area, and then a potential borrowing and returning area of the shared bicycle is extracted;
step 2, designing a kernel density estimation function to construct a track point space density field, adaptively constructing a track point space neighborhood by measuring the instability of the space density field, and constructing a space density significance test function to extract a space density neighborhood according to the space density estimation function;
the step 2 specifically includes:
step 2.1, designing a kernel density estimation function, and giving a kernel density relative value approximate expression of any track point in a potential borrowing and returning area of a bicycle to obtain a spatial position of the track point as a track point spatial density field, wherein the kernel density relative value approximate expression is as follows
Figure QLYQS_1
Wherein->
Figure QLYQS_2
Representation dot->
Figure QLYQS_3
And (4) point->
Figure QLYQS_4
Euclidean space distance between->
Figure QLYQS_5
Indicating all fulfilments within the investigation region +.>
Figure QLYQS_6
Track points of>
Figure QLYQS_7
Representing the bandwidth of the kernel density function;
step 2.2, measuring the spatial kernel density instability of the positions of the track points in the track point spatial density places by utilizing an entropy function, and constructing a track point spatial neighborhood;
measuring the spatial kernel density instability of all track points in the research area by using entropy function, and giving the bandwidth of kernel density function
Figure QLYQS_8
Value range->
Figure QLYQS_9
And step size->
Figure QLYQS_10
Iterative calculation of entropy function->
Figure QLYQS_11
Will->
Figure QLYQS_12
Corresponding to the minimum value
Figure QLYQS_13
Setting the space neighborhood length of the track point of the target area;
constructing a track point space neighborhood by taking any track point as a circle center and the space neighborhood length as a radius;
step 2.3, under the assumption of space random distribution, calculating theoretical probability distribution of k other track points in any space neighborhood, giving any track point and a corresponding space neighborhood thereof, calculating probability distribution value of the space neighborhood being a dense neighborhood, and taking the space neighborhood with the probability distribution value larger than a significance level value as the space dense neighborhood;
step 3, adaptively identifying the track point tide neighborhood according to the space dense neighborhood and the track point density characteristic significance test and extracting the shared bicycle trip tide region based on density reachable expansion;
the step 3 specifically includes:
step 3.1, calculating the occurrence in any one of the spatially dense neighborskTheoretical probability of individual sharing of a bicycle travel starting point; wherein the occurrence in any one of the spatially dense neighborskThe expression of the theoretical probability of each shared bicycle travel starting point is
Figure QLYQS_14
In the method, in the process of the invention,
Figure QLYQS_15
representing the number of start points in a spatially dense neighborhood, +.>
Figure QLYQS_16
Represents the total number of trace points in a spatially dense neighborhood, +.>
Figure QLYQS_17
Representing the travel starting point proportion in the target area;
step 3.2, calculating a spatial dense neighborhood corresponding to any track point according to the calculation result of the step 3.1 to become a probability value of a spatial tide source neighborhood in the target area, taking the spatial dense neighborhood with the probability value larger than the significance level value as the spatial tide source neighborhood, and taking the spatial dense neighborhood with the probability value smaller than the significance level value as the spatial tide sink neighborhood; wherein, the expression of the probability value that the space dense neighborhood corresponding to any track point becomes the space tide source neighborhood in the target area is that
Figure QLYQS_18
In the method, in the process of the invention,
Figure QLYQS_19
representation->
Figure QLYQS_20
Number of internal origin->
Figure QLYQS_21
Representation->
Figure QLYQS_22
Total number of inner trace points>
Figure QLYQS_23
Representation dot->
Figure QLYQS_24
Is a spatially dense neighborhood of (1); />
Step 3.3, using any space tidal source neighborhood
Figure QLYQS_26
For seed points, define->
Figure QLYQS_29
Other starting points->
Figure QLYQS_32
And->
Figure QLYQS_27
The density is reachable, all are combined with +.>
Figure QLYQS_30
The density reachable points are aggregated into a spatial tidal source area +.>
Figure QLYQS_33
For->
Figure QLYQS_34
Other points satisfying the space tidal source neighborhood discrimination in the interior continue to execute aggregation operation update +.>
Figure QLYQS_25
Up to->
Figure QLYQS_28
After all other points meeting the space tidal source neighborhood discrimination in the target area are accessed, the other space tidal source neighborhood which is not accessed in the target area is selected again, and the process is repeated, so that the whole space tidal source area extracted in the target area is finally obtained and expressed as +.>
Figure QLYQS_31
Step 3.4, converging the neighborhood by any space tide
Figure QLYQS_37
For seed points, define->
Figure QLYQS_40
Other end point->
Figure QLYQS_44
And->
Figure QLYQS_38
The density is reachable, all are combined with +.>
Figure QLYQS_41
The density reachable points are aggregated into a space tidal current area +.>
Figure QLYQS_43
For->
Figure QLYQS_46
Other points satisfying the space tide sink neighborhood discrimination in the interior continue to execute aggregation operation update +.>
Figure QLYQS_35
Up to->
Figure QLYQS_39
After all other points meeting the space tide sink neighborhood discrimination are accessed, the other space tide sink neighborhood which is not accessed in the target area is selected again, the process is repeated, and finally, all the space tide sink areas extracted in the target area are expressed as +.>
Figure QLYQS_42
The total spatial tidal area extracted within the target area is represented as the union of the tidal Source area and the tidal sink area
Figure QLYQS_45
Figure QLYQS_36
2. The method according to claim 1, wherein the step 1 specifically comprises:
step 1.1, deleting track data outside the range of a target area, in a water system and in a building;
step 1.2, deleting track data with abnormal timestamp formats, arranging track points of each shared bicycle order in ascending order according to the timestamps, and keeping only one record for abnormal conditions of continuous unlocking and locking states;
step 1.3, for the same shared bicycle track point with repeated position coordinates, only one record with the earliest time stamp is reserved;
step 1.4, deleting a pair of track points with track point time stamp intervals outside a preset range;
and 1.5, generating a space buffer area according to a preset radius by each track point, and superposing and combining the space buffer areas generated by all track points to construct a bicycle potential borrowing and returning area.
3. The method of claim 2, wherein the entropy function has an expression of
Figure QLYQS_47
Figure QLYQS_48
In which, in the process,Nrepresenting the number of all track points in the target area;
occurrence in any one of the spatial neighborskThe theoretical probability distribution of each other trace point is expressed as
Figure QLYQS_49
Figure QLYQS_50
In which, in the process,Nrepresenting the number of all OD points in the investigation region, the OD points representing the trace points,Area() Representing the area calculation function,SNandODArespectively representing a space neighborhood of any track point and a bicycle potential borrowing and returning area;
the expression for calculating the probability distribution value of the spatial neighborhood being a dense neighborhood is as follows
Figure QLYQS_51
In the method, in the process of the invention,
Figure QLYQS_52
representation dot->
Figure QLYQS_53
Spatial neighborhood->
Figure QLYQS_54
The number of other trace points in the track. />
CN202211562330.0A 2022-12-07 2022-12-07 Self-adaptive density clustering-based method for detecting travel tide area of shared bicycle Active CN115577306B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211562330.0A CN115577306B (en) 2022-12-07 2022-12-07 Self-adaptive density clustering-based method for detecting travel tide area of shared bicycle

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211562330.0A CN115577306B (en) 2022-12-07 2022-12-07 Self-adaptive density clustering-based method for detecting travel tide area of shared bicycle

Publications (2)

Publication Number Publication Date
CN115577306A CN115577306A (en) 2023-01-06
CN115577306B true CN115577306B (en) 2023-04-28

Family

ID=84590435

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211562330.0A Active CN115577306B (en) 2022-12-07 2022-12-07 Self-adaptive density clustering-based method for detecting travel tide area of shared bicycle

Country Status (1)

Country Link
CN (1) CN115577306B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115879016B (en) * 2023-02-20 2023-05-16 中南大学 Prediction method of tidal period for shared bicycle travel
CN118365501B (en) * 2024-06-17 2024-09-10 中南大学 Shared bicycle rebalancing method guided by balancing area division and rewarding mechanism

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107240277A (en) * 2016-03-27 2017-10-10 深圳市以捷创新科技有限公司 The drive dynamic control device of the tide lane signal lamp system of smart electronicses police
JP7206583B2 (en) * 2016-11-25 2023-01-18 株式会社リコー Information processing device, imaging device, device control system, moving object, information processing method and program
US10446022B2 (en) * 2017-06-09 2019-10-15 Here Global B.V. Reversible lane active direction detection based on GNSS probe data
CN107886202A (en) * 2017-12-01 2018-04-06 南京熊猫电子股份有限公司 Effectively solve the problems, such as the method for shared resource tide and scheduling system
CN109544191B (en) * 2018-11-28 2020-11-13 淮北智行信息科技有限公司 Shared vehicle settlement exception handling method, vehicle, system, device and server
CN112884327A (en) * 2021-02-25 2021-06-01 湖南喜宝达信息科技有限公司 Electric bicycle tide scheduling method, server system and readable storage medium
CN114495562A (en) * 2021-12-30 2022-05-13 中兴智能交通股份有限公司 A system and method for tidal control at the entrance and exit of a parking lot
CN114897656B (en) * 2022-07-15 2022-11-25 深圳市城市交通规划设计研究中心股份有限公司 Shared bicycle tidal area parking dredging method, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN115577306A (en) 2023-01-06

Similar Documents

Publication Publication Date Title
CN115577306B (en) Self-adaptive density clustering-based method for detecting travel tide area of shared bicycle
CN110738856B (en) Mobile clustering-based urban traffic jam fine identification method
Biagioni et al. Map inference in the face of noise and disparity
CN108415975B (en) A taxi passenger hot spot identification method based on BDCH-DBSCAN
Liu et al. Discovering spatio-temporal causal interactions in traffic data streams
WO2016107463A1 (en) Transaction risk detection method and device
CN105740904B (en) A Travel and Activity Pattern Recognition Method Based on DBSCAN Clustering Algorithm
CN109557532B (en) Tracking before detection method and radar target detection system based on 3D Hough transform
Li et al. Knowledge-based trajectory completion from sparse GPS samples
CN104750830B (en) The cycle method for digging of time series data
Li et al. Automatic generation and validation of road maps from GPS trajectory data sets
CN111275975B (en) Method and device for acquiring intersection turning flow data and storage medium
CN116132923A (en) A high-precision space-time trajectory recovery method based on mobile phone signaling data
CN112001401A (en) Instance segmentation network training model and training method, instance segmentation network
Ou et al. A data‐driven approach to determining freeway incident impact areas with fuzzy and graph theory‐based clustering
CN103914558A (en) Method for mining space-time aggregation patterns of meteorological elements on basis of space-time statistics
CN104408923A (en) Method and device for evaluating traffic state
Tang et al. A novel method for road intersection construction from vehicle trajectory data
CN116010838A (en) A Vehicle Trajectory Clustering Method Fused with Density Value and K-means Algorithm
CN111444286B (en) Long-distance traffic node relevance mining method based on trajectory data
CN116828397B (en) Track information acquisition method and device, electronic equipment and storage medium
CN116151493B (en) Traffic congestion prediction method and device based on head effect and recurrent neural network
CN116668958A (en) Urban resident dynamic OD matrix extraction method based on mobile phone signaling data
CN107239461B (en) Road island determination method and device
CN114462609A (en) A Trajectory Restoration Method for Floating Car Data Based on Hidden Markov Model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant