[go: up one dir, main page]

CN109063034B - Indoor space semantic value calculation method based on space and social multimedia data - Google Patents

Indoor space semantic value calculation method based on space and social multimedia data Download PDF

Info

Publication number
CN109063034B
CN109063034B CN201810779775.1A CN201810779775A CN109063034B CN 109063034 B CN109063034 B CN 109063034B CN 201810779775 A CN201810779775 A CN 201810779775A CN 109063034 B CN109063034 B CN 109063034B
Authority
CN
China
Prior art keywords
social
indoor
text
evaluation
contribution degree
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810779775.1A
Other languages
Chinese (zh)
Other versions
CN109063034A (en
Inventor
陈珂
寿黎但
喻影
陈刚
江大伟
伍赛
胡天磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN201810779775.1A priority Critical patent/CN109063034B/en
Publication of CN109063034A publication Critical patent/CN109063034A/en
Application granted granted Critical
Publication of CN109063034B publication Critical patent/CN109063034B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data

Landscapes

  • Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Engineering & Computer Science (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Game Theory and Decision Science (AREA)
  • Data Mining & Analysis (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本发明公开了一种基于空间和社交多媒体数据的室内空间语义价值计算方法。将室内独立子空间的语义价值划分为区位和社交贡献度,融合得室内独立子空间的语义价值;分析各室内独立子空间拓扑连接关系,构建室内空间模型,对各物体间距离进行计算,根据用户行走约束计算用户在室内空间中的各条行走路径的选择概率构建路径矩阵,计算区位贡献度;根据室内独立子空间的社交多媒体数据,分析句子结构并提取关键词,构建情感分类基分类器;将句子集合划分为关键句和非关键句,对社交评价文本进行分类;根据社交评价文本的分类计算社交贡献度。本发明的计算室内空间价值方法能用于解决新引进店铺的价值预测,多维度店铺评估等关键问题。The invention discloses an indoor space semantic value calculation method based on space and social multimedia data. The semantic value of the indoor independent subspace is divided into location and social contribution, and the semantic value of the indoor independent subspace is fused; the topological connection relationship of each indoor independent subspace is analyzed, the indoor space model is constructed, and the distance between each object is calculated. The user walking constraint calculates the selection probability of each walking path of the user in the indoor space, constructs a path matrix, and calculates the location contribution; according to the social multimedia data of the indoor independent subspace, analyzes the sentence structure and extracts keywords to construct a sentiment classification base classifier ; Divide the sentence set into key sentences and non-key sentences, and classify the social evaluation text; calculate the social contribution degree according to the classification of the social evaluation text. The method for calculating indoor space value of the present invention can be used to solve key problems such as value prediction of newly introduced stores, multi-dimensional store evaluation and the like.

Description

Indoor space semantic value calculation method based on space and social multimedia data
Technical Field
The invention relates to a spatial modeling and data processing method combining the geographic field and the machine learning field, in particular to an indoor spatial semantic value calculation method based on spatial and social multimedia data.
Background
The indoor space is composed of a plurality of indoor independent subspaces and provides abundant goods or services and a elegant environment for people. Indoor space managers increasingly emphasize important contribution of indoor independent sub-spaces with high semantic value to indoor space operation success, so that how to fully utilize existing data to model and quantitatively evaluate indoor space semantic value and how to predict semantic value of newly introduced indoor independent sub-spaces to be recruited are focuses and hot spots concerned by current academic circles and industrial circles.
However, the existing research on the influence factors of the space value in the shopping center is mainly established on the limited and qualitative sample data collected by the research personnel in the academic world, and the research result is not only influenced by the level and efficiency of the research personnel, but also consumes a great deal of manpower and time cost; meanwhile, as such data samples suffer from the deficiency that is difficult to ignore in the dual dimensions of time and space, the uncertainty of various research results is greatly increased.
Disclosure of Invention
The invention aims to innovatively use geographical location data of indoor independent subspaces in an indoor space and fresh social multimedia data on the Internet to quantitatively research the value of the indoor space aiming at the defects of the prior art.
The technical scheme adopted by the invention for solving the technical problems is as follows:
the method depends on the support of spatial data and social multimedia data, applies methods such as an indoor space model, a reachability theory and emotion analysis, and obtains semantic values of indoor independent sub-spaces through accurate and rapid calculation from two aspects of technology and data.
The method comprises the following steps: firstly, dividing the semantic value of the indoor independent subspace into two sub-problems of a zone bit contribution degree and a social contribution degree, respectively calculating the zone bit contribution degree and the social contribution degree of the indoor independent subspace, and then fusing the zone bit contribution degree and the social contribution degree of the indoor independent subspace to obtain the semantic value of the indoor independent subspace.
The position contribution degree of the indoor independent subspace is calculated by adopting the following method:
A) analyzing topological connection relations among the indoor independent sub-spaces according to the two-dimensional data of the indoor space by using a generation algorithm, and constructing an indoor space model;
B) calculating the distance between objects in the indoor space model by using a distance measurement method, and meanwhile, calculating the selection probability of each walking path of the user in the indoor space according to the walking constraint of the user to construct a path matrix;
C) and calculating the zone bit contribution degree of the indoor independent subspace by utilizing the path matrixes of different floors.
The social contribution degree of the indoor independent subspace is calculated in the following mode:
a) analyzing sentence structures of social evaluation texts and extracting keywords for weighting according to social multimedia data of indoor independent subspaces on the Internet to construct an emotion classification base classifier;
b) a key sentence extraction algorithm is adopted, a sentence set of the social evaluation text is divided into key sentences and non-key sentences, and the social evaluation text is classified by combining a classifier fusion method;
c) and calculating the social contribution degree of each indoor independent subspace according to the classification of the social evaluation texts of the indoor independent subspaces.
The indoor independent subspace location contribution degree is calculated by the indoor independent subspace location contribution degree calculating module in the following mode, and the indoor independent subspace location contribution degree calculating module comprises an SVG map obtaining module, an indoor space modeling module and a location calculating module:
step 1.1, acquiring an indoor map of an indoor space in an SVG format by an SVG map acquisition module in a tracing mode, and analyzing the indoor map into a set consisting of various geometric elements;
and splitting the indoor maps of multiple floors into indoor maps of each single floor for processing.
Step 1.2, in an indoor space modeling module, analyzing a set formed by various geometric elements through a generation algorithm to obtain topological connection relations among the geometric elements, wherein the topological relations comprise polygonal elements, communication doors of the polygonal elements and communication relations among the polygonal elements;
step 1.3, calculating and obtaining the position contribution degree of the indoor independent subspace through a position calculation module
Step 1.3.1: adding user walking constraints on the basis of topological connection relations among all geometric elements, and constructing and obtaining an indoor space distance model with constraints;
step 1.3.2: with the two data structures D2R and R2D as hybrid indices, the hybrid indices are used to compute the distances dis of sub-paths between different polygon elementsi
Step 1.3.3: calculating the selection probability p of each different walking path from the outside of the indoor space to the indoor independent subspacekThe walking path is formed by sub-paths among different polygon elements;
step 1.3.4: for each indoor independent subspace, combining the path lengths and the selection probabilities of different walking paths, establishing a data structure of a path matrix:
for the ith indoor independent subspace of the jth floor, the following path matrix is established:
Figure BDA0001732272310000021
wherein: p is a radical ofkRepresenting the probability, dis, of the k-th travel path that may be selected to travel from different exterior doorways of the indoor space to the indoor independent subspacekRepresenting the path length of the k-th walking path which can be selected from the walking from the different external entrances and exits of the indoor space to the indoor independent subspace, wherein k is 1-n, k represents the ordinal number of the path, and n represents the total number of the paths;
step 1.3.5: considering a user as a shared resource for all indoor independent subspaces, the size of a scaled resource can be represented by a path selection probability p based on user behavior, while the distance between the resource and the target point is represented by a path distance dis calculated by a graphical model. Calculating the zone bit contribution degree of the indoor independent subspace by using the path matrix of all possible walking paths reaching the indoor independent subspace, and specifically calculating the following formula:
Figure BDA0001732272310000031
wherein, Value (store)i) Indicating the location contribution, value (store) of the i-th indoor independent subspacejk) Indicating the position contribution degree of the k-th walking path passing through the j-th floor; β represents the magnitude of the attenuation coefficient and is typically set to 1,1.2, 1.5;
Figure BDA0001732272310000032
indicating the intensity of the attenuation with distance.
The larger the calculation result value is, the stronger the accessibility of the indoor independent subspace is, and the higher the position contribution degree is.
The user walking constraint in step 1.3 refers to a movement constraint of the user from the corridor to the destination indoor independent subspace within the indoor space of the indoor space.
The position contribution degree of the indoor independent subspace is calculated by the indoor independent subspace social contribution degree calculating module in the following mode, wherein the indoor independent subspace social contribution degree calculating module comprises an internet crawler module, a text processing module and an emotion analysis module:
social multimedia data includes news data, social evaluation text data, marketing data, and the like.
2.1, acquiring a social media data set of each indoor independent subspace under the Internet by adopting an Internet crawler module;
2.2, the social media data set contains a social evaluation text, then, a text processing module is used for carrying out de-duplication of the social evaluation text and text vectorization of the social evaluation text, and dependency syntax is used for analyzing sentence structures in the social evaluation text and carrying out part-of-speech tagging;
2.3, evaluating and calculating social contribution degree by adopting emotion analysis module
Step 2.3.1: calculating a TF-IDF characteristic value of each word in each social evaluation text by using a word frequency inverse document frequency TF-IDF algorithm;
step 2.3.2: an emotional word weighting algorithm based on dependency syntactic analysis is proposed. Extracting a designated relationship pair from a sentence structure of the social evaluation text, and if the designated relationship pair is extracted, multiplying the TF-IDF characteristic value of the key emotional word by an enhancement parameter b to enhance; if the appointed relation pair is not extracted, the TF-IDF characteristic value of the key emotional word is not processed;
step 2.3.3: a keyword extraction algorithm suitable for Chinese text is proposed. And dividing all sentences in each social evaluation text into independent and non-coincident parts according to the keyword attribute, the emotional word attribute, the punctuation mark attribute and the position attribute of the sentences. The social rating text d is divided into a series of sentence components: d ═ s1,s2,…,smWhere m represents the number of sentences, siRepresenting the ith sentence;
each sentence siDividing the sentence into a series of word components, and calculating the evaluation value of each sentence by adopting the following formula:
Figure BDA0001732272310000041
wherein, f(s)i) Weight of attribute, λ, representing the ith sentence1Representing a keyword attribute weight, λ2Representing emotional word attribute weight, λ3Representing punctuation mark attribute weights, λ4Representing the location attribute weight of the sentence in the social rating text,
Figure BDA0001732272310000042
an evaluation value representing an attribute of the keyword,
Figure BDA0001732272310000043
comments indicating the attributes of emotional wordsThe value of the method is as follows,
Figure BDA0001732272310000044
an evaluation value representing the attribute of the punctuation mark,
Figure BDA0001732272310000045
an evaluation value indicating a position attribute;
finally, a sentence with the highest evaluation value in the social evaluation text is taken as an emotion key sentence of the social evaluation text;
step 2.3.4: obtaining emotion key sentence classifier f through training of all labeled social evaluation text samples1Non-emotion key sentence classifier f2And full text classifier f3And labeling the to-be-detected object of the unlabeled social evaluation text corresponding to the indoor independent subspace by adopting a classifier fusion processing mode, wherein the labeling refers to labeling the social evaluation text with emotional tendency which is divided into two types of positive emotional tendency and negative emotional tendency, and specifically comprises the following steps:
training emotion key sentence classifier f by using emotion key sentences in labeled social evaluation text1Using the trained emotion key sentence classifier f1Processing the social evaluation text which is not labeled to obtain a first group of probability values of the social evaluation text on emotional tendency;
training a non-emotion key sentence classifier f using sentences other than emotion key sentences in labeled social evaluation text2Using trained non-emotion key sentence classifier f2Processing the social evaluation texts which are not labeled to obtain a second group of probability values of the social evaluation texts on emotional tendency;
training full-text classifier f using ensemble of labeled social rating texts3Using trained full-text classifier f3Processing the social evaluation texts which are not labeled to obtain a third group of probability values of the social evaluation texts on emotional tendency;
and finally, selecting the maximum probability value of the positive emotional tendency and the maximum probability value of the negative emotional tendency from the three groups of probability values, and comparing the sizes of the probability values: if the maximum probability value of the positive emotional tendency is greater than the maximum probability value equal to the negative emotional tendency, the social evaluation text is marked as good evaluation; if the maximum probability value of the positive emotional tendency is smaller than the maximum probability value of the negative emotional tendency, the social evaluation text is marked as poor evaluation;
step 2.3.5: calculating the social contribution degree S of the indoor independent subspace by adopting the following formula:
N=u+v
p=u/n
Figure BDA0001732272310000051
wherein S represents the social contribution degree of the indoor independent subspace, u represents the total number of good comments in all social evaluation texts, v represents the total number of bad comments in all social evaluation texts, N represents the total number of all social evaluation texts, p represents the good comment rate, and z represents the social contribution degree of the indoor independent subspaceαIs the quantile of a normal distribution.
The invention has the beneficial effects that:
the method applies methods such as space modeling, reachability theory, emotion analysis and the like, and starts from two aspects of technology and data, and a set of evaluation algorithm and model for calculating indoor independent subspace semantic value are constructed;
the automatic construction of the shopping center indoor space model with customer walking constraint is realized, and the calculation of the distance model is optimized by a mixed index method in the calculation process;
according to the emotion classification method, the emotion classification accuracy and the recall rate are improved through the dependency syntax analysis-based emotion word weighting classifier construction method and the semi-supervised learning-based key sentence extraction and classifier fusion algorithm.
Drawings
FIG. 1 is a block diagram showing the embodiment of the present invention
FIG. 2 is a flowchart of a location contribution algorithm.
FIG. 3 is a flow chart of a social contribution algorithm.
Detailed Description
The technical solution of the present invention will now be further explained with reference to specific embodiments and examples.
Referring to fig. 1, the embodiment of the present invention and its implementation are as follows:
in order to evaluate and predict different store values in a shopping center, the problem can be decomposed into two parts, one part is calculation of the store location contribution degree based on geographical accessibility, the other part is calculation of the store social contribution degree based on brand effect, and then the calculation results of the two parts are fused to obtain the comprehensive location contribution degree of the store.
Step 1: in the store location contribution degree calculation module, inputting an original plane map provided by a shopping center, and outputting the location contribution degree of each store, specifically:
the shop location contribution degree calculation module comprises an SVG map acquisition module, an indoor space modeling module and a location calculation module.
1.1, acquiring an indoor map of a shopping center in an SVG format by an SVG map acquisition module in a tracing mode, and analyzing the indoor map into a set consisting of various geometric elements;
the SVG map acquisition module of specific implementation can adopt the cool happy space map editor.
1.2, in an indoor space modeling module, analyzing a set formed by various geometric elements through a generation algorithm based on Rtree to obtain a topological connection relation among the geometric elements, wherein the topological connection relation comprises a polygonal element, a communication door of the polygonal element and a communication relation among the polygonal elements;
in a specific implementation, the polygon elements include shops, corridors, facilities, escalators, atrium and the like, and the communication relationship between the polygon elements is, for example, the communication relationship between the shops and the corridors.
1.3 zone bit computing Module
As shown in fig. 2, the specific steps are as follows:
step 1.3.1: adding customer walking constraints on the basis of topological connection relations among all geometric elements, and constructing and obtaining an indoor space distance model with constraints;
the customer walking restriction refers to the movement restriction of the customer from the corridor to the target shop in the indoor space of the shopping center.
Step 1.3.2: with the two data structures D2R and R2D as hybrid indices, the hybrid indices are used to compute the distances dis of sub-paths between different polygon elementsi
In a specific embodiment, the sub-paths are, for example, from the main communication door to the escalator, from the escalator to the store, from the main communication door to the store, and the like.
Step 1.3.3: calculating the selection probability p of each different walking path of the customer from the outside of the shopping center to the shopkThe walking path is formed by sub-paths among different polygon elements;
for example, the probability for a customer's total path from outside the mall to two floors is calculated as: and the probability of selecting to enter the gate of the shopping center is the probability of selecting to enter the escalator after entering the gate of the shopping center, and the probability of selecting to go upwards after entering the escalator.
Step 1.3.4: for each shop, combining the path lengths and the selection probabilities of different walking paths, establishing a data structure of a path matrix:
for the ith store on the jth floor, the following path matrix is established:
Figure BDA0001732272310000061
wherein: p is a radical ofkRepresenting the probability, dis, of walking from different exterior entrances to the kth walking path that the store may choosekA path length representing a k-th walking path which can be selected by walking from an external entrance of a shopping center to a shop, wherein k is 1 to n, k represents the ordinal number of the path, and n represents the total number of the paths;
(1) number of rows n of matrix: based on different exterior entrances and exits, a total of n travel paths may arrive at the store, where the size of n is related to floor j. The path matrix of the stores on the same floor has the same number n of rows. The number n of rows of the matrix corresponds to the selectable strength of the location influence factors.
(2) First column p of the matrixk:pkRepresenting the probability of walking from different exterior doorways of the shopping mall to the k-th walking path that the store may choose. In which shops located on the same floor j have the same first column p of the matrixk. Matrix element pkRepresenting the size of the target population in the location impact factor.
(3) Second column dis of the matrixk:diskRepresenting the path length of the k-th travel path that may be selected to travel from different exterior doorways of the shopping mall to the store. Where different shops i, located on the same floor j, have different path distances. Matrix element diskRepresenting the strength of reachability among location influencers.
Step 1.3.5: calculating the position contribution degree of the shop by using the path matrix of all possible walking paths reaching the shop, and specifically calculating the following formula:
Figure BDA0001732272310000071
wherein, Value (store)i) Indicates the location contribution, Value (store) of the ith storejk) The zone bit contribution degree of the k-th walking path passing through the jth floor is shown, and beta represents the magnitude of the attenuation coefficient and is generally set to be 1,1.2 and 1.5;
Figure BDA0001732272310000072
indicating the intensity of the attenuation with distance.
Step 2: in the store social contribution degree calculation module, the social multimedia data of all stores in the same shopping center on the internet is input, and the social contribution degree evaluation and ranking of all stores are output.
The shop social contribution degree calculation module comprises an internet crawler module, a text processing module and an emotion analysis module.
2.1, acquiring a social media data set of each shop under an internet portal website by adopting an internet crawler module as a problem input;
in a specific implementation, the social media data set employs a collection of user comments.
2.2, the social media data set contains user comments, then a text processing module is used for carrying out duplication elimination on the user comments and text vectorization on the user comments, and dependency syntax is used for analyzing sentence structures in the user comments and carrying out part-of-speech tagging;
2.3, an emotion analysis module is adopted to evaluate and calculate the social contribution degree, a supervision algorithm technology and a semi-supervision algorithm technology are adopted in the emotion analysis module, and a specific algorithm flow is shown in fig. 3.
Step 2.3.1: calculating the TF-IDF characteristic value of each word in each user comment by using a word frequency inverse document frequency TF-IDF algorithm;
step 2.3.2: an emotional word weighting algorithm based on dependency syntactic analysis is proposed. Extracting a specified relation pair from a sentence structure of the user comment, and if the specified relation pair is extracted, multiplying the TF-IDF characteristic value of the key emotional word by an enhancement parameter b to enhance; if the appointed relation pair is not extracted, the TF-IDF characteristic value of the key emotional word is not processed;
in a specific implementation, the specified relationship pair is extracted by adopting a dependency relationship-based emotion word extraction rule table which is specifically shown as follows:
TABLE 1 Emotion term extraction rule Table based on dependency relationship
Figure BDA0001732272310000073
Figure BDA0001732272310000081
In the above table, & denotes the relation of sum/simultaneous relation, adj denotes an adjective, n denotes a noun, adv denotes an adverb, and vt denotes a verb.
For example, the piece of clothing looks good, extract "(clothing, look)", belongs to the first category in the rules of the table above; the vegetables are very delicious and are strongly recommended to be extracted (dish, recommendation), and belong to the fifth category in the rules of the table "
Step 2.3.3: a key sentence extraction algorithm suitable for Chinese text is proposed.
And dividing all sentences in each user comment into independent and non-coincident parts according to the keyword attribute, the emotional word attribute, the punctuation mark attribute and the position attribute of the sentences.
Dividing the user comment d into a series of sentence components: d ═ s1,s2,…,smWhere m represents the number of sentences, siRepresenting the ith sentence;
each sentence siDividing the sentence into a series of word components, and calculating the evaluation value of each sentence by adopting the following formula:
Figure BDA0001732272310000083
wherein, f(s)i) Weight of attribute, λ, representing the ith sentence1Representing a keyword attribute weight, λ2Representing emotional word attribute weight, λ3Representing punctuation mark attribute weights, λ4Representing the location attribute weight of the sentence in the user comment,
Figure BDA0001732272310000084
an evaluation value representing an attribute of the keyword,
Figure BDA0001732272310000085
an evaluation value representing the attribute of an emotional word,
Figure BDA0001732272310000086
an evaluation value representing the attribute of the punctuation mark,
Figure BDA0001732272310000087
an evaluation value indicating a position attribute;
the above keywords are extracted using the following table:
TABLE 2
Figure BDA0001732272310000082
Figure BDA0001732272310000091
Extracting the emotional words by using a hawnet general word library, and adopting punctuation marks belonging to the following range {? | a! And extracting- }, wherein the position attribute is extracted in a manner of weighting the first sentence and the last sentence of each comment. The individual attribute weights are obtained by maximizing the classification accuracy.
For the evaluation value of each attribute, if there are several such attributes in a sentence, the number of such attributes is given as the evaluation value of the attribute.
Finally, taking the sentence with the highest evaluation value in the user comments as an emotion key sentence of the user comments;
step 2.3.4: obtaining emotion key sentence classifier f through training of all marked user comments1Non-emotion key sentence classifier f2And full text classifier f3And labeling the unlabeled user comments aiming at the shops by adopting a classifier fusion processing mode, wherein the labeling refers to labeling emotional tendency on the user comments, and the emotional tendency is divided into two types of positive emotional tendency and negative emotional tendency, specifically:
training emotion key sentence classifier f by using emotion key sentences in labeled user comments1Using the trained emotion key sentence classifier f1Processing the user comments which are not marked to obtain a first group of probability values of the user comments on the emotional tendency;
training non-emotion key sentence classifier f using sentences other than emotion key sentences in labeled user comments2Using trained non-emotion key sentence classifier f2Processing the user comments which are not marked to obtain a second group of probability values of the user comments on the emotional tendency;
use pairTraining full-text classifier f by using marked user comment entirety3Using trained full-text classifier f3Processing the user comments which are not marked to obtain a third group of probability values of the user comments on the emotional tendency;
in a specific implementation, each group of probability values comprises a probability value of positive emotional tendency and a probability value of negative emotional tendency, and the classifier adopts an LR classifier.
And finally, selecting the maximum probability value of the positive emotional tendency and the maximum probability value of the negative emotional tendency from the three groups of probability values, and comparing the sizes of the probability values: if the maximum probability value of the positive emotional tendency is greater than the maximum probability value equal to the negative emotional tendency, marking the comment of the user as good comment; if the maximum probability value of the positive emotional tendency is smaller than the maximum probability value of the negative emotional tendency, marking the comment of the user as poor comment;
the specific fusion method assumes that the class of each unlabeled sample is determined by the sub-classifier with the highest confidence, and the above method avoids the adverse effect of noise features in classification decision.
Step 2.3.5: social contribution to the store is calculated and ranked based on the number of reviews and the goodness of review. The social contribution S of the store is calculated using the following formula:
N=u+v
p=u/n
Figure BDA0001732272310000092
wherein S represents the social contribution of the store, u represents the total number of good reviews in all user reviews, v represents the total number of bad reviews in all user reviews, N represents the total number of all user reviews, p represents the good review rate, z represents the social contribution of the store, andαis the quantile of a normal distribution, in general, zαThe value is only required to be 2.
And step 3: after the two subproblems of the location contribution degree and the social contribution degree of the shop are solved, the two calculation results can be combined, and the combined location contribution degree of the shop can be calculated and evaluated by a fusion method to serve as the output of the whole research problem.
Implementations may use linear weighting for fusion.

Claims (4)

1. A method for calculating indoor space semantic value based on space and social multimedia data is characterized in that: the method comprises the following steps: firstly, dividing the semantic value of the indoor independent subspace into two sub-problems of a locational contribution degree and a social contribution degree, respectively calculating the locational contribution degree and the social contribution degree of the indoor independent subspace, and then fusing the locational contribution degree and the social contribution degree of the indoor independent subspace to obtain the semantic value of the indoor independent subspace;
the indoor independent subspace location contribution degree is calculated by the indoor independent subspace location contribution degree calculating module in the following mode, and the indoor independent subspace location contribution degree calculating module comprises an SVG map obtaining module, an indoor space modeling module and a location calculating module:
step 1.1, acquiring an indoor map of an indoor space in an SVG format by an SVG map acquisition module in a tracing mode, and analyzing the indoor map into a set consisting of various geometric elements;
step 1.2, in an indoor space modeling module, analyzing a set formed by various geometric elements through a generation algorithm to obtain topological connection relations among the geometric elements, wherein the topological relations comprise polygonal elements, communication doors of the polygonal elements and communication relations among the polygonal elements;
step 1.3, calculating and obtaining the position contribution degree of the indoor independent subspace through a position calculation module
Step 1.3.1: adding user walking constraints on the basis of topological connection relations among all geometric elements, and constructing and obtaining an indoor space distance model with constraints;
step 1.3.2: with the two data structures D2R and R2D as hybrid indices, the hybrid indices are used to compute the distances dis of sub-paths between different polygon elementsi
Step 1.3.3: computing each failure of a user to reach an indoor independent subspace from outside the indoor spaceProbability p of selection of same walking pathkThe walking path is formed by sub-paths among different polygon elements;
step 1.3.4: for each indoor independent subspace, combining the path lengths and the selection probabilities of different walking paths, establishing a data structure of a path matrix:
for the ith indoor independent subspace of the jth floor, the following path matrix is established:
Figure FDA0003261436230000011
wherein: p is a radical ofkRepresenting the probability, dis, of the k-th travel path that may be selected to travel from different exterior doorways of the indoor space to the indoor independent subspacekRepresenting the path length of the k-th walking path which can be selected from the walking from the different external entrances and exits of the indoor space to the indoor independent subspace, wherein k is 1-n, k represents the ordinal number of the path, and n represents the total number of the paths;
step 1.3.5: calculating the zone bit contribution degree of the indoor independent subspace by using the path matrix of all possible walking paths reaching the indoor independent subspace, and specifically calculating the following formula:
Figure FDA0003261436230000021
wherein, Value (store)i) Indicating the location contribution, value (store) of the i-th indoor independent subspacejk) Indicating the position contribution degree of the k-th walking path passing through the j-th floor; beta represents the magnitude of the attenuation coefficient,
Figure FDA0003261436230000022
indicating the intensity of attenuation with distance;
the social contribution degree of the indoor independent subspace is calculated by the indoor independent subspace social contribution degree calculating module in the following mode, wherein the indoor independent subspace social contribution degree calculating module comprises an internet crawler module, a text processing module and an emotion analysis module:
2.1, acquiring a social media data set of each indoor independent subspace under the Internet by adopting an Internet crawler module;
2.2, the social media data set contains a social evaluation text, then, a text processing module is used for carrying out de-duplication of the social evaluation text and text vectorization of the social evaluation text, and dependency syntax is used for analyzing sentence structures in the social evaluation text and carrying out part-of-speech tagging;
2.3, evaluating and calculating social contribution degree by adopting emotion analysis module
Step 2.3.1: calculating a TF-IDF characteristic value of each word in each social evaluation text by using a word frequency inverse document frequency TF-IDF algorithm;
step 2.3.2: extracting a designated relationship pair from a sentence structure of the social evaluation text, and if the designated relationship pair is extracted, multiplying the TF-IDF characteristic value of the key emotional word by an enhancement parameter b to enhance; if the appointed relation pair is not extracted, the TF-IDF characteristic value of the key emotional word is not processed;
step 2.3.3: the social rating text d is divided into a series of sentence components: d ═ s1,s2,…,smWhere m represents the number of sentences, siRepresenting the ith sentence;
each sentence siDividing the sentence into a series of word components, and calculating the evaluation value of each sentence by adopting the following formula:
Figure FDA0003261436230000023
wherein, f(s)i) Weight of attribute, λ, representing the ith sentence1Representing a keyword attribute weight, λ2Representing emotional word attribute weight, λ3Representing punctuation mark attribute weights, λ4Representing the location attribute weight of the sentence in the social rating text,
Figure FDA0003261436230000024
an evaluation value representing an attribute of the keyword,
Figure FDA0003261436230000025
an evaluation value representing the attribute of an emotional word,
Figure FDA0003261436230000026
an evaluation value representing the attribute of the punctuation mark,
Figure FDA0003261436230000027
an evaluation value indicating a position attribute;
finally, a sentence with the highest evaluation value in the social evaluation text is taken as an emotion key sentence of the social evaluation text;
step 2.3.4: obtaining emotion key sentence classifier f through training of all labeled social evaluation text samples1Non-emotion key sentence classifier f2And full text classifier f3And labeling the to-be-detected object of the unlabeled social evaluation text corresponding to the indoor independent subspace by adopting a classifier fusion processing mode, wherein the labeling refers to labeling the social evaluation text with emotional tendency which is divided into two types of positive emotional tendency and negative emotional tendency, and specifically comprises the following steps:
training emotion key sentence classifier f by using emotion key sentences in labeled social evaluation text1Using the trained emotion key sentence classifier f1Processing the social evaluation text which is not labeled to obtain a first group of probability values of the social evaluation text on emotional tendency;
training a non-emotion key sentence classifier f using sentences other than emotion key sentences in labeled social evaluation text2Using trained non-emotion key sentence classifier f2Processing the social evaluation texts which are not labeled to obtain a second group of probability values of the social evaluation texts on emotional tendency;
training full-text classifier f using ensemble of labeled social rating texts3Using trained full-text classifier f3Processing the social evaluation texts which are not labeled to obtain a third group of probability values of the social evaluation texts on emotional tendency;
and finally, selecting the maximum probability value of the positive emotional tendency and the maximum probability value of the negative emotional tendency from the three groups of probability values, and comparing the sizes of the probability values: if the maximum probability value of the positive emotional tendency is greater than the maximum probability value equal to the negative emotional tendency, the social evaluation text is marked as good evaluation; if the maximum probability value of the positive emotional tendency is smaller than the maximum probability value of the negative emotional tendency, the social evaluation text is marked as poor evaluation;
step 2.3.5: calculating the social contribution degree S of the indoor independent subspace by adopting the following formula:
N=u+v
p=u/n
Figure FDA0003261436230000031
wherein S represents the social contribution degree of the indoor independent subspace, u represents the total number of good comments in all social evaluation texts, v represents the total number of bad comments in all social evaluation texts, N represents the total number of all social evaluation texts, p represents the good comment rate, and z represents the social contribution degree of the indoor independent subspaceαIs the quantile of a normal distribution.
2. The method for calculating the semantic value of the indoor space based on the spatial and social multimedia data as claimed in claim 1, wherein the method comprises the following steps: the position contribution degree of the indoor independent subspace is calculated by adopting the following method:
A) analyzing the topological connection relation among the indoor independent subspaces according to the indoor space data by using a generation algorithm to construct an indoor space model;
B) calculating the distance between objects in the indoor space model by using a distance measurement method, and meanwhile, calculating the selection probability of each walking path of the user in the indoor space according to the walking constraint of the user to construct a path matrix;
C) and calculating the zone bit contribution degree of the indoor independent subspace by utilizing the path matrixes of different floors.
3. The method for calculating the semantic value of the indoor space based on the spatial and social multimedia data as claimed in claim 1, wherein the method comprises the following steps: the social contribution degree of the indoor independent subspace is calculated in the following mode:
a) analyzing sentence structures of social evaluation texts and extracting keywords for weighting according to social multimedia data of the indoor independent subspace to construct an emotion classification base classifier;
b) a key sentence extraction algorithm is adopted, a sentence set of the social evaluation text is divided into key sentences and non-key sentences, and the social evaluation text is classified by combining a classifier fusion method;
c) and calculating the social contribution degree of each indoor independent subspace according to the classification of the social evaluation texts of the indoor independent subspaces.
4. The method for calculating the semantic value of the indoor space based on the spatial and social multimedia data as claimed in claim 1, wherein the method comprises the following steps: the user walking constraint in step 1.3 refers to a movement constraint of the user from the corridor to the destination indoor independent subspace within the indoor space of the indoor space.
CN201810779775.1A 2018-07-16 2018-07-16 Indoor space semantic value calculation method based on space and social multimedia data Active CN109063034B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810779775.1A CN109063034B (en) 2018-07-16 2018-07-16 Indoor space semantic value calculation method based on space and social multimedia data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810779775.1A CN109063034B (en) 2018-07-16 2018-07-16 Indoor space semantic value calculation method based on space and social multimedia data

Publications (2)

Publication Number Publication Date
CN109063034A CN109063034A (en) 2018-12-21
CN109063034B true CN109063034B (en) 2022-01-04

Family

ID=64816865

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810779775.1A Active CN109063034B (en) 2018-07-16 2018-07-16 Indoor space semantic value calculation method based on space and social multimedia data

Country Status (1)

Country Link
CN (1) CN109063034B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113553815A (en) * 2020-04-26 2021-10-26 阿里巴巴集团控股有限公司 Intelligent report description automatic generation method and device based on hierarchical attention pointer generation network

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102163189A (en) * 2010-02-24 2011-08-24 富士通株式会社 Method and device for extracting evaluative information from critical texts
CN102884779A (en) * 2010-02-24 2013-01-16 数字标记公司 Intuitive computing methods and systems
CN104243511A (en) * 2013-06-07 2014-12-24 寻觅科技股份有限公司 Object tracking service system and method thereof
CN104268180A (en) * 2014-09-16 2015-01-07 贵州榆木科技有限公司 System and method for providing personalized markers for electronic map information user
CN104471575A (en) * 2012-05-18 2015-03-25 文件档案公司 Using content
CN105719191A (en) * 2016-01-20 2016-06-29 东北大学 System and method for social group discovery with uncertain behavioral semantics in multi-scale space
CN106372229A (en) * 2016-09-08 2017-02-01 中国科学院遥感与数字地球研究所 Indoor moving object behavior modeling method supporting complex semantic query
CN107038609A (en) * 2017-04-24 2017-08-11 广州华企联信息科技有限公司 A kind of Method of Commodity Recommendation and system based on deep learning
CN107301166A (en) * 2017-02-13 2017-10-27 上海大学 Towards the multi-level features model and characteristic evaluation method of cross-cutting progress information extraction
CN108052583A (en) * 2017-11-17 2018-05-18 康成投资(中国)有限公司 Electric business body constructing method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070011051A1 (en) * 2005-07-08 2007-01-11 Onestop Media Group Adaptive advertisements

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102163189A (en) * 2010-02-24 2011-08-24 富士通株式会社 Method and device for extracting evaluative information from critical texts
CN102884779A (en) * 2010-02-24 2013-01-16 数字标记公司 Intuitive computing methods and systems
CN104471575A (en) * 2012-05-18 2015-03-25 文件档案公司 Using content
CN104243511A (en) * 2013-06-07 2014-12-24 寻觅科技股份有限公司 Object tracking service system and method thereof
CN104268180A (en) * 2014-09-16 2015-01-07 贵州榆木科技有限公司 System and method for providing personalized markers for electronic map information user
CN105719191A (en) * 2016-01-20 2016-06-29 东北大学 System and method for social group discovery with uncertain behavioral semantics in multi-scale space
CN106372229A (en) * 2016-09-08 2017-02-01 中国科学院遥感与数字地球研究所 Indoor moving object behavior modeling method supporting complex semantic query
CN107301166A (en) * 2017-02-13 2017-10-27 上海大学 Towards the multi-level features model and characteristic evaluation method of cross-cutting progress information extraction
CN107038609A (en) * 2017-04-24 2017-08-11 广州华企联信息科技有限公司 A kind of Method of Commodity Recommendation and system based on deep learning
CN108052583A (en) * 2017-11-17 2018-05-18 康成投资(中国)有限公司 Electric business body constructing method

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Automatic short answer scoring using words overlapping methods;Feddy Setio Pribadi;《https://doi.org/10.1063/1.4976906》;20170310;020042-1-6 *
In Search of Indoor Dense Regions:An Approach Using Indoor Positioning Data;H Li;《2019 IEEE 35th international Conference on Data Engineering》;20180131;第1-15页 *
Selecting Attributes for Sentiment Classification Using Feature Relation Networks;A Abbasi;《IEEE Transactions on Knowledge &Data Engineering》;20110323;第447-462页 *
基于商品特征关联度的购物客户评论可信排序方法;黄婷婷 等;《计算机应用》;20140810;第2322-2327页 *

Also Published As

Publication number Publication date
CN109063034A (en) 2018-12-21

Similar Documents

Publication Publication Date Title
Zhong et al. A building regulation question answering system: A deep learning methodology
CN107609132B (en) Semantic ontology base based Chinese text sentiment analysis method
Sharma et al. A document-level sentiment analysis approach using artificial neural network and sentiment lexicons
CN114265935B (en) A text mining-based decision-making support method and system for scientific and technological project establishment management
CN109726747B (en) Data fusion ordering method based on social network recommendation platform
CN109726745A (en) A goal-based sentiment classification method incorporating descriptive knowledge
CN110990718A (en) Social network model building module of company image improving system
Mozafari et al. Emotion detection by using similarity techniques
CN110688540B (en) Cheating account screening method, device, equipment and medium
CN113449111B (en) Social governance hot topic automatic identification method based on time-space semantic knowledge migration
Xiong et al. Affective impression: Sentiment-awareness POI suggestion via embedding in heterogeneous LBSNs
CN114298011A (en) Neural network, training method, aspect level emotion analysis method, device and storage medium
CN112115712A (en) Topic-based crowd sentiment analysis
Aliane et al. A genetic algorithm feature selection based approach for Arabic sentiment classification
Parthasarathy et al. Sentiment analyzer: analysis of journal citations from citation databases
CN115712780A (en) Information pushing method and device based on cloud computing and big data
CN114595693B (en) A text sentiment analysis method based on deep learning
Lu et al. Exploring the sentiment strength of user reviews
CN117556118A (en) Visual recommendation system and method based on scientific research big data prediction
Qi et al. Investigation of the influence of Twitter user habits on sentiment of their opinions towards transportation services
Wegrzyn-Wolska et al. Tweets mining for French presidential election
Zhao et al. Fuzzy sentiment membership determining for sentiment classification
CN109063034B (en) Indoor space semantic value calculation method based on space and social multimedia data
CN119128125A (en) Personalized document recommendation method and system based on knowledge graph
CN113657766A (en) A measurement method of tourist attractions happiness index based on tourist multivariate data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant