[go: up one dir, main page]

CN114328461A - Big data analysis-based enterprise innovation and growth capacity evaluation method and system - Google Patents

Big data analysis-based enterprise innovation and growth capacity evaluation method and system Download PDF

Info

Publication number
CN114328461A
CN114328461A CN202111669894.XA CN202111669894A CN114328461A CN 114328461 A CN114328461 A CN 114328461A CN 202111669894 A CN202111669894 A CN 202111669894A CN 114328461 A CN114328461 A CN 114328461A
Authority
CN
China
Prior art keywords
enterprise
data
value
sample
index item
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202111669894.XA
Other languages
Chinese (zh)
Other versions
CN114328461B (en
Inventor
李奇陵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kechuangtong Chengdu Co ltd
Original Assignee
Kechuangtong Chengdu Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kechuangtong Chengdu Co ltd filed Critical Kechuangtong Chengdu Co ltd
Priority to CN202111669894.XA priority Critical patent/CN114328461B/en
Publication of CN114328461A publication Critical patent/CN114328461A/en
Application granted granted Critical
Publication of CN114328461B publication Critical patent/CN114328461B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本发明提供的基于大数据分析的企业创新与成长能力的评价方法及系统,涉及定量评价领域,其方法包括:收集样本企业各企业维度的样本数据,对样本数据进行数据清洗、计算和标签分类;以清洗分类后的样本数据训练指标权重模型,以便获取指标项的相对重要性分数和优序图计算各指标项的最终权重值;根据各待评价企业的企业数据结合各指标项的最终权重值分别计算各待评价企业的最终评价分数;其中,企业维度包括工商、员工、资质、知识产权、项目、融资贷款、资产及研发费用。本发明通过将科技型企业的无形资产,包括创新能力和科技项目进行精准可靠的量化评估计算占比权重,准确衡量企业的成长能力,为企业未来的发展规划及决策提供数据支撑。

Figure 202111669894

The method and system for evaluating the innovation and growth capability of an enterprise based on big data analysis provided by the present invention relate to the field of quantitative evaluation. ;Train the index weight model with the sample data after cleaning and classification, so as to obtain the relative importance score of the index item and the priority map to calculate the final weight value of each index item; Combine the final weight of each index item according to the enterprise data of each enterprise to be evaluated The final evaluation score of each enterprise to be evaluated is calculated separately; among them, the enterprise dimension includes industry and commerce, employees, qualifications, intellectual property, projects, financing loans, assets and R&D expenses. The invention accurately and reliably measures the growth ability of the enterprise by performing accurate and reliable quantitative evaluation on the intangible assets of the scientific and technological enterprise, including innovation ability and scientific and technological projects, and calculates the proportion weight, and provides data support for the future development planning and decision-making of the enterprise.

Figure 202111669894

Description

一种基于大数据分析的企业创新与成长能力的评价方法及 系统An evaluation method and system for enterprise innovation and growth ability based on big data analysis

技术领域technical field

本发明涉及定量评价领域,具体涉及一种基于大数据分析的企业创新与成长能力的评价方法及系统。The invention relates to the field of quantitative evaluation, in particular to a method and system for evaluating the innovation and growth capability of an enterprise based on big data analysis.

背景技术Background technique

传统的基于大数据分析的企业创新与成长能力的评价方法主要是财务分析法,通过企业的财务数据、商业模式和商业计划进行分析,如资产负债情况、销售利润情况等,这一类企业成长能力评估方法只能在企业融资、审计或者发布财报时才能进行评估或更新,一般面向规模较大的企业或者成立时间较久的企业;对成立时间较短的创业型科技企业,他们的特点是经营时间较短、财务数据较少、企业无形资产比重较大,这类企业再以财务分析法进行企业成长能力评估就不适用了。The traditional evaluation method of enterprise innovation and growth ability based on big data analysis is mainly financial analysis method, which analyzes the financial data, business model and business plan of the enterprise, such as assets and liabilities, sales profit, etc., this type of enterprise grows. The capability evaluation method can only be evaluated or updated when the company raises funds, audits or releases financial reports, and is generally aimed at larger-scale companies or companies that have been established for a long time; for entrepreneurial technology companies that have been established for a short time, their characteristics are: With a short operating time, less financial data, and a large proportion of intangible assets, it is not applicable for such enterprises to use the financial analysis method to evaluate the growth ability of the enterprise.

另一种企业评价技术是企业画像,但是企业画像只是从不同维度把企业已有数据展示出来,并没有对其进行深入分析,也没有将企业的经营行为进行量化评估。Another enterprise evaluation technology is enterprise portrait, but enterprise portrait only displays the existing data of the enterprise from different dimensions, and does not conduct in-depth analysis of it, nor does it quantitatively evaluate the business behavior of the enterprise.

由于不能准确可靠的对无形资产比重较大的科技型中小企业进行有效的成长能力评估,这类企业在获取融资贷款、政策支持等方面将会非常困难,不利于企业的发展壮大和科技创新。Due to the inability to accurately and reliably assess the growth capability of technology-based SMEs with a large proportion of intangible assets, it will be very difficult for such enterprises to obtain financing loans and policy support, which is not conducive to the development and growth of enterprises and technological innovation.

发明内容SUMMARY OF THE INVENTION

本发明目的在于提供一种基于大数据分析的企业创新与成长能力的评价方法及系统,通过把企业经营中不可量化的创新能力、成果转化能力等进行量化评价,使得企业可根据评价结果中各维度的评价情况进行相关的加强,在政策施行或者企业投融资时可根据此评价结构获取到更切合的数据支撑和投资项目。The purpose of the present invention is to provide an evaluation method and system for enterprise innovation and growth ability based on big data analysis. The evaluation of the dimensions is strengthened, and more suitable data support and investment projects can be obtained according to this evaluation structure during policy implementation or enterprise investment and financing.

为达成上述目的,本发明提出如下技术方案:一种基于大数据分析的企业创新与成长能力的评价方法,包括:In order to achieve the above-mentioned purpose, the present invention proposes the following technical solution: a method for evaluating the innovation and growth capability of an enterprise based on big data analysis, including:

收集样本企业各企业维度的样本数据,按照时间序列对样本数据进行数据清洗、计算和标签分类;其中,企业维度包括工商、员工、资质、知识产权、项目、融资贷款、资产及研发费用;Collect the sample data of each enterprise dimension of the sample enterprise, and perform data cleaning, calculation and label classification on the sample data according to the time series; among which, the enterprise dimension includes industry and commerce, employees, qualifications, intellectual property rights, projects, financing loans, assets and R&D expenses;

采用样本企业清洗后的样本数据和标签分类,训练指标权重模型,以便根据指标权重模型获取指标项的相对重要性分数和优序图,进而计算各指标项的最终权重值;Using the cleaned sample data and label classification of the sample companies, the index weight model is trained, so as to obtain the relative importance score and priority map of the index items according to the index weight model, and then calculate the final weight value of each index item;

根据各待评价企业的企业数据结合各指标项的最终权重值分别计算各待评价企业各指标项的最终评价分数和排名;其中,企业数据为待评价企业进行数据清洗后的各企业维度的样本数据。According to the enterprise data of each enterprise to be evaluated and the final weight value of each index item, the final evaluation score and ranking of each index item of each to-be-evaluated enterprise are calculated respectively; wherein, the enterprise data is a sample of each enterprise dimension after data cleaning of the to-be-evaluated enterprise data.

进一步的,所述按照时间序列对样本数据进行数据清洗、计算和标签分类的过程包括:Further, the process of performing data cleaning, calculation and label classification on the sample data according to the time series includes:

获取样本企业的工商信息,以及近三年每一年的研究人员数量和学历占比、资质认证信息、I类知识产权的数量、II类知识产权的数量、项目申报信息、贷款金额、融资数据、研发费用和技术合同信息,其中,贷款金额包括最大贷款金额、最小贷款金额、平均贷款金额和贷款总额;Obtain the industrial and commercial information of the sample companies, as well as the number of researchers and the proportion of educational backgrounds in each of the past three years, qualification certification information, the number of Class I intellectual property rights, the number of Class II intellectual property rights, project application information, loan amount, and financing data , R&D expenses and technical contract information, where the loan amount includes the maximum loan amount, the minimum loan amount, the average loan amount and the total loan amount;

清洗和归一化上述获取数据的内容,替换中文内容为数字表示,获得数值型企业数据;Clean and normalize the content of the obtained data above, replace the Chinese content with digital representation, and obtain numerical enterprise data;

对数值型企业数据进行标签分类确定各指标项,将各指标项的数据放入企业数据矩阵的对象中,并分别计算样本企业在各指标项下的数据值;Label and classify the numerical enterprise data to determine each index item, put the data of each index item into the object of the enterprise data matrix, and calculate the data value of the sample enterprise under each index item respectively;

计算所有样本企业在各指标项下数据值的总和、平均值、最大值、最小值、25%值和75%值,并处理异常数据值。Calculate the sum, average, maximum value, minimum value, 25% value and 75% value of the data values of all sample companies under each indicator, and deal with abnormal data values.

进一步的,所述指标项的最终权重值的计算过程为:Further, the calculation process of the final weight value of the index item is:

预定义一个多级的数据分类标签作为计算指标项;Predefine a multi-level data classification label as the calculation index item;

接收样本企业的所有样本数据,利用机器学习xgboost库的梯度提升决策树算法提取每一个样本数据的特征并进行分类排序分组,结合输入的计算指标项进行softmax损失函数加正则项计算,逐个获取每个指标项的相对重要性分数和优序图;Receive all the sample data of the sample enterprise, use the gradient boosting decision tree algorithm of the machine learning xgboost library to extract the features of each sample data and classify, sort and group, and combine the input calculation index items to calculate the softmax loss function plus the regular term, and obtain each sample data one by one. The relative importance score and ranking diagram of each index item;

将指标项的相对重要性分数与预设对应指标项占比分数和的均值作为该指标项的最终权重值。The average value of the relative importance score of the index item and the proportion score of the preset corresponding index item is taken as the final weight value of the index item.

进一步的,所述各待评价企业各指标项的最终评价分数的计算过程为:Further, the calculation process of the final evaluation score of each index item of each enterprise to be evaluated is as follows:

获取各待评价企业的各企业维度的样本数据并进行数据处理,获得各待评价企业的企业数据矩阵;Obtain the sample data of each enterprise dimension of each to-be-evaluated enterprise and process the data to obtain the enterprise data matrix of each to-be-evaluated enterprise;

划分指标权重模型计算的各指标项为五组评价维度,包括内生创新、协作引资、管理运营能力、成果转化能力和持续成长能力,将每一组评价维度内任一指标项在企业数据矩阵中对应的数值与其对应的最终权重值进行矩阵点乘,获得对应指标项的评价分数;Each index item calculated by the index weight model is divided into five groups of evaluation dimensions, including endogenous innovation, collaborative investment, management and operation ability, achievement transformation ability and sustainable growth ability. Perform matrix dot product on the corresponding value in and its corresponding final weight value to obtain the evaluation score of the corresponding index item;

根据正态分布特征,将各组评价维度的分数转为100分制结果作为企业指标项的最终评价分数。According to the characteristics of normal distribution, the scores of each group of evaluation dimensions are converted into a 100-point system as the final evaluation score of the enterprise index item.

进一步的,所述样本企业的指标项包括员工总数、研究人员占比、大专以上学历占比、研究生占比、企业资质数量、产学研资质、下辖机构数量、研发活动数量、研发费用占比、企业前年研发费用、去年研发费用、前年立项费用、去年立项费用、项目金额最大值、项目金额最小值、项目金额均值、项目金额总数额、获批项目总次数、申请项目总次数、股权投资总数额、股权投资总次数、前年债权总额、去年债权总额、债权金额最大值、债权金额最小值、债权金额均值、债权金额总数额、获得债权总次数、前年总资产、去年总资产、前年净资产、去年净资产、前年主营业务收入、去年主营业务收入、前年成本费用、去年成本费用、前年净利润、去年净利润、前年上缴利税、去年上缴利税、前年主营业务占比、去年主营业务占比、前年净利润率、去年净利润率、管理认证、I类知识产权、II类知识产权、前年技术合同数量、去年技术合同数量、最大技术合同技术交易额、最小技术合同技术交易额、平均技术合同技术交易额、技术合同技术交易额总额、技术合同技术交易额总次数、国家科技奖励数量、形成标准数量、科技成果转化数量、高新技术产品数量、高新技术收入、上年度总资产增长率、净资产增长率、主营业务收入增长率、成本费用变化率、净利润增长率、上缴利税增长率、研发费用增长率、主营业务占比变化率、净利润率增长率、研发费用占比增长率、知识产权数量增长率和研发人员占比增长率。Further, the indicator items of the sample companies include the total number of employees, the proportion of researchers, the proportion of college education, the proportion of postgraduates, the number of enterprise qualifications, the qualifications of production, education and research, the number of subordinate institutions, the number of research and development activities, and the proportion of research and development expenses. , the company's research and development expenses in the previous year, last year's research and development expenses, the previous year's project establishment cost, the last year's project establishment cost, the maximum project amount, the minimum project amount, the average project amount, the total project amount, the total number of approved projects, the total number of applications for projects, and equity investment Total amount, total number of equity investments, total amount of claims in the previous year, total amount of claims in the previous year, maximum value of claims, minimum value of claims, average value of claims, total amount of claims, total number of claims obtained, total assets in the previous year, total assets in the previous year, net in the previous year Assets, last year's net assets, last year's main business income, last year's main business income, last year's cost, last year's cost, last year's net profit, last year's net profit, previous year's profits and taxes, last year's profits and taxes, the proportion of the previous year's main business, last year Proportion of main business, net profit margin of the previous year, net profit margin of the previous year, management certification, intellectual property rights of type I, intellectual property rights of type II, the number of technical contracts in the previous year, the number of technical contracts in the previous year, the largest technical transaction volume of technical contracts, and the smallest technical contract technology Transaction value, average technical transaction value of technical contracts, total technical transaction value of technical contracts, total number of technical transactions of technical contracts, number of national science and technology awards, number of established standards, number of transformation of scientific and technological achievements, number of high-tech products, high-tech income, previous year Growth rate of total assets, growth rate of net assets, growth rate of main business income, rate of change of costs and expenses, growth rate of net profit, growth rate of profits and taxes paid, growth rate of R&D expenses, rate of change of main business proportion, growth rate of net profit rate , the growth rate of the proportion of R&D expenses, the growth rate of the number of intellectual property rights, and the growth rate of the proportion of R&D personnel.

进一步的,所述异常数据值的处理方法为:对于占比类数据不在0~1范围的数据值,大于1时的设置为相应项的75%值,小于1时的设置为相应项的25%值;对于增长率大于20的数据设置为相应项的75%值,增长率小于-20的数据设置为相应项的25%值。Further, the processing method of the abnormal data value is as follows: for the data value whose proportion data is not in the range of 0 to 1, when it is greater than 1, it is set as 75% of the value of the corresponding item, and when it is less than 1, it is set as 25% of the corresponding item. % value; for data with a growth rate greater than 20, it is set to the 75% value of the corresponding item, and data with a growth rate less than -20 is set to the 25% value of the corresponding item.

本发明另一方案在于公开一种基于大数据分析的企业创新与成长能力的评价系统,该系统包括:Another solution of the present invention is to disclose an evaluation system for enterprise innovation and growth capability based on big data analysis, the system comprising:

企业数据收集处理模块,用于收集样本企业各企业维度的样本数据,按照时间序列对样本数据进行数据清洗、计算和标签分类;其中,企业维度包括工商、员工、资质、知识产权、项目、融资贷款、资产及研发费用;The enterprise data collection and processing module is used to collect sample data of each enterprise dimension of the sample enterprise, and perform data cleaning, calculation and label classification of the sample data according to the time series; among which, the enterprise dimension includes business, employee, qualification, intellectual property, project, financing Loans, assets and research and development expenses;

指标权重模块,用于根据样本企业清洗后的样本数据和标签分类,训练指标权重模型,以便根据指标权重模型获取指标项的相对重要性分数和优序图,进而计算各指标项的最终权重值;The indicator weight module is used to train the indicator weight model according to the cleaned sample data and label classification of the sample enterprises, so as to obtain the relative importance score and priority map of the indicator items according to the indicator weight model, and then calculate the final weight value of each indicator item. ;

分数赋予模块,用于根据各待评价企业的企业数据结合各指标项的最终权重值分别计算各待评价企业各指标项的最终评价分数和排名。The score assigning module is used to calculate the final evaluation score and ranking of each index item of each to-be-evaluated enterprise according to the enterprise data of each to-be-evaluated enterprise and the final weight value of each index item.

进一步的,所述企业数据收集处理模块包括:Further, the enterprise data collection and processing module includes:

第一获取单元,用于获取样本企业的工商信息,以及近三年每一年的研究人员数量和学历占比、资质认证信息、I类知识产权的数量、II类知识产权的数量、项目申报信息、贷款金额、融资数据、研发费用和技术合同信息,其中,贷款金额包括最大贷款金额、最小贷款金额、平均贷款金额和贷款总额;The first acquisition unit is used to obtain the industrial and commercial information of the sample enterprises, as well as the number of researchers and the proportion of educational backgrounds in each of the past three years, information on qualification certification, the number of Class I intellectual property rights, the number of Class II intellectual property rights, and project declarations Information, loan amount, financing data, research and development expenses and technical contract information, wherein the loan amount includes the maximum loan amount, the minimum loan amount, the average loan amount and the total loan amount;

清洗及归一化单元,用于清洗和归一化获取单元获取的数据,包括删去重复数据、评价不相关数据和替换中文内容为数字表示,获得数值型企业数据;The cleaning and normalization unit is used to clean and normalize the data acquired by the acquisition unit, including deleting duplicate data, evaluating irrelevant data, and replacing Chinese content with digital representation to obtain numerical enterprise data;

分类单元,用于对数值型企业数据进行标签分类确定各指标项,将各指标项的数据放入企业数据矩阵的对象中,并分别计算样本企业在各指标项下的数据值;The classification unit is used to label and classify the numerical enterprise data to determine each index item, put the data of each index item into the object of the enterprise data matrix, and calculate the data value of the sample enterprise under each index item respectively;

计算单元,用于计算所有样本企业在各指标项下数据值的总和、平均值、最大值、最小值、25%值和75%值,并处理异常数据值。The calculation unit is used to calculate the sum, average, maximum value, minimum value, 25% value and 75% value of data values of all sample enterprises under each index item, and process abnormal data values.

进一步的,所述指标权重模块包括:Further, the indicator weight module includes:

预定义单元,用于预定义一个多级的数据分类标签作为计算指标项;The predefined unit is used to predefine a multi-level data classification label as the calculation index item;

机器学习单元,用于接收样本企业的所有样本数据,利用机器学习xgboost库的梯度提升决策树算法提取每一个样本数据的特征并进行分类排序分组,结合输入的计算指标项进行softmax损失函数加正则项计算,逐个获取每个指标项的相对重要性分数和优序图;The machine learning unit is used to receive all the sample data of the sample enterprise, use the gradient boosting decision tree algorithm of the machine learning xgboost library to extract the features of each sample data and classify, sort and group, and combine the input calculation index items to perform softmax loss function plus regularization Item calculation, obtain the relative importance score and ranking diagram of each index item one by one;

权重计算单元,用于计算指标项的相对重要性分数与预设对应指标项占比分数和的均值,并以该均值作为该指标项的最终权重值。The weight calculation unit is configured to calculate the mean value of the relative importance score of the index item and the proportion score of the preset corresponding index item, and use the mean value as the final weight value of the index item.

进一步的,所述分数赋予模块,包括:Further, the score assignment module includes:

第二获取单元,用于获取各待评价企业的各企业维度的样本数据并进行数据处理,获得各待评价企业的企业数据矩阵;The second obtaining unit is used to obtain sample data of each enterprise dimension of each enterprise to be evaluated and perform data processing to obtain the enterprise data matrix of each enterprise to be evaluated;

划分评价单元,用于划分指标权重模型计算的各指标项为五组评价维度,包括内生创新、协作引资、管理运营能力、成果转化能力和持续成长能力,将每一组评价维度内任一指标项在企业数据矩阵中对应的数值与其对应的最终权重值进行矩阵点乘,获得对应指标项的评价分数;The evaluation unit is divided, and each index item used for the calculation of the index weight model is divided into five groups of evaluation dimensions, including endogenous innovation, collaborative investment, management and operation ability, achievement transformation ability and sustainable growth ability. The corresponding value of the index item in the enterprise data matrix is multiplied by the matrix dot with its corresponding final weight value to obtain the evaluation score of the corresponding index item;

分值转换单元,用于根据正态分布特征,将各组评价维度的分数转为100分制结果作为企业指标项的最终评价分数。The score conversion unit is used to convert the scores of each group of evaluation dimensions into a 100-point system according to the normal distribution characteristics as the final evaluation score of the enterprise index item.

由以上技术方案可知,本发明的技术方案,获得了如下有益效果:As can be seen from the above technical solutions, the technical solutions of the present invention have obtained the following beneficial effects:

本发明公开的基于大数据分析的企业创新与成长能力的评价方法及系统,不仅考虑企业的财务指标还通过对企业的无形资产,如创新能力和科技项目等进行精准的量化评估,准确可靠的衡量企业成长能力。具体的,其方法包括:收集样本企业各企业维度的样本数据,按照时间序列对样本数据进行数据清洗、计算和标签分类,其中,企业维度包括工商、员工、资质、知识产权、项目、融资贷款、资产及研发费用;采用样本企业清洗后的样本数据和标签分类,训练指标权重模型,以便根据指标权重模型获取指标项的相对重要性分数和优序图,进而计算各指标项的最终权重值;根据各待评价企业的企业数据结合各指标项的最终权重值分别计算各待评价企业各指标项的最终评价分数;该最终评价分数围绕五个方面,即内生创新、协作引资、管理运营能力、成果转化能力和持续成长能力。The method and system for evaluating the innovation and growth ability of an enterprise based on big data analysis disclosed in the present invention not only considers the financial indicators of the enterprise, but also conducts accurate quantitative evaluation on the intangible assets of the enterprise, such as innovation ability and scientific and technological projects. Measure the ability of a business to grow. Specifically, the method includes: collecting sample data of each enterprise dimension of the sample enterprise, and performing data cleaning, calculation and label classification on the sample data according to the time series, wherein the enterprise dimension includes industry and commerce, employees, qualifications, intellectual property rights, projects, financing loans , assets and R&D expenses; use the sample data and label classification after cleaning from the sample companies to train the index weight model, so as to obtain the relative importance score and priority map of the index items according to the index weight model, and then calculate the final weight value of each index item ;According to the enterprise data of each enterprise to be evaluated and the final weight value of each index item, the final evaluation score of each index item of each to-be-evaluated enterprise is calculated respectively; the final evaluation score is based on five aspects, namely endogenous innovation, collaborative investment, management and operation Ability, achievement transformation ability and continuous growth ability.

本发明获得的企业最终评价分数,相较侧重财务指标项的财务分析法更适用于中小微企业进行企业成长能力评估,为企业未来的发展规划、政策适配及决策提供数据支撑;并且相较于企业画像,不仅能从不同维度展示企业的发展数据,而且能更深入的分析企业的经营行为、创新能力对企业成长能力的影响。The final evaluation score of the enterprise obtained by the present invention is more suitable for small, medium and micro enterprises to evaluate the growth ability of the enterprise compared with the financial analysis method that focuses on financial index items, and provides data support for the future development planning, policy adaptation and decision-making of the enterprise; For enterprise portraits, it can not only display the development data of the enterprise from different dimensions, but also analyze the impact of the enterprise's business behavior and innovation ability on the enterprise's growth ability more deeply.

应当理解,前述构思以及在下面更加详细地描述的额外构思的所有组合只要在这样的构思不相互矛盾的情况下都可以被视为本公开的发明主题的一部分。It is to be understood that all combinations of the foregoing concepts, as well as additional concepts described in greater detail below, are considered to be part of the inventive subject matter of the present disclosure to the extent that such concepts are not contradictory.

结合附图从下面的描述中可以更加全面地理解本发明教导的前述和其他方面、实施例和特征。本发明的其他附加方面例如示例性实施方式的特征和/或有益效果将在下面的描述中显见,或通过根据本发明教导的具体实施方式的实践中得知。The foregoing and other aspects, embodiments and features of the present teachings can be more fully understood from the following description when taken in conjunction with the accompanying drawings. Other additional aspects of the invention, such as features and/or benefits of the exemplary embodiments, will be apparent from the description below, or learned by practice of specific embodiments in accordance with the teachings of this invention.

附图说明Description of drawings

附图不意在按比例绘制。在附图中,在各个图中示出的每个相同或近似相同的组成部分可以用相同的标号表示。为了清晰起见,在每个图中,并非每个组成部分均被标记。现在,将通过例子并参考附图来描述本发明的各个方面的实施例,其中:The drawings are not intended to be drawn to scale. In the drawings, each identical or nearly identical component that is illustrated in various figures may be represented by the same reference numeral. For clarity, not every component is labeled in every figure. Embodiments of various aspects of the present invention will now be described by way of example and with reference to the accompanying drawings, wherein:

图1为本发明基于大数据分析的企业创新与成长能力的评价方法的流程图;Fig. 1 is the flow chart of the evaluation method of enterprise innovation and growth ability based on big data analysis of the present invention;

图2为实施例中样本数据清洗准备流程图;Fig. 2 is the flow chart of sample data cleaning preparation in the embodiment;

图3为实施例中样本数据清洗流程图一;Fig. 3 is the sample data cleaning flow chart 1 in the embodiment;

图4为实施例中样本数据清洗流程图二;4 is a second flow chart of sample data cleaning in the embodiment;

图5为实施例中生成评价指标的每个评价维度的权重。FIG. 5 shows the weight of each evaluation dimension for generating the evaluation index in the embodiment.

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例的附图,对本发明实施例的技术方案进行清楚、完整地描述。显然,所描述的实施例是本发明的一部分实施例,而不是全部的实施例。基于所描述的本发明的实施例,本领域普通技术人员在无需创造性劳动的前提下所获得的所有其他实施例,都属于本发明保护的范围。除非另作定义,此处使用的技术术语或者科学术语应当为本发明所属领域内具有一般技能的人士所理解的通常意义。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings of the embodiments of the present invention. Obviously, the described embodiments are some, but not all, embodiments of the present invention. Based on the described embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of the present invention. Unless otherwise defined, technical or scientific terms used herein should have the ordinary meaning as understood by one of ordinary skill in the art to which this invention belongs.

本发明专利申请说明书以及权利要求书中使用的“第一”、“第二”以及类似的词语并不表示任何顺序、数量或者重要性,而只是用来区分不同的组成部分。同样,除非上下文清楚地指明其它情况,否则单数形式的“一个”“一”或者“该”等类似词语也不表示数量限制,而是表示存在至少一个。“包括”或者“包含”等类似的词语意指出现在“包括”或者“包含”前面的元件或者物件涵盖出现在“包括”或者“包含”后面列举的特征、整体、步骤、操作、元素和/或组件,并不排除一个或多个其它特征、整体、步骤、操作、元素、组件和/或其集合的存在或添加。The terms "first", "second" and similar terms used in the description of the patent application and the claims of the present invention do not denote any order, quantity or importance, but are only used to distinguish different components. Also, unless the context clearly dictates otherwise, the singular forms "a," "an," or "the" and the like do not denote a limitation of quantity, but rather denote the presence of at least one. Words like "including" or "comprising" mean that the elements or items appearing before "including" or "including" cover the features, integers, steps, operations, elements and/or recited after "including" or "including" or components, does not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or sets thereof.

基于现有技术中对企业成长能力的评价方法主要是财务分析法,侧重企业的财务指标,导致该方法评估的企业成长能力只能在企业融资、审计或者发布财报时使用,并且只能适用于规模较大或者成立时间较久的企业,对于中小微企业和成立时间较短的创业型科技企业并不适用,原因在于这类企业在其经营活动中创新能力、成果转化能力在该评价方法中未进行量化评估。因此,本发明旨在于提出一种基于大数据分析的企业创新与成长能力的评价方法及系统,除财务指标等常见指标外还将企业无形资产不可量化的创新能力、成果转化能力等进行量化评估,准确可靠的评估企业成长能力,为企业未来发展及决策提供数据支持。Based on the existing technology, the evaluation method of enterprise growth ability is mainly financial analysis method, which focuses on the financial indicators of the enterprise. As a result, the enterprise growth ability evaluated by this method can only be used in enterprise financing, auditing or issuing financial reports, and can only be applied to Large-scale or long-established enterprises are not applicable to small and medium-sized micro-enterprises and entrepreneurial technology enterprises that have been established for a short time, because the innovation ability and achievement transformation ability of such enterprises in their business activities are included in this evaluation method. Quantitative assessments were not performed. Therefore, the present invention aims to propose a method and system for evaluating the innovation and growth capability of an enterprise based on big data analysis, which, in addition to common indicators such as financial indicators, also quantitatively evaluates the non-quantifiable innovation capability and achievement transformation capability of the intangible assets of the enterprise. , to accurately and reliably evaluate the growth ability of the enterprise, and provide data support for the future development and decision-making of the enterprise.

下面结合附图所示的实施例,对本发明公开的基于大数据分析的企业创新与成长能力的评价方法及系统做进一步具体介绍。The method and system for evaluating the innovation and growth capability of an enterprise based on big data analysis disclosed in the present invention will be further described in detail below with reference to the embodiments shown in the accompanying drawings.

结合图1所示的流程图,本申请实施例公开的基于大数据分析的企业创新与成长能力的评价方法,执行时包括如下步骤:With reference to the flowchart shown in FIG. 1 , the method for evaluating the innovation and growth capability of an enterprise based on big data analysis disclosed in the embodiment of the present application includes the following steps during execution:

步骤S102,收集样本企业各企业维度的样本数据,按照时间序列对样本数据进行数据清洗、计算和标签分类;其中,企业维度包括工商、员工、资质、知识产权、项目、融资贷款、资产及研发费用;Step S102, collect the sample data of each enterprise dimension of the sample enterprise, and perform data cleaning, calculation and label classification on the sample data according to the time series; wherein, the enterprise dimension includes industry and commerce, employees, qualifications, intellectual property, projects, financing loans, assets and research and development cost;

步骤S102的具体执行过程为:首先,获取样本企业的工商信息,以及近三年每一年的研究人员数量和学历占比、资质认证信息、I类知识产权的数量、II类知识产权的数量、项目申报信息、贷款金额、融资数据、研发费用和技术合同信息,其中,贷款金额包括最大贷款金额、最小贷款金额、平均贷款金额和贷款总额;然后,清洗和归一化上述获取数据的内容,替换中文内容为数字表示,获得数值型企业数据;其次,对数值型企业数据进行标签分类确定各指标项,将各指标项的数据放入企业数据矩阵的对象中,并分别计算样本企业在各指标项下的数据值;最后,计算所有样本企业在各指标项下数据值的总和、平均值、最大值、最小值、25%值和75%值,并处理异常数据值。The specific execution process of step S102 is as follows: first, obtain the industrial and commercial information of the sample enterprise, as well as the number of researchers and the proportion of educational backgrounds, qualification certification information, the number of Type I intellectual property rights, and the number of Type II intellectual property rights in each of the past three years. , project application information, loan amount, financing data, R&D expenses, and technical contract information, where the loan amount includes the maximum loan amount, the minimum loan amount, the average loan amount, and the total loan amount; then, the content of the obtained data is cleaned and normalized. , replace the Chinese content with digital representation, and obtain numerical enterprise data; secondly, label and classify the numerical enterprise data to determine each index item, put the data of each index item into the object of the enterprise data matrix, and separately calculate the sample enterprises in Data values under each indicator; finally, calculate the sum, average, maximum, minimum, 25% and 75% of the data values of all sample companies under each indicator, and deal with abnormal data values.

上述的样本企业的指标项包括员工总数、研究人员占比、大专以上学历占比、研究生占比、企业资质数量、产学研资质、下辖机构数量、研发活动数量、研发费用占比、企业前年研发费用、去年研发费用、前年立项费用、去年立项费用、项目金额最大值、项目金额最小值、项目金额均值、项目金额总数额、获批项目总次数、申请项目总次数、股权投资总数额、股权投资总次数、前年债权总额、去年债权总额、债权金额最大值、债权金额最小值、债权金额均值、债权金额总数额、获得债权总次数、前年总资产、去年总资产、前年净资产、去年净资产、前年主营业务收入、去年主营业务收入、前年成本费用、去年成本费用、前年净利润、去年净利润、前年上缴利税、去年上缴利税、前年主营业务占比、去年主营业务占比、前年净利润率、去年净利润率、管理认证、I类知识产权、II类知识产权、前年技术合同数量、去年技术合同数量、最大技术合同技术交易额、最小技术合同技术交易额、平均技术合同技术交易额、技术合同技术交易额总额、技术合同技术交易额总次数、国家科技奖励数量、形成标准数量、科技成果转化数量、高新技术产品数量、高新技术收入、上年度总资产增长率、净资产增长率、主营业务收入增长率、成本费用变化率、净利润增长率、上缴利税增长率、研发费用增长率、主营业务占比变化率、净利润率增长率、研发费用占比增长率、知识产权数量增长率和研发人员占比增长率。The indicators of the above sample companies include the total number of employees, the proportion of researchers, the proportion of college graduates, the proportion of graduate students, the number of enterprise qualifications, the qualifications of production, education and research, the number of subordinate institutions, the number of research and development activities, the proportion of research and development expenses, the year before the company. R&D expenses, last year's R&D expenses, last year's project establishment expenses, last year's project establishment expenses, maximum project amount, minimum project amount, average project amount, total project amount, total number of approved projects, total number of applications for projects, total equity investment, Total number of equity investments, total amount of claims in the previous year, total amount of claims last year, maximum value of claims, minimum value of claims, average value of claims, total amount of claims, total number of claims obtained, total assets in the previous year, total assets in the previous year, net assets in the previous year, last year Net assets, main business income in the previous year, main business income in the previous year, cost in the previous year, cost in the previous year, net profit in the previous year, net profit in the previous year, profits and taxes paid in the previous year, profits and taxes paid in the previous year, the proportion of the main business in the previous year, and the main business in the previous year Proportion, net profit margin of the previous year, net profit margin of the previous year, management certification, intellectual property rights of type I, intellectual property rights of type II, the number of technical contracts in the previous year, the number of technical contracts in the last year, the largest technical transaction value of technical contracts, the smallest technical transaction value of technical contracts, Average technical transaction value of technical contracts, total technical transaction value of technical contracts, total number of technical transactions of technical contracts, number of national science and technology awards, number of established standards, number of transformation of scientific and technological achievements, number of high-tech products, high-tech income, total asset growth in the previous year rate, growth rate of net assets, growth rate of main business income, rate of change of cost and expense, growth rate of net profit, growth rate of profits and taxes paid, growth rate of research and development expenses, rate of change of main business proportion, rate of change of net profit rate, research and development expenses Proportion growth rate, intellectual property quantity growth rate, and R&D personnel ratio growth rate.

样本企业各企业维度样本数据的获取、清洗、计算和分类过程至少包括如图2至图4所示的3个阶段。The process of acquiring, cleaning, calculating and classifying the sample data of each enterprise dimension of the sample enterprise includes at least three stages as shown in Figures 2 to 4.

如图2所示,前期进入数据准备阶段,处理营业收入数据,从数据库导出所有样本企业用于评价指标模型需要的相关数据,该相关数据至少涵盖上述8个企业维度,该相关数据形成“全部企业.xls”文档存储于内存中,备用;从“全部企业.xls”文档中获取所有不重复的样本企业名称的集合,对该集合中任一样本企业:先获取企业最新的一条科创贷数据,作为该企业在本评价方法中的数据基础;然后计算该企业近三年来1)获取的最大、最小、平均和总科创贷放贷金额、2)每一年研究人员的平均占比、3)每一年I类知识产权的平均数量、4)每一年II类知识产权的平均数量,形成数据对象CA;对集合中任一样本企业合并企业最新的一条科创贷数据和其对应的数据对象CA构成新的企业数据并转存为第一步初始表格“date1.xls”文件中。As shown in Figure 2, the data preparation stage is entered in the early stage, the operating income data is processed, and the relevant data required by all sample companies to evaluate the index model is derived from the database. "Enterprise.xls" file is stored in memory for backup; obtain a collection of all unique sample company names from the "All-enterprise.xls" file, and for any sample company in the collection: first obtain the company's latest science and technology loan The data is used as the data basis for the enterprise in this evaluation method; then calculate the 1) maximum, minimum, average and total amount of science and technology innovation loans obtained by the enterprise in the past three years, 2) the average proportion of researchers in each year, 3) The average number of Class I intellectual property rights in each year, 4) The average number of Class II intellectual property rights in each year, forming the data object CA; the latest piece of science and technology loan data for any sample enterprise merger in the collection and its corresponding The data object CA constitutes the new enterprise data and is transferred to the initial table "date1.xls" file in the first step.

对第一步初始表格中的数据内容进行归一化处理,例如,将表格中研究人员占比、大专以上学历占比和员工总数列的中文去掉,并转化为float类型数据;将表格中表示是否是高企、技术先进企业、科技型中小企业的否/是转为0/1;以及,将该第一步初始表格中指定列的企业数据类型转换成float类型数据,并将每一个样本企业的企业信息和企业对应的企业数据合并到第一步结果表格“sheet1.xlsx”文件中。Normalize the data content in the initial table of the first step, for example, remove the Chinese in the columns of the proportion of researchers, the proportion of college degree or above, and the total number of employees in the table, and convert them into float type data; Whether it is a high-tech enterprise, a technologically advanced enterprise, or a technology-based small and medium-sized enterprise will be converted to 0/1; The enterprise information and the corresponding enterprise data of the enterprise are merged into the first step result table "sheet1.xlsx" file.

如图3所示,中期对样本企业的成果转换能力进行标签分类定量评估,包含获取五部分数据;其一为,第一步结果表格“sheet1.xlsx”文件;其二为,从第一步初始表格“date1.xls”文件获得到的关于若干资质认证信息的企业数据和企业的知识产权、研究人员数据;对于资质认证根据表格中分隔符整理所有企业包含的、不重复的认证类型,按认证类型转为类型列,若企业由此认证的列值记为1;其三为,自数据库中获取所有企业的股权融资数据,将该数据与上述的“全部企业.xls”文档所有相关数据取交集,获得第一交集数据,根据第一交集数据对任一样本企业计算企业获取到的投资次数和投资金额总额;其四为,自数据库中获取所有企业的技术合同数据,将该数据与上述的“全部企业.xls”文档所有相关数据取交集,获得第二交集数据,根据第二交集数据对任一样本企业依次计算企业近三年每一年的技术交易额和企业技术交易额最大、最小、平均、总额数据和次数;其五为,自数据库中获取所有企业的项目申报数据,将该数据与上述的“全部企业.xls”文档所有相关数据取交集,获得第三交集数据,根据第三交集数据对任一样本企业依次计算企业近三年每一年的项目立项金额和企业项目立项金额的最大、最小、平均、总额数据和次数。As shown in Figure 3, in the mid-term, the label classification and quantitative evaluation of the achievement conversion ability of the sample enterprises includes obtaining five parts of data; one is the result table "sheet1.xlsx" file of the first step; the second is, from the first step The enterprise data obtained from the initial form "date1.xls" file about certain qualification certification information and the intellectual property rights and researcher data of the enterprise; The certification type is converted to a type column. If the value of the column certified by the company is 1; the third is to obtain the equity financing data of all companies from the database, and use this data with all relevant data in the above "All companies.xls" document. Take the intersection to obtain the first intersection data, and calculate the investment times and total investment amount obtained by the enterprise for any sample enterprise according to the first intersection data; the fourth is to obtain the technical contract data of all enterprises from the database, and combine the data with Take the intersection of all relevant data in the above "All Enterprises.xls" document to obtain the second intersection data, and calculate the technology transaction volume of the enterprise in each of the past three years and the largest enterprise technology transaction volume for any sample enterprise in turn according to the second intersection data. , minimum, average, total data and times; the fifth is to obtain the project declaration data of all enterprises from the database, and intersect the data with all relevant data of the above "All enterprises.xls" document to obtain the third intersection data, According to the third intersection data, for any sample enterprise, calculate the project approval amount of the enterprise in each year of the past three years and the maximum, minimum, average, total data and times of the enterprise project approval amount.

最后,将上述五部分数据按照上述的企业维度合并,输出得到第二步结果表格“date99.xlsx”文件。Finally, combine the above five parts of data according to the above enterprise dimensions, and output the result table "date99.xlsx" file of the second step.

如图4所示,后期计算样本企业在各指标项下的数据值和异常数据值处理,该阶段对前期和中期的数据进行组合处理;即,将第一步结果表格“date1.xls”文件中的营业收入与第二步结果表格“date99.xlsx”文件合并,依次计算每个样本企业总共获取到的资助补贴金额(产学研),每个样本企业的资质认证数量(企业资质),每个企业获取到的管理认证数量(管理认证),样本企业的资产、利润、收入、缴税和研发费用增长率,样本企业人才、费用、主营业务收入、零利润等去/前年的占比,计算结果数据再与第二步结果表格“date99.xlsx”文件中的数据合并,记为对所有样本企业的指标项数据“bb_date”表格对象;对指标项数据“bb_date”表格对象中的每一列数据,分别计算该列数据的count、mean、std、min、25%、50%、75%、max、null_%(有数据的列行数占总行数的比例),计算的数值记录为“n_df”表格对象。As shown in Figure 4, the data values and abnormal data values of the sample enterprises under each indicator are calculated in the later stage, and the data in the early and mid-term are combined in this stage; that is, the first step result table "date1.xls" file The operating income in the second step is combined with the "date99.xlsx" file of the second step result table, and the total amount of subsidy and subsidies obtained by each sample enterprise (industry-university-research), the number of qualification certifications (enterprise qualification) of each sample enterprise, and the The number of management certifications (management certification) obtained by each enterprise, the growth rate of assets, profits, income, tax payment and R&D expenses of the sample enterprises, the proportion of talents, expenses, main business income, zero profits, etc. of the sample enterprises in the past/previous year , the calculation result data is merged with the data in the second step result table "date99.xlsx" file, and recorded as the "bb_date" table object of the indicator item data for all sample companies; For a column of data, calculate the count, mean, std, min, 25%, 50%, 75%, max, null_% of the column data (the ratio of the number of columns and rows with data to the total number of rows), and the calculated value is recorded as " n_df" table object.

最后,对指标项数据“bb_date”表格对象中的异常数据值进行处理,实施例提出的异常数据值的处理方法为:对于占比类数据不在0~1范围的数据值,大于1时的设置为相应项的75%值,小于1时的设置为相应项的25%值;对于增长率大于20的数据设置为相应项的75%值,增长率小于-20的数据设置为相应项的25%值;将部分超过最大值的标签列的值设置为相应的最大值,包括获批项目总次数、下辖机构数量、申请项目总次数、债权金额最大值、债权金额均值、债权金额总数额;异常数据值处理完毕后,将处理完后的数据存入第三步结果数据“data_72.xlsx”文件中。Finally, the abnormal data values in the table object of the indicator item data "bb_date" are processed. The processing method of the abnormal data values proposed in the embodiment is: for the data values whose proportion data is not in the range of 0 to 1, the setting when it is greater than 1 It is the 75% value of the corresponding item, and if it is less than 1, it is set to the 25% value of the corresponding item; for the data whose growth rate is greater than 20, it is set to the 75% value of the corresponding item, and the data whose growth rate is less than -20 is set to the 25% value of the corresponding item. % value; set the value of some label columns that exceed the maximum value to the corresponding maximum value, including the total number of approved projects, the number of subordinate institutions, the total number of application projects, the maximum amount of claims, the average value of claims, and the total amount of claims ; After the abnormal data value is processed, store the processed data in the third step result data "data_72.xlsx" file.

步骤S104,采用样本企业清洗后的样本数据和标签分类,训练指标权重模型,以便根据指标权重模型获取指标项的相对重要性分数和优序图,进而计算各指标项的最终权重值;Step S104, using the sample data cleaned by the sample enterprise and the label classification, and training the index weight model, so as to obtain the relative importance score and the priority map of the index items according to the index weight model, and then calculate the final weight value of each index item;

步骤S104的具体执行过程为:预定义一个多级的数据分类标签作为计算指标项;接收样本企业的所有样本数据,利用机器学习xgboost库的梯度提升决策树算法提取每一个样本数据的特征并进行分类排序分组,结合输入的计算指标项进行softmax损失函数加正则项计算,逐个获取每个指标项的相对重要性分数和优序图;将指标项的相对重要性分数与预设对应指标项占比分数和的均值作为该指标项的最终权重值。The specific execution process of step S104 is: predefining a multi-level data classification label as a calculation index item; receiving all the sample data of the sample enterprise, using the gradient boosting decision tree algorithm of the machine learning xgboost library to extract the characteristics of each sample data and carry out the process. Categorize and sort into groups, perform softmax loss function plus regular term calculation in combination with the input calculation index items, and obtain the relative importance score and priority map of each index item one by one; take the relative importance score of the index item and the preset corresponding index item to account for The average of the ratio scores and the sum is used as the final weight value of this indicator item.

图5示出了以项目金额总数额为计算指标项的最终权重值的计算过程。具体为,首先,将第三步结果数据“data_72.xlsx”中的项目金额总数额列复制为列Y,对列Y中的数据进行预处理,包括排除列Y中值为null的所有行、保留至少有50个非null列的行;其次,利用xgboost库的梯度提升决策树算法进行权重模型的训练,特征矩阵为列Y以外的数据,标签为列Y;随后,获取指标项的重要性分数结果,并把该结果与特征矩阵源的列名称绑定,即绑定结果为行名称为第三步结果数据“data_72.xlsx”的列名,值为重要性分数;再在绑定结果的基础上,附加特征矩阵源列索引编号,形成编号列1,对编号列1按照重要性分数进行升序排列;对编号列1附加特征矩阵源行索引编号,形成编号列2,并按照编号列1的排序,获取编号列2的列名称和数值。FIG. 5 shows the calculation process of calculating the final weight value of the index item by taking the total amount of the project amount as the calculation index item. Specifically, first, copy the column of total item amount in the result data "data_72.xlsx" of the third step as column Y, and preprocess the data in column Y, including excluding all rows with null values in column Y, Retain rows with at least 50 non-null columns; secondly, use the gradient boosting decision tree algorithm of the xgboost library to train the weight model, the feature matrix is data other than column Y, and the label is column Y; then, get the importance of the index item Score the result, and bind the result to the column name of the feature matrix source, that is, the binding result is the column name of the third step result data "data_72.xlsx", and the value is the importance score; then in the binding result On the basis of , add the feature matrix source column index number to form numbered column 1, and arrange the numbered column 1 in ascending order according to the importance score; add the feature matrix source row index number to the numbered column 1 to form the numbered column 2, and according to the numbered column Sort by 1, get the column name and value of the numbered column 2.

最后,合并编号列2和自预设的所有指标权重分配表“zonghe.xlsx”中读取的“综合权重”、“索引”列数据,计算指标项的最终权重值,计算方法为:(编号列2的数值+综合权重)/2,依次对企业维度的指标项进行计算,所有结果保存为权重数据“qz.xlsx”文件。Finally, merge the data in column No. 2 and the column data of "Comprehensive Weight" and "Index" read from the preset weight allocation table "zonghe.xlsx" for all indicators, and calculate the final weight value of the indicator item. The calculation method is: (No. The value in column 2 + comprehensive weight)/2, calculate the index items of the enterprise dimension in turn, and save all the results as the weight data "qz.xlsx" file.

步骤S106,根据各待评价企业的企业数据结合各指标项的最终权重值分别计算各待评价企业各指标项的最终评价分数和排名;其中,企业数据为待评价企业进行数据清洗后的各企业维度的样本数据。Step S106, calculate the final evaluation score and ranking of each index item of each to-be-evaluated enterprise according to the enterprise data of each to-be-evaluated enterprise and the final weight value of each index item; wherein, the enterprise data is each enterprise after data cleaning of the to-be-evaluated enterprise dimension of sample data.

步骤S108的具体执行过程为:获取各待评价企业的各企业维度的样本数据并进行数据处理,获得各待评价企业的企业数据矩阵;划分指标权重模型计算的各指标项为五组评价维度,包括内生创新、协作引资、管理运营能力、成果转化能力和持续成长能力,将每一组评价维度内任一指标项在企业数据矩阵中对应的数值与其对应的最终权重值进行矩阵点乘,获得对应指标项的评价分数;根据正态分布特征,将各组评价维度的分数转为100分制结果作为企业指标项的最终评价分数。The specific execution process of step S108 is: obtaining sample data of each enterprise dimension of each enterprise to be evaluated and performing data processing to obtain an enterprise data matrix of each to-be-evaluated enterprise; dividing each index item calculated by the index weight model into five groups of evaluation dimensions, Including endogenous innovation, collaborative investment, management and operation ability, achievement transformation ability and sustainable growth ability, the corresponding value of any index item in each group of evaluation dimensions in the enterprise data matrix and the corresponding final weight value are matrix dot multiplied, Obtain the evaluation scores of the corresponding index items; according to the normal distribution characteristics, convert the scores of each group of evaluation dimensions into a 100-point system as the final evaluation score of the enterprise index items.

本申请的实施例,还提供一种电子设备,该设备包括处理器和存储器,存储器用于存储程序指令,并将所述程序指令传输给处理器;当程序指令被处理器执行时,处理器运行上述实施例中的基于大数据分析的企业创新与成长能力的评价方法。The embodiment of the present application further provides an electronic device, the device includes a processor and a memory, the memory is used to store program instructions, and transmit the program instructions to the processor; when the program instructions are executed by the processor, the processor Run the method for evaluating the innovation and growth capability of an enterprise based on big data analysis in the above embodiment.

上述程序可以运行在处理器中,或者也可以存储在计算机可读存储介质中,计算机可读存储介质包括永久性和非永久性、可移动和非可移动媒体可以由任何方法或技术来实现信息存储。信息可以是计算机可读指令、数据结构、程序的模块或其他数据。计算机可读存储介质的例子包括,但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、快闪记忆体或其他内存技术、只读光盘只读存储器(CD-ROM)、数字多功能光盘(DVD)或其他光学存储、磁盒式磁带,磁带磁磁盘存储或其他磁性存储设备或任何其他非传输介质,可用于存储可以被计算设备访问的信息。按照本文中的界定,计算机可读存储介质不包括暂存电脑可读媒体(transitory media),如调制的数据信号和载波。The above program can be executed in a processor, or can also be stored in a computer-readable storage medium. The computer-readable storage medium includes permanent and non-permanent, removable and non-removable media. Information can be realized by any method or technology. storage. Information may be computer readable instructions, data structures, modules of programs, or other data. Examples of computer-readable storage media include, but are not limited to, phase-change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), Flash Memory or other memory technology, Compact Disc Read Only Memory (CD-ROM), Digital Versatile Disc (DVD) or other optical storage , magnetic tape cartridges, magnetic tape magnetic disk storage or other magnetic storage devices or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, computer-readable storage media does not include transitory computer-readable media, such as modulated data signals and carrier waves.

这些计算机程序也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤,对应与不同的步骤可以通过不同的模块来实现。These computer programs can also be loaded on a computer or other programmable data processing device to cause a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented process, thereby executing instructions on the computer or other programmable device Steps for implementing the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram are provided, and corresponding and different steps may be implemented by different modules.

示例性的,本实施例中就提供了这样的一种设备或系统,即基于大数据分析的企业创新与成长能力的评价系统,该系统包括如下程序模块:企业数据收集处理模块,用于收集样本企业各企业维度的样本数据,按照时间序列对样本数据进行数据清洗、计算和标签分类;其中,企业维度包括工商、员工、资质、知识产权、项目、融资贷款、资产及研发费用;指标权重模块,用于根据样本企业清洗后的样本数据和标签分类,训练指标权重模型,以便根据指标权重模型获取指标项的相对重要性分数和优序图,进而计算各指标项的最终权重值;分数赋予模块,用于根据各待评价企业的企业数据结合各指标项的最终权重值分别计算各待评价企业各指标项的最终评价分数和排名。Exemplarily, this embodiment provides such a device or system, that is, an evaluation system for enterprise innovation and growth capabilities based on big data analysis. The system includes the following program modules: an enterprise data collection and processing module for collecting data. The sample data of each enterprise dimension of the sample enterprise is used for data cleaning, calculation and label classification according to the time series; among which, the enterprise dimension includes industry and commerce, employees, qualifications, intellectual property rights, projects, financing loans, assets and R&D expenses; indicator weights The module is used to train the index weight model according to the sample data and label classification after cleaning of the sample enterprise, so as to obtain the relative importance score and priority map of the index items according to the index weight model, and then calculate the final weight value of each index item; score; The endowment module is used to calculate the final evaluation score and ranking of each index item of each to-be-evaluated enterprise according to the enterprise data of each to-be-evaluated enterprise and the final weight value of each index item.

该系统用于实现上述的实施例中评价方法的功能,该系统中的每个模块与方法中的每个步骤相对应,已经在方法中进行过说明的,在此不再赘述。The system is used to implement the functions of the evaluation method in the above-mentioned embodiment, and each module in the system corresponds to each step in the method, which has been described in the method, and will not be repeated here.

例如,企业数据收集处理模块包括:第一获取单元,用于获取样本企业的工商信息,以及近三年每一年的研究人员数量和学历占比、资质认证信息、I类知识产权的数量、II类知识产权的数量、项目申报信息、贷款金额、融资数据、研发费用和技术合同信息,其中,贷款金额包括最大贷款金额、最小贷款金额、平均贷款金额和贷款总额;清洗及归一化单元,用于清洗和归一化获取单元获取的数据,包括删去重复数据、评价不相关数据和替换中文内容为数字表示,获得数值型企业数据;分类单元,用于对数值型企业数据进行标签分类确定各指标项,将各指标项的数据放入企业数据矩阵的对象中,并分别计算样本企业在各指标项下的数据值;计算单元,用于计算所有样本企业在各指标项下数据值的总和、平均值、最大值、最小值、25%值和75%值,并处理异常数据值。For example, the enterprise data collection and processing module includes: the first acquisition unit, which is used to obtain the industrial and commercial information of the sample enterprises, as well as the number of researchers and the proportion of educational backgrounds in each of the past three years, qualification certification information, the number of I intellectual property rights, The number of Class II intellectual property rights, project application information, loan amount, financing data, R&D expenses and technical contract information, where the loan amount includes the maximum loan amount, the minimum loan amount, the average loan amount and the total loan amount; cleaning and normalization unit , used to clean and normalize the data acquired by the acquisition unit, including deleting duplicate data, evaluating irrelevant data, and replacing Chinese content with digital representation to obtain numerical enterprise data; classification unit, used to label numerical enterprise data Determine each index item by classification, put the data of each index item into the object of the enterprise data matrix, and calculate the data value of the sample enterprise under each index item respectively; the calculation unit is used to calculate the data of all sample enterprises under each index item Sum, mean, maximum, minimum, 25%, and 75% of values, and handle outlier data values.

又例如,指标权重模块包括:预定义单元,用于预定义一个多级的数据分类标签作为计算指标项;机器学习单元,用于接收样本企业的所有样本数据,利用机器学习xgboost库的梯度提升决策树算法提取每一个样本数据的特征并进行分类排序分组,结合输入的计算指标项进行softmax损失函数加正则项计算,逐个获取每个指标项的相对重要性分数和优序图;权重计算单元,用于计算指标项的相对重要性分数与预设对应指标项占比分数和的均值,并以该均值作为该指标项的最终权重值。For another example, the indicator weight module includes: a pre-defined unit for pre-defining a multi-level data classification label as a calculation indicator item; a machine learning unit for receiving all sample data of the sample enterprise, using the gradient boosting of the machine learning xgboost library The decision tree algorithm extracts the features of each sample data and performs classification, sorting and grouping. Combined with the input calculation index items, the softmax loss function plus regular term calculation is performed, and the relative importance score and priority map of each index item are obtained one by one; weight calculation unit , which is used to calculate the mean value of the relative importance score of the index item and the proportion score of the preset corresponding index item, and use the mean value as the final weight value of the index item.

以及例如,分数赋予模块包括:第二获取单元,用于获取各待评价企业的各企业维度的样本数据并进行数据处理,获得各待评价企业的企业数据矩阵;划分评价单元,用于划分指标权重模型计算的各指标项为五组评价维度,包括内生创新、协作引资、管理运营能力、成果转化能力和持续成长能力,将每一组评价维度内任一指标项在企业数据矩阵中对应的数值与其对应的最终权重值进行矩阵点乘,获得对应指标项的评价分数;分值转换单元,用于根据正态分布特征,将各组评价维度的分数转为100分制结果作为企业指标项的最终评价分数。And for example, the score giving module includes: a second obtaining unit for obtaining sample data of each enterprise dimension of each to-be-evaluated enterprise and performing data processing to obtain an enterprise data matrix of each to-be-evaluated enterprise; dividing the evaluation unit for dividing indicators Each index item calculated by the weighting model is divided into five groups of evaluation dimensions, including endogenous innovation, collaborative investment, management and operation ability, achievement transformation ability and sustainable growth ability. Perform matrix dot product with the corresponding final weight value to obtain the evaluation score of the corresponding index item; the score conversion unit is used to convert the scores of each group of evaluation dimensions into a 100-point system according to the normal distribution characteristics as the enterprise index The final evaluation score for the item.

通过本申请的实施例解决了成立时间较短的创业型科技企业不适用财务分析法进行企业成长能力评估的问题,通过对企业的经营活动中不可量化的创新能力、成果转化能力等进行量化评估,避免向企业的财务指标项侧重,同时适用于大、中、小和微型企业的成长能力评估,为企业的发展规划、政策匹配和企业投融资决策提供更精准可靠的数据支撑。The embodiment of the present application solves the problem that the entrepreneurial technology enterprise with a relatively short establishment time cannot use the financial analysis method to evaluate the enterprise growth ability, and quantitatively evaluates the unquantifiable innovation ability and achievement transformation ability in the business activities of the enterprise. , avoid focusing on the financial indicators of enterprises, and apply to the growth ability assessment of large, medium, small and micro enterprises, and provide more accurate and reliable data support for enterprise development planning, policy matching and enterprise investment and financing decisions.

虽然本发明已以较佳实施例揭露如上,然其并非用以限定本发明。本发明所属技术领域中具有通常知识者,在不脱离本发明的精神和范围内,当可作各种的更动与润饰。因此,本发明的保护范围当视权利要求书所界定者为准。Although the present invention has been disclosed above with preferred embodiments, it is not intended to limit the present invention. Those skilled in the art to which the present invention pertains can make various changes and modifications without departing from the spirit and scope of the present invention. Therefore, the protection scope of the present invention should be determined according to the claims.

Claims (10)

1.一种基于大数据分析的企业创新与成长能力的评价方法,其特征在于,包括:1. An evaluation method for enterprise innovation and growth capability based on big data analysis, characterized in that it comprises: 收集样本企业各企业维度的样本数据,按照时间序列对样本数据进行数据清洗、计算和标签分类;其中,企业维度包括工商、员工、资质、知识产权、项目、融资贷款、资产及研发费用;Collect the sample data of each enterprise dimension of the sample enterprise, and perform data cleaning, calculation and label classification on the sample data according to the time series; among which, the enterprise dimension includes industry and commerce, employees, qualifications, intellectual property rights, projects, financing loans, assets and R&D expenses; 采用样本企业清洗后的样本数据和标签分类,训练指标权重模型,以便根据指标权重模型获取指标项的相对重要性分数和优序图,进而计算各指标项的最终权重值;Using the cleaned sample data and label classification of the sample companies, the index weight model is trained, so as to obtain the relative importance score and priority map of the index items according to the index weight model, and then calculate the final weight value of each index item; 根据各待评价企业的企业数据结合各指标项的最终权重值分别计算各待评价企业各指标项的最终评价分数和排名;其中,企业数据为待评价企业进行数据清洗后的各企业维度的样本数据。According to the enterprise data of each enterprise to be evaluated and the final weight value of each index item, the final evaluation score and ranking of each index item of each to-be-evaluated enterprise are calculated respectively; wherein, the enterprise data is a sample of each enterprise dimension after data cleaning of the to-be-evaluated enterprise data. 2.根据权利要求1所述的基于大数据分析的企业创新与成长能力的评价方法,其特征在于,所述按照时间序列对样本数据进行数据清洗、计算和标签分类的过程包括:2. The evaluation method of enterprise innovation and growth capability based on big data analysis according to claim 1, wherein the process of performing data cleaning, calculation and label classification on the sample data according to time series comprises: 获取样本企业的工商信息,以及近三年每一年的研究人员数量和学历占比、资质认证信息、I类知识产权的数量、II类知识产权的数量、项目申报信息、贷款金额、融资数据、研发费用和技术合同信息,其中,贷款金额包括最大贷款金额、最小贷款金额、平均贷款金额和贷款总额;Obtain the industrial and commercial information of the sample companies, as well as the number of researchers and the proportion of educational backgrounds in each of the past three years, qualification certification information, the number of Class I intellectual property rights, the number of Class II intellectual property rights, project application information, loan amount, and financing data , R&D expenses and technical contract information, where the loan amount includes the maximum loan amount, the minimum loan amount, the average loan amount and the total loan amount; 清洗和归一化上述获取数据的内容,替换中文内容为数字表示,获得数值型企业数据;Clean and normalize the content of the obtained data above, replace the Chinese content with digital representation, and obtain numerical enterprise data; 对数值型企业数据进行标签分类确定各指标项,将各指标项的数据放入企业数据矩阵的对象中,并分别计算样本企业在各指标项下的数据值;Label and classify the numerical enterprise data to determine each index item, put the data of each index item into the object of the enterprise data matrix, and calculate the data value of the sample enterprise under each index item respectively; 计算所有样本企业在各指标项下数据值的总和、平均值、最大值、最小值、25%值和75%值,并处理异常数据值。Calculate the sum, average, maximum value, minimum value, 25% value and 75% value of the data values of all sample companies under each indicator, and deal with abnormal data values. 3.根据权利要求2所述的基于大数据分析的企业创新与成长能力的评价方法,其特征在于,所述指标项的最终权重值的计算过程为:3. the evaluation method of the enterprise innovation and growth ability based on big data analysis according to claim 2, is characterized in that, the calculation process of the final weight value of described index item is: 预定义一个多级的数据分类标签作为计算指标项;Predefine a multi-level data classification label as the calculation index item; 接收样本企业的所有样本数据,利用机器学习xgboost库的梯度提升决策树算法提取每一个样本数据的特征并进行分类排序分组,结合输入的计算指标项进行softmax损失函数加正则项计算,逐个获取每个指标项的相对重要性分数和优序图;Receive all the sample data of the sample enterprise, use the gradient boosting decision tree algorithm of the machine learning xgboost library to extract the features of each sample data and classify, sort and group, and combine the input calculation index items to calculate the softmax loss function plus the regular term, and obtain each sample data one by one. The relative importance score and ranking diagram of each index item; 将指标项的相对重要性分数与预设对应指标项占比分数和的均值作为该指标项的最终权重值。The average value of the relative importance score of the index item and the proportion score of the preset corresponding index item is taken as the final weight value of the index item. 4.根据权利要求3所述的基于大数据分析的企业创新与成长能力的评价方法,其特征在于,所述各待评价企业各指标项的最终评价分数的计算过程为:4. the evaluation method of the enterprise innovation and growth ability based on big data analysis according to claim 3, is characterized in that, the calculation process of the final evaluation score of each index item of each said enterprise to be evaluated is: 获取各待评价企业的各企业维度的样本数据并进行数据处理,获得各待评价企业的企业数据矩阵;Obtain the sample data of each enterprise dimension of each to-be-evaluated enterprise and process the data to obtain the enterprise data matrix of each to-be-evaluated enterprise; 划分指标权重模型计算的各指标项为五组评价维度,包括内生创新、协作引资、管理运营能力、成果转化能力和持续成长能力,将每一组评价维度内任一指标项在企业数据矩阵中对应的数值与其对应的最终权重值进行矩阵点乘,获得对应指标项的评价分数;Each index item calculated by the index weight model is divided into five groups of evaluation dimensions, including endogenous innovation, collaborative investment, management and operation ability, achievement transformation ability and sustainable growth ability. Perform matrix dot product on the corresponding value in and its corresponding final weight value to obtain the evaluation score of the corresponding index item; 根据正态分布特征,将各组评价维度的分数转为100分制结果作为企业指标项的最终评价分数。According to the characteristics of normal distribution, the scores of each group of evaluation dimensions are converted into a 100-point system as the final evaluation score of the enterprise index item. 5.根据权利要求2所述的基于大数据分析的企业创新与成长能力的评价方法,其特征在于,所述样本企业的指标项包括员工总数、研究人员占比、大专以上学历占比、研究生占比、企业资质数量、产学研资质、下辖机构数量、研发活动数量、研发费用占比、企业前年研发费用、去年研发费用、前年立项费用、去年立项费用、项目金额最大值、项目金额最小值、项目金额均值、项目金额总数额、获批项目总次数、申请项目总次数、股权投资总数额、股权投资总次数、前年债权总额、去年债权总额、债权金额最大值、债权金额最小值、债权金额均值、债权金额总数额、获得债权总次数、前年总资产、去年总资产、前年净资产、去年净资产、前年主营业务收入、去年主营业务收入、前年成本费用、去年成本费用、前年净利润、去年净利润、前年上缴利税、去年上缴利税、前年主营业务占比、去年主营业务占比、前年净利润率、去年净利润率、管理认证、I类知识产权、II类知识产权、前年技术合同数量、去年技术合同数量、最大技术合同技术交易额、最小技术合同技术交易额、平均技术合同技术交易额、技术合同技术交易额总额、技术合同技术交易额总次数、国家科技奖励数量、形成标准数量、科技成果转化数量、高新技术产品数量、高新技术收入、上年度总资产增长率、净资产增长率、主营业务收入增长率、成本费用变化率、净利润增长率、上缴利税增长率、研发费用增长率、主营业务占比变化率、净利润率增长率、研发费用占比增长率、知识产权数量增长率和研发人员占比增长率。5. The evaluation method of enterprise innovation and growth ability based on big data analysis according to claim 2, wherein the index items of the sample enterprise include the total number of employees, the proportion of researchers, the proportion of college education, and the proportion of graduate students. Proportion, number of enterprise qualifications, industry-university-research qualifications, number of subordinate institutions, number of R&D activities, ratio of R&D expenses, R&D expenses in the previous year, last year's R&D expenses, project establishment expenses in the previous year, last year's project establishment expenses, maximum project amount, and minimum project amount Value, average value of project amount, total amount of project amount, total number of approved projects, total number of application projects, total amount of equity investment, total number of equity investment, total amount of debt in the previous year, total amount of debt in the last year, maximum amount of debt, minimum amount of debt, Average amount of claims, total amount of claims, total number of claims, total assets in the previous year, total assets in the previous year, net assets in the previous year, net assets in the previous year, main business income in the previous year, main business income in the previous year, cost in the previous year, cost in the previous year, Net profit of the previous year, net profit of the previous year, profit and tax paid in the previous year, profit and tax paid in the previous year, the proportion of the main business in the previous year, the proportion of the main business in the previous year, the net profit rate of the previous year, the net profit rate of the previous year, management certification, class I intellectual property, class II Intellectual property rights, the number of technical contracts in the previous year, the number of technical contracts in the last year, the largest technical transaction value of technical contracts, the smallest technical transaction value of technical contracts, the average technical transaction value of technical contracts, the total technical transaction value of technical contracts, the total number of technical transactions of technical contracts, the country The number of scientific and technological awards, the number of standards formed, the number of scientific and technological achievements transformed, the number of high-tech products, high-tech income, the growth rate of total assets in the previous year, the growth rate of net assets, the growth rate of main business income, the rate of change in costs and expenses, and the growth rate of net profit , the growth rate of profits and taxes paid, the growth rate of R&D expenses, the rate of change in the proportion of main business, the growth rate of net profit margin, the growth rate of R&D expenses, the growth rate of intellectual property rights, and the growth rate of R&D personnel. 6.根据权利要求2所述的基于大数据分析的企业创新与成长能力的评价方法,其特征在于,所述异常数据值的处理方法为:对于占比类数据不在0~1范围的数据值,大于1时的设置为相应项的75%值,小于1时的设置为相应项的25%值;对于增长率大于20的数据设置为相应项的75%值,增长率小于-20的数据设置为相应项的25%值。6 . The method for evaluating enterprise innovation and growth capability based on big data analysis according to claim 2 , wherein the processing method of the abnormal data value is: for the data value whose proportion type data is not in the range of 0 to 1. 7 . , when it is greater than 1, it is set as the 75% value of the corresponding item, and when it is less than 1, it is set as the 25% value of the corresponding item; for the data whose growth rate is greater than 20, it is set as the 75% value of the corresponding item, and the data whose growth rate is less than -20 Set to the 25% value of the corresponding item. 7.一种基于大数据分析的企业创新与成长能力的评价系统,其特征在于,包括:7. An evaluation system for enterprise innovation and growth capability based on big data analysis, characterized in that it includes: 企业数据收集处理模块,用于收集样本企业各企业维度的样本数据,按照时间序列对样本数据进行数据清洗、计算和标签分类;其中,企业维度包括工商、员工、资质、知识产权、项目、融资贷款、资产及研发费用;The enterprise data collection and processing module is used to collect sample data of each enterprise dimension of the sample enterprise, and perform data cleaning, calculation and label classification of the sample data according to the time series; among which, the enterprise dimension includes business, employee, qualification, intellectual property, project, financing Loans, assets and research and development expenses; 指标权重模块,用于根据样本企业清洗后的样本数据和标签分类,训练指标权重模型,以便根据指标权重模型获取指标项的相对重要性分数和优序图,进而计算各指标项的最终权重值;The indicator weight module is used to train the indicator weight model according to the cleaned sample data and label classification of the sample enterprises, so as to obtain the relative importance score and priority map of the indicator items according to the indicator weight model, and then calculate the final weight value of each indicator item. ; 分数赋予模块,用于根据各待评价企业的企业数据结合各指标项的最终权重值分别计算各待评价企业各指标项的最终评价分数和排名。The score assigning module is used to calculate the final evaluation score and ranking of each index item of each to-be-evaluated enterprise according to the enterprise data of each to-be-evaluated enterprise and the final weight value of each index item. 8.根据权利要求7所述的基于大数据分析的企业创新与成长能力的评价系统,其特征在于,所述企业数据收集处理模块包括:8. The evaluation system of enterprise innovation and growth capability based on big data analysis according to claim 7, wherein the enterprise data collection and processing module comprises: 第一获取单元,用于获取样本企业的工商信息,以及近三年每一年的研究人员数量和学历占比、资质认证信息、I类知识产权的数量、II类知识产权的数量、项目申报信息、贷款金额、融资数据、研发费用和技术合同信息,其中,贷款金额包括最大贷款金额、最小贷款金额、平均贷款金额和贷款总额;The first acquisition unit is used to obtain the industrial and commercial information of the sample enterprises, as well as the number of researchers and the proportion of educational backgrounds in each of the past three years, information on qualification certification, the number of Class I intellectual property rights, the number of Class II intellectual property rights, and project declarations Information, loan amount, financing data, research and development expenses and technical contract information, wherein the loan amount includes the maximum loan amount, the minimum loan amount, the average loan amount and the total loan amount; 清洗及归一化单元,用于清洗和归一化获取单元获取的数据,包括删去重复数据、评价不相关数据和替换中文内容为数字表示,获得数值型企业数据;The cleaning and normalization unit is used to clean and normalize the data acquired by the acquisition unit, including deleting duplicate data, evaluating irrelevant data, and replacing Chinese content with digital representation to obtain numerical enterprise data; 分类单元,用于对数值型企业数据进行标签分类确定各指标项,将各指标项的数据放入企业数据矩阵的对象中,并分别计算样本企业在各指标项下的数据值;The classification unit is used to label and classify the numerical enterprise data to determine each index item, put the data of each index item into the object of the enterprise data matrix, and calculate the data value of the sample enterprise under each index item respectively; 计算单元,用于计算所有样本企业在各指标项下数据值的总和、平均值、最大值、最小值、25%值和75%值,并处理异常数据值。The calculation unit is used to calculate the sum, average, maximum value, minimum value, 25% value and 75% value of data values of all sample enterprises under each index item, and process abnormal data values. 9.根据权利要求8所述的基于大数据分析的企业创新与成长能力的评价系统,其特征在于,所述指标权重模块包括:9. The evaluation system of enterprise innovation and growth capability based on big data analysis according to claim 8, wherein the indicator weight module comprises: 预定义单元,用于预定义一个多级的数据分类标签作为计算指标项;The predefined unit is used to predefine a multi-level data classification label as the calculation index item; 机器学习单元,用于接收样本企业的所有样本数据,利用机器学习xgboost库的梯度提升决策树算法提取每一个样本数据的特征并进行分类排序分组,结合输入的计算指标项进行softmax损失函数加正则项计算,逐个获取每个指标项的相对重要性分数和优序图;The machine learning unit is used to receive all the sample data of the sample enterprise, use the gradient boosting decision tree algorithm of the machine learning xgboost library to extract the features of each sample data and classify, sort and group, and combine the input calculation index items to perform softmax loss function plus regularization Item calculation, obtain the relative importance score and ranking diagram of each index item one by one; 权重计算单元,用于计算指标项的相对重要性分数与预设对应指标项占比分数和的均值,并以该均值作为该指标项的最终权重值。The weight calculation unit is configured to calculate the mean value of the relative importance score of the index item and the proportion score of the preset corresponding index item, and use the mean value as the final weight value of the index item. 10.根据权利要求9所述的基于大数据分析的企业创新与成长能力的评价系统,其特征在于,所述分数赋予模块,包括:10. The evaluation system for enterprise innovation and growth capability based on big data analysis according to claim 9, wherein the score giving module comprises: 第二获取单元,用于获取各待评价企业的各企业维度的样本数据并进行数据处理,获得各待评价企业的企业数据矩阵;The second obtaining unit is used to obtain sample data of each enterprise dimension of each enterprise to be evaluated and perform data processing to obtain the enterprise data matrix of each enterprise to be evaluated; 划分评价单元,用于划分指标权重模型计算的各指标项为五组评价维度,包括内生创新、协作引资、管理运营能力、成果转化能力和持续成长能力,将每一组评价维度内任一指标项在企业数据矩阵中对应的数值与其对应的最终权重值进行矩阵点乘,获得对应指标项的评价分数;The evaluation unit is divided, and each index item used for the calculation of the index weight model is divided into five groups of evaluation dimensions, including endogenous innovation, collaborative investment, management and operation ability, achievement transformation ability and sustainable growth ability. The corresponding value of the index item in the enterprise data matrix is multiplied by the matrix dot with its corresponding final weight value to obtain the evaluation score of the corresponding index item; 分值转换单元,用于根据正态分布特征,将各组评价维度的分数转为100分制结果作为企业指标项的最终评价分数。The score conversion unit is used to convert the scores of each group of evaluation dimensions into a 100-point system according to the normal distribution characteristics as the final evaluation score of the enterprise index item.
CN202111669894.XA 2021-12-31 2021-12-31 An evaluation method and system for enterprise innovation and growth capabilities based on big data analysis Active CN114328461B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111669894.XA CN114328461B (en) 2021-12-31 2021-12-31 An evaluation method and system for enterprise innovation and growth capabilities based on big data analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111669894.XA CN114328461B (en) 2021-12-31 2021-12-31 An evaluation method and system for enterprise innovation and growth capabilities based on big data analysis

Publications (2)

Publication Number Publication Date
CN114328461A true CN114328461A (en) 2022-04-12
CN114328461B CN114328461B (en) 2025-04-25

Family

ID=81020284

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111669894.XA Active CN114328461B (en) 2021-12-31 2021-12-31 An evaluation method and system for enterprise innovation and growth capabilities based on big data analysis

Country Status (1)

Country Link
CN (1) CN114328461B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115760432A (en) * 2022-11-22 2023-03-07 东方微银科技股份有限公司 A precise positioning method and system for the life cycle of a technology enterprise
CN116186177A (en) * 2023-04-27 2023-05-30 华智众创(北京)投资管理有限责任公司 Data processing method and device, computing equipment and storage medium
CN116775900A (en) * 2023-06-13 2023-09-19 南京智绘星图信息科技有限公司 Government affair auxiliary management method and system based on rule knowledge graph driving
CN116823022A (en) * 2023-05-26 2023-09-29 科学技术部火炬高技术产业开发中心 An analysis method for innovation management of scientific and technological enterprises
CN117436726A (en) * 2023-12-14 2024-01-23 惠民县黄河先进技术研究院 A regional high-tech industry assessment method and system
CN119168194A (en) * 2024-08-07 2024-12-20 广州博士信息技术研究院有限公司 Enterprise growth evaluation prediction method, system and device based on model training
CN120725540A (en) * 2025-08-21 2025-09-30 中交一公局厦门工程有限公司 A system and method for evaluating the relative transformation capability of scientific and technological achievements

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006095746A1 (en) * 2005-03-07 2006-09-14 Intellectual Property Bank Corp. Company evaluation assisting device
US20170124464A1 (en) * 2015-10-28 2017-05-04 Fractal Industries, Inc. Rapid predictive analysis of very large data sets using the distributed computational graph
CN107644375A (en) * 2016-07-22 2018-01-30 花生米浙江数据信息服务股份有限公司 Small trade company's credit estimation method that a kind of expert model merges with machine learning model
US20180293320A1 (en) * 2017-04-05 2018-10-11 Planbox Inc. System and methods for cloud based autonomous and collaborative authoring
CN110046984A (en) * 2019-03-01 2019-07-23 安徽省优质采科技发展有限责任公司 Enterprise credit risk system and evaluation method
CN112330441A (en) * 2020-11-12 2021-02-05 北京宸信征信有限公司 Method for evaluating business value credit loan of medium and small enterprises
CN113205403A (en) * 2021-03-30 2021-08-03 北京中交兴路信息科技有限公司 Method and device for calculating enterprise credit level, storage medium and terminal
CN113256075A (en) * 2021-04-29 2021-08-13 浙江非线数联科技股份有限公司 Enterprise risk level evaluation method based on hierarchical analysis and fuzzy comprehensive evaluation method
CN113537807A (en) * 2021-07-27 2021-10-22 天元大数据信用管理有限公司 Enterprise intelligent wind control method and device

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006095746A1 (en) * 2005-03-07 2006-09-14 Intellectual Property Bank Corp. Company evaluation assisting device
US20170124464A1 (en) * 2015-10-28 2017-05-04 Fractal Industries, Inc. Rapid predictive analysis of very large data sets using the distributed computational graph
CN107644375A (en) * 2016-07-22 2018-01-30 花生米浙江数据信息服务股份有限公司 Small trade company's credit estimation method that a kind of expert model merges with machine learning model
US20180293320A1 (en) * 2017-04-05 2018-10-11 Planbox Inc. System and methods for cloud based autonomous and collaborative authoring
CN110046984A (en) * 2019-03-01 2019-07-23 安徽省优质采科技发展有限责任公司 Enterprise credit risk system and evaluation method
CN112330441A (en) * 2020-11-12 2021-02-05 北京宸信征信有限公司 Method for evaluating business value credit loan of medium and small enterprises
CN113205403A (en) * 2021-03-30 2021-08-03 北京中交兴路信息科技有限公司 Method and device for calculating enterprise credit level, storage medium and terminal
CN113256075A (en) * 2021-04-29 2021-08-13 浙江非线数联科技股份有限公司 Enterprise risk level evaluation method based on hierarchical analysis and fuzzy comprehensive evaluation method
CN113537807A (en) * 2021-07-27 2021-10-22 天元大数据信用管理有限公司 Enterprise intelligent wind control method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
叶琦林;张健明: "上海高新技术产业竞争力评价", 科技管理研究, no. 02, 20 January 2020 (2020-01-20) *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115760432A (en) * 2022-11-22 2023-03-07 东方微银科技股份有限公司 A precise positioning method and system for the life cycle of a technology enterprise
CN116186177A (en) * 2023-04-27 2023-05-30 华智众创(北京)投资管理有限责任公司 Data processing method and device, computing equipment and storage medium
CN116823022A (en) * 2023-05-26 2023-09-29 科学技术部火炬高技术产业开发中心 An analysis method for innovation management of scientific and technological enterprises
CN116823022B (en) * 2023-05-26 2024-06-28 科学技术部火炬高技术产业开发中心 Innovative management analysis method for scientific and technological enterprises
CN116775900A (en) * 2023-06-13 2023-09-19 南京智绘星图信息科技有限公司 Government affair auxiliary management method and system based on rule knowledge graph driving
CN116775900B (en) * 2023-06-13 2024-02-02 南京智绘星图信息科技有限公司 Government affair auxiliary management method and system based on rule knowledge graph driving
CN117436726A (en) * 2023-12-14 2024-01-23 惠民县黄河先进技术研究院 A regional high-tech industry assessment method and system
CN119168194A (en) * 2024-08-07 2024-12-20 广州博士信息技术研究院有限公司 Enterprise growth evaluation prediction method, system and device based on model training
CN120725540A (en) * 2025-08-21 2025-09-30 中交一公局厦门工程有限公司 A system and method for evaluating the relative transformation capability of scientific and technological achievements

Also Published As

Publication number Publication date
CN114328461B (en) 2025-04-25

Similar Documents

Publication Publication Date Title
CN114328461A (en) Big data analysis-based enterprise innovation and growth capacity evaluation method and system
Budianto Research mapping on credit risk in Islamic and conventional banking
Zarutska et al. Structural-functional analysis as innovation in public governance (case of of banking supervision)
JP5096642B2 (en) Economic activity index presentation system
CN112541817A (en) Marketing response processing method and system for potential customers of personal consumption loan
CN112613977A (en) Personal credit loan admission credit granting method and system based on government affair data
Rathi et al. A study on application of artificial intelligence and machine learning in indian taxation system
Li et al. Performance management of university financial budget execution relying on comprehensive budget management strengthening model
Biswas et al. Automated credit assessment framework using ETL process and machine learning
Carboni et al. Innovative activities and investment decision: evidence from European firms
Fomina et al. Managing the agricultural enterprises’ valuation: Actuarial approach
Shi et al. How Is Gross Profit Margin Overestimated in China?
Balakayeva et al. The solution to the problem of processing big data using the example of assessing the solvency of borrowers
Ozili Artificial intelligence application and research in accounting, finance, economics, business, and management
Mackenzie Efficiency of tax revenue administration in Africa
Wang et al. Forecasting and measuring financial performance in banking sector
Wang et al. Applying TOPSIS method to evaluate the business operation performance of Vietnam listing securities companies
Gai [Retracted] Buffett’s Value Investment Theory Based on Improved DCF Model
AU2020101902A4 (en) Framework for business intelligence adoption in banks for performance enhancement
Dabour RegTech and accounting of financial institutions under financial inclusion: Conceptual framework analysis
Fitriyani et al. Analysis of Financial Statements to Measure Performance at PT. Kino Indonesia, Tbk 2015-2019
Dash et al. An empirical study for customer relationship management in banking sector using machine learning techniques
Jafari et al. Presenting a mathematical approach to developing the internal rate of Return method: An application of Horner's theorem in econometrics
Fomina et al. Managing the capital force impulse of the agrarian enterprise: transfer approach
Filipova et al. IMPROVING COMMUNICATION THROUGH FINANCIAL STATEMENTS: SOME EFFECTS OF АPPLYING THE INTERNATIONAL ACCOUNTING STANDARDS BOARD INITIATIVE

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant