
CN101833587A - Internet Video Search System - Google Patents

Internet Video Search System

Info

Publication number
CN101833587A
Authority
CN
China
Prior art keywords
submodule
unit
video
database
links
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 201010186145
Other languages
Chinese (zh)
Inventor
蒋兴浩
孙锬锋
傅光磊
李荣杰
冯冰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiao Tong University
Original Assignee
Shanghai Jiao Tong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiao Tong University
Priority to CN 201010186145
Publication of CN101833587A
Legal status: Pending

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a network video search system in the technical field of network applications. The system comprises a data storage module, a service processing module and a user interface module. The data storage module comprises a network video crawler submodule and a database submodule. The service processing module comprises a search interface submodule, a database operation submodule, a network video hot word analysis submodule, a user interest model management submodule, a network video address detection submodule and a system configuration management submodule. The system provides users with multi-level network video search, proactively recommends network videos based on each user's interests, supports downloading network videos, and can analyze network video hot words. System administrators can manage the system effectively through the configuration options of the system configuration management submodule. Search time is greatly reduced and accuracy is improved.

Description

Internet Video Search System

Technical Field

The invention relates to a system in the field of network application technology, specifically a network video search system.

Background

With the continuing development of Internet technology, network video has become a widely used medium for conveying information. Mainstream video websites keep emerging: well-known sites in China include Youku and Tudou, and well-known sites abroad include YouTube. Some general portal sites also run their own video services to broadcast news and other media content. Besides providing search to users, a search engine is in essence a tool for the statistical analysis of data. The early Internet was dominated by text, whereas network video is now a major information carrier, so search engines need statistical analysis methods suited to video.

A search of the existing literature found the 2007 Beijing Jiaotong University master's thesis "Design and Implementation of a Search Engine Based on Network Video Services" (classification number TP393.09). The thesis discusses a search engine system for network video services whose architecture comprises three modules: an information capture module, an information indexing module and an information retrieval module. The information capture module uses a web spider to crawl information from video websites, analyzes and extracts it, and stores the result in a database. The information indexing module reads the videos' attribute information from the database, applies Chinese text processing, and uses Lucene to generate index files. The information retrieval module comprises a user interface and an indexer: the user interface accepts the keywords entered by the user and returns the search results, while the indexer searches the index files for the user's keywords and sorts the results according to given requirements. Although the thesis discusses how to implement a network video search engine, the system it designs is limited in function: it can only search network videos, searches take a long time, it offers no multi-level search interface, and it lacks network video hot word analysis, user interest management and network video address detection.

Summary of the Invention

The purpose of the invention is to overcome the above shortcomings of the prior art by providing a network video search system. By analyzing network video data and users' search behavior, the invention realizes a multifunctional, intelligent network video search and monitoring system. Its advantages are user personalization, downloadable network videos, multi-level video search, flexible system operation configuration, and the ability to compile statistics on network video hot topics.

The invention is realized through the following technical solution:

The invention comprises a data storage module, a service processing module and a user interface module. The data storage module is connected to the service processing module to transmit data information and processing information; the service processing module is connected to the user interface module to transmit search request information and search result information.

The data storage module acquires and stores network video data. It comprises a network video crawler submodule and a database submodule. The network video crawler submodule is connected to the database submodule to transmit network video data; the network video crawler submodule is connected to the service processing module to transmit module operation configuration information; and the database submodule is connected to the service processing module to transmit database operation requests and the data those operations return.

The network video crawler submodule comprises a page download unit, a page content analysis unit and a video information extraction unit. The page download unit is connected to the page content analysis unit to transmit the page data of the video website being crawled; the page content analysis unit is connected to the video information extraction unit to transmit the video-related information found in the analyzed page; and the video information extraction unit is connected to the database submodule to transmit the precise video information extracted from the page.

The service processing module comprises a search interface submodule, a database operation submodule, a network video hot word analysis submodule, a user interest model management submodule, a network video address detection submodule and a system configuration management submodule. The search interface submodule is connected to the user interface module to transmit search request information and search result information. The database operation submodule is connected to the search interface submodule to transmit search condition information and search return information; to the network video hot word analysis submodule to transmit database operation information and hot word analysis results; to the user interest model management submodule to transmit database operation information and user interest model updates; to the network video address detection submodule to transmit database operation information and network video address information; and to the data storage module to transmit database storage information. The system configuration management submodule is connected to the user interest model management submodule, the network video hot word analysis submodule, the network video address detection submodule, the database operation submodule and the network video crawler submodule, in each case to transmit operation configuration information.

The database operation submodule comprises a database data retrieval unit, a database data addition unit, a database data deletion unit, a database data update unit and a database view update unit. Each unit is connected to the data storage module: the retrieval unit transmits database query statements and the returned result data, the addition unit transmits commands that add data, the deletion unit transmits commands that delete data, the update unit transmits commands that update data, and the view update unit transmits commands that update database views.

The search interface submodule comprises a user search condition acceptance unit, a user search result return unit, a search condition processing unit and a search execution unit. The user search condition acceptance unit is connected to the search condition processing unit to transmit the user's original search conditions; the search condition processing unit is connected to the search execution unit to transmit the search instructions produced by system processing; the search execution unit is connected to the database operation submodule to transmit database operation information; and the user search result return unit is connected to the database operation submodule to transmit the data returned by database operations.

The network video hot word analysis submodule comprises a video title extraction unit, a video title word segmentation unit, a title vocabulary clustering unit, a title vocabulary statistics unit and a hot vocabulary update unit. The video title extraction unit is connected to the video title word segmentation unit to transmit the set of video titles to be analyzed; the video title word segmentation unit is connected to the title vocabulary clustering unit to transmit the words that make up each title; the title vocabulary clustering unit is connected to the title vocabulary statistics unit to transmit the clustered vocabulary space; the title vocabulary statistics unit is connected to the hot vocabulary update unit to transmit the set of words that occur most frequently; the hot vocabulary update unit is connected to the database operation submodule to transmit database commands that add data; and the video title extraction unit is connected to the database operation submodule to transmit the database statements that fetch video titles.

The user interest model management submodule comprises a user behavior representation unit, a user interest model update unit and a user interest recommendation unit. The user behavior representation unit is connected to the user interest model update unit to transmit information identifying the user's search behavior; the user interest model update unit is connected to the database operation submodule to transmit the database statements that update the user's interests; and the user interest recommendation unit is connected to the database operation submodule to transmit the database statements that fetch the videos to recommend to the user.

The network video address detection submodule comprises a data packet acquisition unit, a data packet content analysis unit and a file address extraction unit. The data packet acquisition unit is connected to the data packet content analysis unit to transmit captured HTTP (HyperText Transfer Protocol) packets; the data packet content analysis unit is connected to the file address extraction unit to transmit the HTTP packets that contain a file address; and the file address extraction unit is connected to the database operation submodule to transmit the database update statements that update the video information.
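The packet analysis and file address extraction steps can be illustrated with a minimal Python sketch. It assumes the file address appears in the request line of a captured HTTP GET request and is combined with the Host header; the source does not say where in the packet the address is found. The function name extract_file_url and the list of video extensions are hypothetical.

```python
from typing import Optional

# Assumed set of video file extensions; the source does not list any.
VIDEO_EXTENSIONS = (".flv", ".mp4", ".f4v")


def extract_file_url(http_request: str) -> Optional[str]:
    """Return the video file URL named by a raw HTTP GET request, or None."""
    lines = http_request.split("\r\n")
    parts = lines[0].split(" ")
    if len(parts) != 3 or parts[0] != "GET":
        return None
    target = parts[1]
    # Ignore the query string when checking the file extension.
    path = target.split("?", 1)[0]
    if not path.lower().endswith(VIDEO_EXTENSIONS):
        return None
    for line in lines[1:]:
        if not line:  # a blank line ends the header section
            break
        name, _, value = line.partition(":")
        if name.strip().lower() == "host":
            return f"http://{value.strip()}{target}"
    return None
```

A request for a page that is not a video file, or a request that is not a GET, yields None, so only packets that name a video file would be passed on to the database update step.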

The system configuration management submodule comprises a network video crawler configuration unit, a network video address detection configuration unit, a user interest model management configuration unit, a network video hot word analysis configuration unit and a stale data deletion unit. The network video crawler configuration unit is connected to the data storage module to transmit changes to the crawler's operation configuration; the user interest model management configuration unit is connected to the user interest model management submodule to transmit module operation configuration information; the network video hot word analysis configuration unit is connected to the network video hot word analysis submodule to transmit module operation configuration information; the network video address detection configuration unit is connected to the network video address detection submodule to transmit module operation configuration information; and the stale data deletion unit is connected to the database operation submodule to transmit the database statements that delete video information.

Compared with the prior art, the invention has the following beneficial effects: it provides users with multi-level network video search; it proactively recommends network videos based on each user's interests; it supports downloading network videos; and it can analyze network video hot words. In addition, system administrators can manage the system effectively through the configuration options of the system configuration management submodule. The time needed for a network video search is greatly reduced and accuracy is improved.

Brief Description of the Drawings

Fig. 1 is a schematic diagram of the composition and connections of the system of the invention.

Detailed Description

The system of the invention is further described below with reference to the drawing. This embodiment is implemented on the basis of the technical solution of the invention, and a detailed implementation and specific operating procedure are given, but the scope of protection of the invention is not limited to the following embodiment.

Embodiment

As shown in Fig. 1, this embodiment comprises a data storage module, a service processing module and a user interface module. The data storage module is connected to the service processing module to transmit data information and processing information; the service processing module is connected to the user interface module to transmit search request information and search result information.

The data storage module acquires and stores network video data. It comprises a network video crawler submodule and a database submodule. The network video crawler submodule is connected to the database submodule to transmit network video data; the network video crawler submodule is connected to the service processing module to transmit module operation configuration information; and the database submodule is connected to the service processing module to transmit database operation requests and the data those operations return.

The network video crawler submodule comprises a page download unit, a page content analysis unit and a video information extraction unit. The page download unit is connected to the page content analysis unit to transmit the page data of the video website being crawled; the page content analysis unit is connected to the video information extraction unit to transmit the video-related information found in the analyzed page; and the video information extraction unit is connected to the database submodule to transmit the precise video information extracted from the page.

In this embodiment, the data the system provides to the user for each network video comprises: (1) the video's thumbnail image, (2) the video's source website, (3) the video's title, (4) the video's click rate, (5) the video's download address and (6) the video's playback link. Because page layouts differ greatly between video websites, a separate crawler must be designed for each site, and the page element structure of the target site must be analyzed before its crawler is designed. The network video crawler submodule crawls video data according to the organizational information of the site's own pages, including the site's category structure and popularity rankings. The crawled video data is stored in the relevant tables of the server's database, and the related views are updated at the same time. Because this is focused crawling, the extracted information must be precise; since page structures differ from site to site, the implementation builds one crawler for each mainstream video website, and together these form a crawler pool.
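The crawler pool idea can be sketched in Python as a registry that maps each site name to its own extraction routine. This is only an illustration under stated assumptions: the site name "example", its HTML markup (an anchor with class "video" and data attributes) and all identifiers below are hypothetical, and page downloading is left out so that the sketch covers only the analysis and extraction units.

```python
from dataclasses import dataclass
from html.parser import HTMLParser
from typing import Callable, Dict, List


@dataclass
class VideoRecord:
    """The six per-video fields listed in the embodiment."""
    thumbnail_url: str
    source_site: str
    title: str
    click_count: int
    download_url: str  # left empty here; the page alone does not reveal it
    play_url: str


class _ExampleSiteParser(HTMLParser):
    """Parser for a hypothetical site that marks each video as
    <a class="video" href=... title=... data-clicks=... data-thumb=...>."""

    def __init__(self) -> None:
        super().__init__()
        self.records: List[VideoRecord] = []

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "a" and a.get("class") == "video":
            self.records.append(VideoRecord(
                thumbnail_url=a.get("data-thumb") or "",
                source_site="example",
                title=a.get("title") or "",
                click_count=int(a.get("data-clicks") or "0"),
                download_url="",
                play_url=a.get("href") or "",
            ))


def parse_example_site(html: str) -> List[VideoRecord]:
    parser = _ExampleSiteParser()
    parser.feed(html)
    return parser.records


# The crawler pool: one site-specific extraction routine per supported site.
CRAWLER_POOL: Dict[str, Callable[[str], List[VideoRecord]]] = {
    "example": parse_example_site,
}


def extract_videos(site: str, html: str) -> List[VideoRecord]:
    """Dispatch a downloaded page to the crawler written for its site."""
    return CRAWLER_POOL[site](html)
```

Supporting another site then means writing one more parser for that site's page structure and registering it in the pool, which matches the one-crawler-per-site approach described above.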

In this embodiment the database submodule is built on SQL Server 2005. A video data table is designed for each network video website, and all of these tables share the same structure: primary key, playback address, title, thumbnail link address, play count, remote address of the video file, and the time the video data was last updated. A table for the user interest model is also required, with the following structure: user ID (primary key), the user's click rate for each video category, the user's click rate for each video website, and the keywords the user searches most often. An event log table records events that occur while the system runs, with the following structure: event ID (primary key), event content, event source and event time. A view is created for each category of each video website, and the corresponding view must be refreshed whenever a video data table is updated. For example, the view for sports videos on Youku can be named youku_sport_view.
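The table and view layout can be sketched as follows. The embodiment uses SQL Server 2005; this sketch substitutes Python's built-in sqlite3 module so that it runs anywhere. All table and column names except youku_sport_view are assumptions, and the category column is added here only because the per-category view needs something to filter on (the source does not list it among the table fields). The user interest table is omitted for brevity.

```python
import sqlite3

SCHEMA = """
CREATE TABLE youku_video (
    id            INTEGER PRIMARY KEY,  -- primary key of the table
    play_url      TEXT NOT NULL,        -- playback address
    title         TEXT NOT NULL,
    thumbnail_url TEXT,                 -- thumbnail link address
    play_count    INTEGER DEFAULT 0,
    file_url      TEXT,                 -- remote address of the video file
    category      TEXT,                 -- assumed column, used by the view
    updated_at    TEXT                  -- time of last update
);
CREATE TABLE event_log (
    id          INTEGER PRIMARY KEY,
    content     TEXT,
    source      TEXT,
    occurred_at TEXT
);
CREATE VIEW youku_sport_view AS
    SELECT * FROM youku_video WHERE category = 'sport';
"""


def create_schema(conn: sqlite3.Connection) -> None:
    """Create the per-site video table, the event log and one category view."""
    conn.executescript(SCHEMA)
```

A SQLite view is evaluated on every query, so it needs no explicit refresh; the refresh step described above would apply to a database where views are stored or indexed.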

The service processing module comprises a search interface submodule, a database operation submodule, a network video hot word analysis submodule, a user interest model management submodule, a network video address detection submodule and a system configuration management submodule. The search interface submodule is connected to the user interface module to transmit search request information and search result information. The database operation submodule is connected to the search interface submodule to transmit search condition information and search return information; to the network video hot word analysis submodule to transmit database operation information and hot word analysis results; to the user interest model management submodule to transmit database operation information and user interest model updates; to the network video address detection submodule to transmit database operation information and network video address information; and to the data storage module to transmit database storage information. The system configuration management submodule is connected to the user interest model management submodule, the network video hot word analysis submodule, the network video address detection submodule, the database operation submodule and the network video crawler submodule, in each case to transmit operation configuration information.

The database operation submodule comprises a database data retrieval unit, a database data addition unit, a database data deletion unit, a database data update unit and a database view update unit. Each unit is connected to the data storage module: the retrieval unit transmits database query statements and the returned result data, the addition unit transmits commands that add data, the deletion unit transmits commands that delete data, the update unit transmits commands that update data, and the view update unit transmits commands that update database views.

In this embodiment the database operation submodule is built with ADO.NET, which provides good support for database operations. Implementing the submodule requires the following functional interfaces: (1) adding data to a table in the database; (2) deleting data from a table in the database; (3) modifying data in a table in the database; (4) retrieving data from the database. The parameters of every interface must be considered in the implementation; the add interface, for instance, needs to know which table to add to and what data to add.
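The four functional interfaces might look like the sketch below. The embodiment uses ADO.NET; Python's sqlite3 module stands in here purely for illustration. The function names, the table allowlist and the dictionary-based parameters are assumptions. Values are always passed as bound parameters, while table and column names are interpolated and are therefore assumed to come from trusted code, never from user input.

```python
import sqlite3
from typing import Any, Dict, List, Tuple

ALLOWED_TABLES = {"video"}  # hypothetical table name


def _table(name: str) -> str:
    """Reject table names that are not explicitly allowed."""
    if name not in ALLOWED_TABLES:
        raise ValueError(f"unknown table: {name}")
    return name


def _where(where: Dict[str, Any]) -> Tuple[str, List[Any]]:
    """Build an optional WHERE clause of ANDed equality tests."""
    if not where:
        return "", []
    clause = " AND ".join(f"{col} = ?" for col in where)
    return f" WHERE {clause}", list(where.values())


def add_row(conn: sqlite3.Connection, table: str, data: Dict[str, Any]) -> None:
    cols = ", ".join(data)
    marks = ", ".join("?" for _ in data)
    conn.execute(f"INSERT INTO {_table(table)} ({cols}) VALUES ({marks})",
                 list(data.values()))


def delete_rows(conn: sqlite3.Connection, table: str, where: Dict[str, Any]) -> int:
    clause, params = _where(where)
    return conn.execute(f"DELETE FROM {_table(table)}{clause}", params).rowcount


def update_rows(conn: sqlite3.Connection, table: str,
                changes: Dict[str, Any], where: Dict[str, Any]) -> int:
    sets = ", ".join(f"{col} = ?" for col in changes)
    clause, params = _where(where)
    cur = conn.execute(f"UPDATE {_table(table)} SET {sets}{clause}",
                       list(changes.values()) + params)
    return cur.rowcount


def select_rows(conn: sqlite3.Connection, table: str,
                where: Dict[str, Any]) -> List[Tuple]:
    clause, params = _where(where)
    return conn.execute(f"SELECT * FROM {_table(table)}{clause}", params).fetchall()
```

Each function takes the target table and the data or conditions as explicit parameters, which reflects the point above that an interface such as "add" must be told both which table to write to and what to write.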

The search interface submodule comprises a user search condition acceptance unit, a user search result return unit, a search condition processing unit and a search execution unit. The user search condition acceptance unit is connected to the search condition processing unit to transmit the user's original search conditions; the search condition processing unit is connected to the search execution unit to transmit the search instructions produced by system processing; the search execution unit is connected to the database operation submodule to transmit database operation information; and the user search result return unit is connected to the database operation submodule to transmit the data returned by database operations.

This embodiment offers four search interfaces: (1) Search by website: the search interface passes the website name to the database operation submodule, which retrieves and returns the matching video data. (2) Search by category: the search interface passes the category name to the database operation submodule, which retrieves and returns the matching video data. (3) Search by keyword: the search interface passes the keyword to the database operation submodule, which retrieves and returns the matching video data. (4) Advanced search: the website name, category name, keyword, play count and number of results to return are combined as search conditions, which yields more precise results.
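The advanced search can be pictured as a query builder in which every condition is optional, so the three simple searches fall out as special cases. The Python sketch below rests on several assumptions: a single table named video with site, category, title and play_count columns (the embodiment instead uses one table per site and one view per category), substring matching for keywords, and the play count condition read as a minimum threshold.

```python
from typing import Any, List, Optional, Tuple


def build_search_query(site: Optional[str] = None,
                       category: Optional[str] = None,
                       keyword: Optional[str] = None,
                       min_plays: Optional[int] = None,
                       limit: Optional[int] = None) -> Tuple[str, List[Any]]:
    """Combine whichever conditions were supplied into one parameterized query."""
    clauses: List[str] = []
    params: List[Any] = []
    if site is not None:
        clauses.append("site = ?")
        params.append(site)
    if category is not None:
        clauses.append("category = ?")
        params.append(category)
    if keyword is not None:
        clauses.append("title LIKE ?")
        params.append(f"%{keyword}%")
    if min_plays is not None:
        clauses.append("play_count >= ?")
        params.append(min_plays)
    sql = "SELECT title, play_url FROM video"
    if clauses:
        sql += " WHERE " + " AND ".join(clauses)
    sql += " ORDER BY play_count DESC"
    if limit is not None:
        sql += " LIMIT ?"
        params.append(limit)
    return sql, params
```

Calling build_search_query(site="youku") corresponds to the search by website, while supplying several arguments at once corresponds to the advanced search.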

The network video hot word analysis submodule comprises a video title extraction unit, a video title word segmentation unit, a title vocabulary clustering unit, a title vocabulary statistics unit and a hot vocabulary update unit. The video title extraction unit is connected to the video title word segmentation unit to transmit the set of video titles to be analyzed; the video title word segmentation unit is connected to the title vocabulary clustering unit to transmit the words that make up each title; the title vocabulary clustering unit is connected to the title vocabulary statistics unit to transmit the clustered vocabulary space; the title vocabulary statistics unit is connected to the hot vocabulary update unit to transmit the set of words that occur most frequently; the hot vocabulary update unit is connected to the database operation submodule to transmit database commands that add data; and the video title extraction unit is connected to the database operation submodule to transmit the database statements that fetch video titles.

In this embodiment, the title of each video is treated as a vector of words, and the words that occur most often are then identified. Hot words change over time, so the process is run at regular intervals to keep the hot word ranking up to date. Through statistical analysis of video hot words, the system can monitor what network video viewers are focusing on within a given period. The network video hot word analysis submodule uses the database operation submodule to retrieve the titles of the videos with the highest play counts in the network video database and then analyzes those titles. The process is as follows: (1) retrieve from the database the videos with the highest click counts (the number is configurable); (2) segment the titles of these videos into words, so that each title becomes a word vector; (3) cluster the title word vectors; (4) within each cluster, determine the words with the highest frequency of occurrence; (5) delete high-frequency words that carry no useful meaning; (6) update the hot word list. Hot word analysis runs at a fixed interval, and the network video hot word ranking in the system is updated after each run.
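The six steps above can be sketched in a few lines. This is only an illustration under stated assumptions: the patent names neither the segmentation nor the clustering algorithm, so whitespace splitting stands in for a real word segmenter, a greedy Jaccard-overlap grouping stands in for the clustering step, and `STOP_WORDS`, `threshold` and `per_cluster` are hypothetical.

```python
from collections import Counter

# Hypothetical list of high-frequency words that carry no useful meaning.
STOP_WORDS = {"the", "a", "of", "in", "and", "video"}

def segment(title):
    # Stand-in for a real word segmenter: lowercase and split on whitespace.
    return title.lower().split()

def jaccard(a, b):
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def cluster_titles(token_lists, threshold=0.2):
    """Greedy single pass: a title joins the first cluster it overlaps enough with."""
    clusters = []  # each cluster: {"vocab": set of words, "members": list of token lists}
    for tokens in token_lists:
        words = set(tokens) - STOP_WORDS
        for cluster in clusters:
            if jaccard(words, cluster["vocab"]) >= threshold:
                cluster["members"].append(tokens)
                cluster["vocab"] |= words
                break
        else:
            clusters.append({"vocab": set(words), "members": [tokens]})
    return clusters

def hot_words(videos, top_n=100, per_cluster=1):
    """videos is a list of (title, play_count) pairs; returns the hot word list."""
    # Step 1: keep the top_n most played videos.
    top = sorted(videos, key=lambda v: v[1], reverse=True)[:top_n]
    # Steps 2 and 3: segment the titles, then cluster them.
    clusters = cluster_titles([segment(title) for title, _ in top])
    result = []
    for cluster in clusters:
        # Steps 4 and 5: count words per cluster, skipping useless words.
        counts = Counter(
            w for tokens in cluster["members"] for w in tokens
            if w not in STOP_WORDS
        )
        result.extend(word for word, _ in counts.most_common(per_cluster))
    # Step 6: the caller would store this list as the new hot word ranking.
    return result
```

For Chinese titles the `segment` function would need to be replaced by a proper word segmenter, since whitespace does not delimit words there.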

The user interest model management submodule builds an interest model for each user from the user's search behavior; when the user logs in to the system again, the system uses the user's interest model to recommend network videos that may interest that user. The submodule comprises: a user behavior representation unit, a user interest model update unit and a user interest recommendation unit, wherein: the user behavior representation unit is connected to the user interest model update unit to transmit information identifying the user's search behavior; the user interest model update unit is connected to the database operation submodule to transmit database operation statements that update the user's interests; and the user interest recommendation unit is connected to the database operation submodule to transmit database operation statements that retrieve the videos to be recommended to the user.

The user interest model management submodule works as follows: (1) the user connects to the platform page; (2) if the user has a search history, the platform uses the token held by the client to look up the user's interest model in the database and recommends network videos accordingly; (3) if the user has no search history, a new interest model is created for the user and a token is set in the client's cookie; (4) every video search by the user modifies the user's interest model on the server; (5) interest models that have not changed for a long time are deleted in order to limit the size of the database. In the database, the user interest model stores interest information as a tree. The model has two top-level classes, video website and video category, and each class has subclasses, for example Youku under video website or sports under video category. Each subclass stores a weight derived from the user's search behavior, and recommended videos are selected according to these weights.
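The tree-shaped interest model and its life cycle can be sketched as nested dictionaries. The patent does not say how weights are computed, so the increment of 1 per search, the `updated` timestamp field, and every function name here are assumptions made for illustration.

```python
import time

def new_model(now=None):
    """Two top-level branches, each mapping a subclass name to a weight."""
    return {
        "site": {},
        "category": {},
        "updated": now if now is not None else time.time(),
    }

def get_or_create(models, token, now=None):
    """Look up the model for a client token, creating one for a new user."""
    if token not in models:
        models[token] = new_model(now)
    return models[token]

def record_search(model, site=None, category=None, now=None):
    """Assumption: each search adds 1 to the weight of the subclasses it touches."""
    if site:
        model["site"][site] = model["site"].get(site, 0) + 1
    if category:
        model["category"][category] = model["category"].get(category, 0) + 1
    model["updated"] = now if now is not None else time.time()

def top_interest(model, branch):
    """Return the highest-weighted subclass in a branch, or None if it is empty."""
    weights = model[branch]
    return max(weights, key=weights.get) if weights else None

def purge_stale(models, max_age_seconds, now=None):
    """Delete models that have not changed for longer than max_age_seconds."""
    now = now if now is not None else time.time()
    stale = [t for t, m in models.items() if now - m["updated"] > max_age_seconds]
    for token in stale:
        del models[token]
    return stale
```

A recommendation step would then query the video database for the subclasses returned by `top_interest`, falling back to the most popular videos when it returns `None`.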

The network video address detection submodule detects the remote address of a network video file so that users can download the video. It comprises: a packet acquisition unit, a packet content analysis unit and a file address extraction unit, wherein: the packet acquisition unit is connected to the packet content analysis unit to transmit the captured HTTP packets; the packet content analysis unit is connected to the file address extraction unit to transmit the HTTP packets that contain the file address; and the file address extraction unit is connected to the database operation submodule to transmit database update statements that update the video information.

The network video address detection submodule works as follows: it obtains the address of the video's playback page and opens that page, while a WinPcap-based packet capture program captures the HTTP request packets sent from the local machine to the video server; analyzing the request content in the packet headers yields the remote address of the network video file. The submodule obtains the playback address of the video through the database operation submodule and, once processing is complete, stores the file address in the corresponding table through the same submodule.
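The header analysis step can be sketched as follows, assuming the raw bytes of one captured HTTP request are already available; the capture itself (done with WinPcap in the embodiment) is not shown. The extension list and the function name are hypothetical, and real video sites may serve files from URLs that this simple suffix check would miss.

```python
# Hypothetical list of file extensions treated as video files.
VIDEO_EXTENSIONS = (".flv", ".mp4", ".f4v")

def extract_video_url(raw_request: bytes):
    """Return the absolute URL of a video file requested by a GET, else None."""
    # Keep only the header section, which ends at the first blank line.
    head = raw_request.split(b"\r\n\r\n", 1)[0].decode("iso-8859-1")
    lines = head.split("\r\n")
    parts = lines[0].split(" ")
    if len(parts) != 3 or parts[0] != "GET":
        return None
    target = parts[1]
    # Collect header fields into a dict keyed by lowercase field name.
    headers = {}
    for line in lines[1:]:
        name, sep, value = line.partition(":")
        if sep:
            headers[name.strip().lower()] = value.strip()
    # Ignore the query string when checking the file extension.
    path = target.split("?", 1)[0]
    if not path.lower().endswith(VIDEO_EXTENSIONS):
        return None
    if target.startswith("http://"):
        return target  # absolute-form request target, as sent to a proxy
    host = headers.get("host")
    return f"http://{host}{target}" if host else None
```

The returned address is what the file address extraction unit would hand to the database operation submodule for storage.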

The system configuration management submodule comprises: a network video crawler configuration unit, a network video address detection configuration unit, a user interest model management configuration unit, a network video hot word analysis configuration unit and a stale data deletion unit, wherein: the network video crawler configuration unit is connected to the data storage module to transmit changes to the crawler run configuration; the user interest model management configuration unit is connected to the user interest model management submodule to transmit module run configuration information; the network video hot word analysis configuration unit is connected to the network video hot word analysis submodule to transmit module run configuration information; the network video address detection configuration unit is connected to the network video address detection submodule to transmit module run configuration information; and the stale data deletion unit is connected to the database operation submodule to transmit database operation statements that delete video information.

In this embodiment, an XML file records the configuration parameters of each module. For the multi-level search interface submodule, the websites and categories for which search interfaces are offered can be configured. For the network video address detection submodule, it can be configured whether the submodule runs or is stopped and which videos have their addresses detected first. For the network video hot word analysis submodule, it can be configured whether the submodule runs or is stopped and how often it runs. For the user interest model management submodule, it can be configured whether the submodule is enabled and after how long models are deleted. For the network video crawler submodule, it can be configured which crawlers are started and which are stopped, as well as the crawl start time and the crawl interval. The system configuration management submodule writes the changes requested by the system administrator to the file. Each submodule runs as a background service of the operating system; once the configuration changes, the system sends a message to the corresponding service through service management, and the service restarts with the new configuration parameters.
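A configuration file of this kind might look like the sample below, read here with Python's standard `xml.etree.ElementTree`. The patent does not publish its XML layout, so every element name, attribute name and value in `SAMPLE_CONFIG` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical layout: one element per submodule, with settings as attributes.
SAMPLE_CONFIG = """
<config>
  <hotword enabled="true" interval_hours="6"/>
  <address_detection enabled="false"/>
  <interest_model enabled="true" delete_after_days="90"/>
  <crawlers start="02:00" interval_hours="24">
    <crawler site="Youku" enabled="true"/>
    <crawler site="Tudou" enabled="false"/>
  </crawlers>
</config>
"""

def is_enabled(element):
    return element.get("enabled") == "true"

def load_config(xml_text):
    """Parse the configuration text into a flat dictionary of settings."""
    root = ET.fromstring(xml_text.strip())
    hot = root.find("hotword")
    interest = root.find("interest_model")
    crawlers = root.find("crawlers")
    return {
        "hotword_enabled": is_enabled(hot),
        "hotword_interval_hours": int(hot.get("interval_hours")),
        "address_detection_enabled": is_enabled(root.find("address_detection")),
        "interest_enabled": is_enabled(interest),
        "interest_delete_after_days": int(interest.get("delete_after_days")),
        "crawl_start": crawlers.get("start"),
        "crawl_interval_hours": int(crawlers.get("interval_hours")),
        "enabled_crawlers": [
            c.get("site") for c in crawlers.findall("crawler") if is_enabled(c)
        ],
    }
```

Each background service would reload its own part of this dictionary when it receives the restart message described above.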

In this embodiment, when the system has an interest model for the current user, it retrieves video data from the database submodule according to that model and passes the data to the user interface module, which displays it on a web page, so that the user can see which network videos are currently popular. If the system has no interest model for the user, it retrieves the most popular network video data from the database submodule and passes it to the user interface module for display on a web page.

The system of this embodiment was used to crawl five major Chinese video websites: (1) Youku; (2) Tudou; (3) Ku6; (4) Liujianfang; (5) 56.com. The crawl collected 5625 videos from Youku, 3403 from Tudou, 1495 from Ku6, 2355 from Liujianfang and 3320 from 56.com. The crawl focused on four categories of video: (1) news; (2) sports; (3) animation; (4) entertainment, with 3309 news videos, 3160 sports videos, 2696 animation videos and 3173 entertainment videos. Table 1 shows the number of videos returned and the time consumed under different search interfaces and search parameters. Because the relevant indexes and views are built in advance in this embodiment, searching by website or by category has low time complexity and takes very little time, and videos that meet the conditions can be searched and downloaded accurately.

Table 1

Search interface    Search parameters                                           Videos found    Time consumed (s)
Source website      Youku                                                       400             0.09
Source website      Tudou                                                       400             0.06
Source website      Liujianfang                                                 400             0.07
Source website      Ku6                                                         400             0.10
Source website      56.com                                                      400             0.07
Video category      All                                                         500             0.13
Video category      News                                                        500             0.17
Video category      Sports                                                      500             0.11
Video category      Entertainment                                               500             0.18
Video category      Cartoon                                                     500             0.15
Keyword search      "NBA"                                                       153             0.08
Advanced search     Youku + Sports + NBA + play count 200 + return count 200    22              0.09

Claims (8)

1. A network video search system, characterized by comprising: a data storage module, a service processing module and a user interface module, wherein: the data storage module is connected to the service processing module to transmit data information and processing information, and the service processing module is connected to the user interface module to transmit search request information and search result information;
the data storage module acquires and stores network video data and comprises: a network video crawler submodule and a database submodule, wherein: the network video crawler submodule is connected to the database submodule to transmit network video data information, the network video crawler submodule is connected to the service processing module to transmit run configuration information, and the database submodule is connected to the service processing module to transmit database operation request information and database operation return data;
the service processing module comprises: a search interface submodule, a database operation submodule, a network video hot word analysis submodule, a user interest model management submodule, a network video address detection submodule and a system configuration management submodule, wherein: the search interface submodule is connected to the user interface module to transmit search request information and search result information; the database operation submodule is connected to the search interface submodule to transmit search condition information and search return information; the database operation submodule is connected to the network video hot word analysis submodule to transmit database operation information and hot word analysis result information; the database operation submodule is connected to the user interest model management submodule to transmit database operation information and user interest model update information; the database operation submodule is connected to the network video address detection submodule to transmit database operation information and network video address information; the system configuration management submodule is connected to the user interest model management submodule to transmit module run configuration information; the system configuration management submodule is connected to the network video hot word analysis submodule to transmit run configuration information; the system configuration management submodule is connected to the network video address detection submodule to transmit run configuration information; the system configuration management submodule is connected to the database operation submodule to transmit run configuration information; the system configuration management submodule is connected to the network video crawler submodule to transmit run configuration information; and the database operation submodule is connected to the data storage module to transmit database storage information.
2. The network video search system according to claim 1, characterized in that the network video crawler submodule comprises: a page download unit, a page content analysis unit and a video information extraction unit, wherein: the page download unit is connected to the page content analysis unit to transmit the page data of the video websites to be crawled; the page content analysis unit is connected to the video information extraction unit to transmit the relevant video information found in the analyzed pages; and the video information extraction unit is connected to the database submodule to transmit the precise video information extracted from the pages.
3. The network video search system according to claim 1, characterized in that the database operation submodule comprises: a database data retrieval unit, a database data addition unit, a database data deletion unit, a database data update unit and a database view update unit, wherein: the database data retrieval unit is connected to the data storage module to transmit database retrieval statements and returned result data; the database data addition unit is connected to the data storage module to transmit database commands for adding data; the database data deletion unit is connected to the data storage module to transmit database commands for deleting data; the database data update unit is connected to the data storage module to transmit database commands for updating data; and the database view update unit is connected to the data storage module to transmit database commands for updating views.
4. The network video search system according to claim 1, characterized in that the search interface submodule comprises: a user search condition receiving unit, a user search result return unit, a search condition processing unit and a search execution unit, wherein: the user search condition receiving unit is connected to the search condition processing unit to transmit the user's original search condition information; the search condition processing unit is connected to the search execution unit to transmit the search instruction information produced by system processing; the search execution unit is connected to the database operation submodule to transmit database operation information; and the user search result return unit is connected to the database operation submodule to transmit database operation return data.
5. The network video search system according to claim 1, characterized in that the network video hot word analysis submodule comprises: a video title extraction unit, a video title word segmentation unit, a title vocabulary clustering unit, a title vocabulary statistics unit and a hot word update unit, wherein: the video title extraction unit is connected to the video title word segmentation unit to transmit the set of video titles to be analyzed; the video title word segmentation unit is connected to the title vocabulary clustering unit to transmit the words that make up the video titles; the title vocabulary clustering unit is connected to the title vocabulary statistics unit to transmit the clustered vocabulary space; the title vocabulary statistics unit is connected to the hot word update unit to transmit the set of words with a high occurrence rate; the hot word update unit is connected to the database operation submodule to transmit database operation commands for adding data; and the video title extraction unit is connected to the database operation submodule to transmit database operation statements for obtaining video titles.
6. The network video search system according to claim 1, characterized in that the user interest model management submodule comprises: a user behavior representation unit, a user interest model update unit and a user interest recommendation unit, wherein: the user behavior representation unit is connected to the user interest model update unit to transmit information identifying the user's search behavior; the user interest model update unit is connected to the database operation submodule to transmit database operation statements that update the user's interests; and the user interest recommendation unit is connected to the database operation submodule to transmit database operation statements that retrieve the videos to be recommended to the user.
7. The network video search system according to claim 1, characterized in that the network video address detection submodule comprises: a packet acquisition unit, a packet content analysis unit and a file address extraction unit, wherein: the packet acquisition unit is connected to the packet content analysis unit to transmit the captured HTTP packets; the packet content analysis unit is connected to the file address extraction unit to transmit the HTTP packets that contain the file address; and the file address extraction unit is connected to the database operation submodule to transmit database update statements that update the video information.
8. The network video search system according to claim 1, characterized in that the system configuration management submodule comprises: a network video crawler configuration unit, a network video address detection configuration unit, a user interest model management configuration unit, a network video hot word analysis configuration unit and a stale data deletion unit, wherein: the network video crawler configuration unit is connected to the data storage module to transmit changes to the crawler run configuration; the user interest model management configuration unit is connected to the user interest model management submodule to transmit module run configuration information; the network video hot word analysis configuration unit is connected to the network video hot word analysis submodule to transmit module run configuration information; the network video address detection configuration unit is connected to the network video address detection submodule to transmit module run configuration information; and the stale data deletion unit is connected to the database operation submodule to transmit database operation statements that delete video information.
CN 201010186145 2010-05-28 2010-05-28 Internet Video Search System Pending CN101833587A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201010186145 CN101833587A (en) 2010-05-28 2010-05-28 Internet Video Search System

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201010186145 CN101833587A (en) 2010-05-28 2010-05-28 Internet Video Search System

Publications (1)

Publication Number Publication Date
CN101833587A true CN101833587A (en) 2010-09-15

Family

ID=42717656

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201010186145 Pending CN101833587A (en) 2010-05-28 2010-05-28 Internet Video Search System

Country Status (1)

Country Link
CN (1) CN101833587A (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102625158A (en) * 2011-08-10 2012-08-01 苏州闻道网络科技有限公司 Video management system
CN102630049A (en) * 2011-12-31 2012-08-08 上海聚力传媒技术有限公司 Method for determining interest degree of user about playing video and equipment thereof
CN102760058A (en) * 2012-04-05 2012-10-31 中国人民解放军国防科学技术大学 Massive software project sharing method oriented to large-scale collaborative development
CN103020212A (en) * 2012-12-07 2013-04-03 合一网络技术(北京)有限公司 A method and device for discovering hot videos in real time based on user query logs
CN103179441A (en) * 2011-12-21 2013-06-26 腾讯科技(深圳)有限公司 Method and server for playing contents
CN103186539A (en) * 2011-12-27 2013-07-03 阿里巴巴集团控股有限公司 Method and system for confirming user groups, inquiring information and recommending
CN103501470A (en) * 2013-10-17 2014-01-08 珠海迈科电子科技有限公司 Network data screening method and device
CN103605773A (en) * 2013-11-27 2014-02-26 乐视网信息技术(北京)股份有限公司 Multimedia file searching method and device
CN103699661A (en) * 2013-12-26 2014-04-02 乐视网信息技术(北京)股份有限公司 Method and system for acquiring data of video resources
CN104980770A (en) * 2014-04-09 2015-10-14 杭州迪普科技有限公司 Method and device for downloading video data contents
CN105025369A (en) * 2015-06-30 2015-11-04 北京奇艺世纪科技有限公司 Method and device for determining recommended resources in channel combination
CN105893559A (en) * 2016-03-31 2016-08-24 北京奇艺世纪科技有限公司 Data pushing method and device
CN106453348A (en) * 2016-10-31 2017-02-22 南京邮电大学 Login authentication method based on user interest in social network
CN108399223A (en) * 2018-02-12 2018-08-14 北京奇艺世纪科技有限公司 A kind of data capture method, device and electronic equipment
CN109951739A (en) * 2019-03-27 2019-06-28 北京市博汇科技股份有限公司 Video traffic processing method, device and electronic equipment
CN113297450A (en) * 2021-05-24 2021-08-24 华北科技学院(中国煤矿安全技术培训中心) Crawler method, system, medium and electronic device based on fuzzy comprehensive evaluation method

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101021852A (en) * 2006-10-10 2007-08-22 鲍东山 Video search dispatching system based on content

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101021852A (en) * 2006-10-10 2007-08-22 鲍东山 Video search dispatching system based on content

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
《中国优秀硕士学位论文全文数据库》 20080531 任严 基于网络视频业务的搜索引擎的设计与实现 第15页,33-43页 1-8 , 2 *
《中国科技信息》 20070630 任严等 基于网络视频的搜索引擎的设计与实现 第120-121页 1-8 , 第11期 2 *
《信息技术》 20060731 刘春祥等 基于MVC模式的网络视频检索系统设计与实现 第7-10,第37页 1-8 , 第7期 2 *
《计算机工程与应用》 20050331 费洪晓等 基于词频统计的中文分词的研究 第67-68,100页 1-8 , 第7期 2 *

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102625158A (en) * 2011-08-10 2012-08-01 苏州闻道网络科技有限公司 Video management system
CN103179441A (en) * 2011-12-21 2013-06-26 腾讯科技(深圳)有限公司 Method and server for playing contents
US9400831B2 (en) 2011-12-27 2016-07-26 Alibaba Group Holding Limited Providing information recommendations based on determined user groups
CN103186539A (en) * 2011-12-27 2013-07-03 阿里巴巴集团控股有限公司 Method and system for confirming user groups, inquiring information and recommending
CN103186539B (en) * 2011-12-27 2016-07-27 阿里巴巴集团控股有限公司 A kind of method and system determining user group, information inquiry and recommendation
CN102630049A (en) * 2011-12-31 2012-08-08 上海聚力传媒技术有限公司 Method for determining interest degree of user about playing video and equipment thereof
CN102630049B (en) * 2011-12-31 2014-12-10 上海聚力传媒技术有限公司 Method for determining interest degree of user about playing video and equipment thereof
CN102760058A (en) * 2012-04-05 2012-10-31 中国人民解放军国防科学技术大学 Massive software project sharing method oriented to large-scale collaborative development
CN102760058B (en) * 2012-04-05 2015-03-11 中国人民解放军国防科学技术大学 Massive software project sharing method oriented to large-scale collaborative development
CN103020212A (en) * 2012-12-07 2013-04-03 合一网络技术(北京)有限公司 A method and device for discovering hot videos in real time based on user query logs
CN106909638A (en) * 2012-12-07 2017-06-30 合网络技术(北京)有限公司 A kind of method and apparatus for finding hot video in real time based on user's inquiry log
CN103020212B (en) * 2012-12-07 2017-05-10 合一网络技术(北京)有限公司 A method and device for discovering hot videos in real time based on user query logs
CN103501470A (en) * 2013-10-17 2014-01-08 珠海迈科电子科技有限公司 Network data screening method and device
CN103605773A (en) * 2013-11-27 2014-02-26 乐视网信息技术(北京)股份有限公司 Multimedia file searching method and device
CN103699661A (en) * 2013-12-26 2014-04-02 乐视网信息技术(北京)股份有限公司 Method and system for acquiring data of video resources
CN104980770A (en) * 2014-04-09 2015-10-14 杭州迪普科技有限公司 Method and device for downloading video data contents
CN105025369A (en) * 2015-06-30 2015-11-04 北京奇艺世纪科技有限公司 Method and device for determining recommended resources in channel combination
CN105025369B (en) * 2015-06-30 2018-07-17 北京奇艺世纪科技有限公司 Recommend the method and device of resource in a kind of determining combiner channel
CN105893559A (en) * 2016-03-31 2016-08-24 北京奇艺世纪科技有限公司 Data pushing method and device
CN106453348A (en) * 2016-10-31 2017-02-22 南京邮电大学 Login authentication method based on user interest in social network
CN106453348B (en) * 2016-10-31 2019-11-15 南京邮电大学 Login authentication method based on user interest in social network
CN108399223A (en) * 2018-02-12 2018-08-14 北京奇艺世纪科技有限公司 A kind of data capture method, device and electronic equipment
CN109951739A (en) * 2019-03-27 2019-06-28 北京市博汇科技股份有限公司 Video traffic processing method, device and electronic equipment
CN109951739B (en) * 2019-03-27 2021-06-08 北京市博汇科技股份有限公司 Video service processing method and device and electronic equipment
CN113297450A (en) * 2021-05-24 2021-08-24 华北科技学院(中国煤矿安全技术培训中心) Crawler method, system, medium and electronic device based on fuzzy comprehensive evaluation method

Similar Documents

Publication Publication Date Title
CN101833587A (en) Internet Video Search System
US8626768B2 (en) Automated discovery aggregation and organization of subject area discussions
US9361385B2 (en) Generating content for topics based on user demand
US7870279B2 (en) Method and apparatus for scanning, personalizing, and casting multimedia data streams via a communication network and television
US9355185B2 (en) Infinite browse
US10180967B2 (en) Performing application searches
JP2021108183A (en) Method, apparatus, device and storage medium for intention recommendation
CN102708174B (en) Method and device for displaying rich media information in a browser
US8312022B2 (en) Search engine optimization
CN104254852B (en) Method and system for mixed information inquiry
US10198776B2 (en) System and method for delivering an open profile personalization system through social media based on profile data structures that contain interest nodes or channels
US20090100015A1 (en) Web-based workspace for enhancing internet search experience
US20140229487A1 (en) System and method for user preference augmentation through social network inner-circle knowledge discovery
US10078686B2 (en) Combination filter for search query suggestions
CN104090757A (en) Method and device for displaying rich media information in browser
CN104090923A (en) Method and device for displaying rich media information in browser
CN115630170B (en) Document recommendation method, system, terminal and storage medium
JP7462198B1 (en) Keyword collection method, information processing device, and program
CN118797166A (en) Information push method, system, device and storage medium
JP5384567B2 (en) Latent class analyzer, latent class analyzing method and program
CN100578502C (en) Browsing method and system of embedded browser
KR20090110764A (en) Meta Information-based Keyword Advertising Service Method Included in Multimedia Content Information and System for the Service
Cao et al. On application-unbiased benchmarking of web videos from a social network perspective
KR101696026B1 (en) Design trend information supplying method by open API
CN120929636A (en) Intelligent film watching recommendation system and method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20100915