CN108470296A - Business object information processing method and device - Google Patents

Business object information processing method and device

Info

Publication number
CN108470296A
Authority
CN
China
Prior art keywords
business
analyzed
business object
information
extraction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710099966.9A
Other languages
Chinese (zh)
Other versions
CN108470296B (en)
Inventor
李前雷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd
Priority to CN201710099966.9A
Publication of CN108470296A
Application granted
Publication of CN108470296B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/06 Buying, selling or leasing transactions
    • G06Q30/0601 Electronic shopping [e-shopping]
    • G06Q30/0641 Shopping interfaces
    • G06Q30/0643 Graphical representation of items or shoppers
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/903 Querying
    • G06F16/90335 Query processing

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Computational Linguistics (AREA)
  • Development Economics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

This application relates to the field of Internet technology, and in particular to a business object information processing method and device, for fully mining and using the content published by merchants without increasing the merchants' cost of filling in information. An embodiment of the present application provides a business object information processing method: obtaining a detail page content storage file of a business object to be analyzed from a service platform, the detail page content storage file including picture access addresses and text content; according to a preset business knowledge base, extracting business content that matches the business knowledge base from the pictures obtained based on the picture access addresses and from the text content, respectively; and associating the extracted business content with the business object to be analyzed and storing it.

Description

Business object information processing method and device
Technical field
This application relates to the field of Internet technology, and in particular to a business object information processing method and device.
Background
On an e-commerce platform, merchants publish a large amount of product detail page content. To make the page attractive and easy to lay out, a product detail page often contains pictures, and these pictures may carry information such as the departure place, itinerary, extra charge notes, local things to do, and attraction introductions.
To analyze product information, the product detail page content usually has to be organized, analyzed, and processed. For the pictures in a detail page, however, the information in a picture cannot be obtained directly. At present, making full use of that information requires the merchant, when publishing a picture, to also store the information in the picture in structured form. For example, when publishing the picture shown in FIG. 1, the merchant has to fill in the title "Beijing 5-day 4-night parent-child tour", the slogan "pure sightseeing with no shopping, no price increase during the Spring Festival", and the nearby hotel names "Marriott/Westin/Kempinski/Four Points by Sheraton". This clearly increases the merchant's cost of filling in detail page content and lowers publishing efficiency. But if the merchant does not fill in this structured data when publishing product information, some valuable product information in the pictures cannot be extracted and therefore cannot be put to use.
It can be seen that fully mining and using the content published by merchants, without increasing the merchants' cost of filling in information, is a direction that currently needs to be realized.
Summary of the invention
The embodiments of the present application provide a business object information processing method and device, for fully mining and using the business object information published by merchants without increasing the merchants' cost of filling in information.
An embodiment of the present application provides an information extraction method, including:
obtaining a detail page content storage file of a business object to be analyzed from a service platform, the detail page content storage file including picture access addresses and text content;
according to a preset business knowledge base, extracting business content that matches the business knowledge base from the pictures obtained based on the picture access addresses and from the text content, respectively;
associating the extracted business content with the business object to be analyzed and storing it.
Optionally, the business object is a travel product, the business knowledge base is a travel knowledge base, and the service item corresponding to the business object to be analyzed is the tourist attraction corresponding to the travel product to be analyzed.
Optionally, the travel knowledge base includes one or more of the following:
departure place information, destination information, travel activities, visiting tips, attraction introductions, shopping information, food information, and price information.
Optionally, associating the extracted business content with the business object to be analyzed and storing it includes:
establishing a mapping between the extracted business content and the identifying information of the travel product to be analyzed, and storing the established mapping;
wherein the identifying information of the travel product to be analyzed is one or more of the following:
tourist attraction information, title information, departure place information, destination information, and identification ID information preset for the travel product to be analyzed.
Optionally, after associating the extracted business content with the business object to be analyzed and storing it, the method further includes:
controlling the publication or recommendation of associated business objects based on the extracted business content;
wherein the associated business objects include the business object to be analyzed, or other business objects with the same service item as the business object to be analyzed, or both.
Optionally, controlling the publication of associated business objects based on the extracted business content includes:
detecting the legality of the business object to be analyzed based on the extracted business content and preset illegal keyword information, to confirm whether the business object to be analyzed may be published.
Optionally, controlling the recommendation of associated business objects based on the extracted business content includes:
on receiving a business object search request from a user, determining at least one business object that matches the business content keyword information carried in the search request, according to that keyword information and the previously extracted and stored business content of each business object; and recommending the determined at least one business object to the user.
Optionally, controlling the publication of associated business objects based on the extracted business content includes:
after extracting the business content of the business object to be analyzed, establishing a mapping between the extracted business content and the service item of the business object to be analyzed, and saving the mapping;
after obtaining a business object to be published, looking up, in the pre-stored mappings, the business content associated with the service item corresponding to the business object to be published;
according to the business content found, pushing to the user copywriting suggestions for publishing the business object.
Optionally, extracting business content that matches the business knowledge base from the pictures corresponding to the picture access addresses includes:
according to the business object identification information corresponding to each picture access address in the detail page content storage file, and the identification information of the business object to be analyzed, deleting from the picture access addresses in the detail page content storage file those that do not belong to the business object to be analyzed;
extracting business content that matches the business knowledge base from the pictures corresponding to the remaining picture access addresses.
An embodiment of the present application provides an information extraction device, including:
an acquisition module, configured to obtain a detail page content storage file of a business object to be analyzed from a service platform, the detail page content storage file including picture access addresses and text content;
an extraction module, configured to extract, according to a preset business knowledge base, business content that matches the business knowledge base from the pictures obtained based on the picture access addresses and from the text content, respectively;
a storage module, configured to associate the extracted business content with the business object to be analyzed and store it.
With the embodiments of the present application, the detail page content storage file of the business object to be analyzed can be obtained from the service platform, and the picture access addresses and text content extracted from it. Business content that matches the business knowledge base is then extracted from the pictures corresponding to the extracted picture access addresses and from the text content, respectively, and the extracted business content is associated with the business object to be analyzed and stored. Based on the stored business content, the publication and recommendation of associated business objects can later be controlled. The scheme of the present application thus saves the labor cost of filling in the product information contained in pictures, and allows the content published by merchants to be fully mined and used.
Brief description of the drawings
Fig. 1 is a flow chart of the business object information processing method provided by Embodiment one of the present application;
Fig. 2 is a schematic picture of the travel product "Zhaoqing, Guangdong 2-day group tour";
Fig. 3 is a flow chart of the business object information processing method provided by Embodiment two of the present application;
Fig. 4 (a) is a schematic picture of the travel product "Yioulai Suzhou Shopping Village";
Fig. 4 (b) is a schematic picture of the travel product "Beijing museum tour";
Fig. 4 (c) is a schematic picture of the travel product "Yanqi Lake one-day tour";
Fig. 5 is a structural diagram of the business object information processing device provided by the embodiments of the present application.
Detailed description
The embodiments of the present application are applied to automatically analyzing and mining the business object information published by service providers. When preparing data for analysis and mining, besides the data in the published text content, the textual information in the pictures of the business object to be analyzed is often more critical. How to fully mine and use business object information that service providers publish in various forms (such as business object information embedded in pictures) is what the present application sets out to explain.
The embodiments of the present application are described in further detail below with reference to the accompanying drawings.
Embodiment one
As shown in Fig. 1, the business object information processing method provided by Embodiment one of the present application includes the following steps:
S101: Obtain the detail page content storage file of the business object to be analyzed from the service platform; the detail page content storage file includes picture access addresses and text content.
The business object to be analyzed can be any object suitable for recommending or displaying to users, including various physical products, virtual products, and so on. As one application scenario, the business object to be analyzed here can be a travel product, such as a Beijing 5-day 4-night parent-child tour; correspondingly, the detail page content generally includes pictures of tourist attractions and text content introducing the corresponding travel product.
In a specific implementation, the path of the detail page content storage file of the business object to be analyzed can first be obtained from the catalog of the detail page information library. Then, according to the obtained content storage file path, the detail page content storage file is obtained from the detail page information library.
Here, the catalog of the detail page information library (such as a product library) holds the content storage file path of each business object detail page (such as a product detail page), and the detail page content storage file path of the business object to be analyzed can be obtained from the catalog based on the business object's identification information. A dedicated file access toolkit can then be used to obtain the detail page content storage file of the business object to be analyzed from the detail page information library, according to the obtained content storage file path.
After S101 is executed and the detail page content storage file is obtained, the picture access addresses and text content still need to be extracted from it.
Here, because picture files are relatively large, the content storage file generally holds only the access address of each picture, such as the picture's uniform resource locator (Uniform Resource Locator, URL). The content storage file obtained from the detail page information library is in HyperText Markup Language (HTML) format. The content of the storage file can first be parsed with HTML PARSER (an HTML parsing and analysis tool), that is, the picture access addresses are parsed out of the HTML-format content storage file. The picture files can then be obtained according to the picture access addresses. Here, the picture files can be obtained according to the picture URLs using HttpClient (a client programming toolkit).
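The parsing step described above can be sketched as follows. This is a minimal illustration only: it uses Python's standard `html.parser` module in place of the HTML PARSER tool named in the text, and the class name `DetailPageParser`, the function `parse_detail_page`, the sample HTML, and the `example.com` URLs are all hypothetical. Downloading the pictures and running OCR on them are left out.

```python
from html.parser import HTMLParser


class DetailPageParser(HTMLParser):
    """Collects <img> src URLs and visible text from detail-page HTML (illustrative)."""

    def __init__(self):
        super().__init__()
        self.image_urls = []
        self.text_parts = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.image_urls.append(src)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is not script or style content.
        if self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


def parse_detail_page(html):
    """Return (picture access addresses, text content) for one detail page."""
    parser = DetailPageParser()
    parser.feed(html)
    parser.close()
    return parser.image_urls, " ".join(parser.text_parts)


# Hypothetical detail page content, for illustration only.
sample_html = """
<html><body>
  <h1>Beijing 5-day 4-night parent-child tour</h1>
  <img src="https://example.com/img/101_1.jpg">
  <p>Departure: Shanghai. Destination: Beijing.</p>
  <img src="https://example.com/img/101_2.jpg">
</body></html>
"""
image_urls, text_content = parse_detail_page(sample_html)
```

The two return values correspond to the picture access addresses and the text content that the later steps work on.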
S102: According to the preset business knowledge base, extract business content that matches the business knowledge base from the pictures obtained based on the picture access addresses and from the text content, respectively.
When extracting business content from a picture, optical character recognition (Optical Character Recognition, OCR) can be used to extract the textual information in the picture.
Here, the preset business knowledge base contains business content keyword information suitable for data mining and analysis. The keyword information differs according to the specific business scenario. For example, for the travel product scenario, the business knowledge base can be a travel knowledge base that collects various travel keywords, such as departure place, itinerary, extra charge notes, local things to do, and attraction introductions. In this way, the departure place information, itinerary information, extra charge notes, information on local things to do, attraction introductions, and so on that correspond to these travel keywords can be extracted from the textual information of the picture.
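One way to picture the matching against the knowledge base is the sketch below. It assumes the OCR step has already produced plain text, models the travel knowledge base as a dictionary from a category to a list of keywords, and keeps every line that contains a keyword. The keyword lists, the sample text, and the function name `extract_business_content` are hypothetical; the source does not specify how matching is performed.

```python
# Hypothetical travel knowledge base: category -> lowercase keywords.
TRAVEL_KNOWLEDGE_BASE = {
    "departure": ["departure", "departs from"],
    "itinerary": ["itinerary", "day 1", "day 2"],
    "extra_fees": ["extra fee", "surcharge"],
    "attractions": ["scenic spot", "attraction"],
}


def extract_business_content(text, knowledge_base):
    """Return {category: [matching lines]} for lines containing a keyword."""
    matches = {}
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        lowered = stripped.lower()
        for category, keywords in knowledge_base.items():
            if any(keyword in lowered for keyword in keywords):
                matches.setdefault(category, []).append(stripped)
    return matches


# Text as it might come back from OCR on a product picture (made up).
ocr_text = """Beijing 5-day 4-night parent-child tour
Departure: Shanghai
Day 1: Forbidden City and Tiananmen Square
Surcharge: single-room supplement applies
"""
extracted = extract_business_content(ocr_text, TRAVEL_KNOWLEDGE_BASE)
```

Plain substring matching is used here only to keep the sketch short; a real knowledge base would likely need word segmentation or fuzzy matching to cope with OCR errors.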
S103: Associate the extracted business content with the business object to be analyzed and store it.
Here, after the business content that matches the business knowledge base is extracted, the matched business content can be stored by product dimension, so that data analysis and mining can be performed on the matched data and follow-up business processing can be carried out based on the results. Specifically, for a travel product, a mapping can be established between the extracted business content and the identifying information of the travel product to be analyzed, and the established mapping stored. The identifying information of the travel product to be analyzed is one or more of the following: tourist attraction information, title information, departure place information, destination information, and identification ID information preset for the travel product to be analyzed.
For example, for the product "Beijing 5-day 4-night parent-child tour", the extracted detail page content includes pictures of tourist attractions and text content introducing the travel product. Both the pictures and the text content contain the title information (the name of the travel product) "Beijing 5-day 4-night parent-child tour" and the tourist attraction information "Beijing", and the text content also contains the departure place "Shanghai" and the destination "Beijing". When the mapping is established, the title "Beijing 5-day 4-night parent-child tour" can serve as the identifying information of the travel product, or the tourist attraction "Beijing" can, or the departure place and destination pair ("Shanghai" to "Beijing") can; alternatively, an identification ID set for the travel product by the system (for example, the number 101) can serve as the identifying information. In this way, a mapping can be set up between the identifying information of the travel product and the other travel information extracted from the pictures and the text content, and then stored. The identifying information can serve as the key information for retrieving and recommending the business object to be analyzed. For example, when a user searches for Beijing tourism, the travel products whose tourist attraction or tourist city is Beijing are found from the stored tourist attraction information and recommended to the user.
In general, the product detail page content on a travel e-commerce platform is published by merchants. To make the page attractive and easy to lay out, merchants often place a large number of pictures in the product detail page, and these pictures can carry a great deal of useful information, such as the departure place, itinerary, extra charge notes, local things to do, and attraction introductions. With the method provided by the present application, key textual information can be extracted from this detail page content, especially from the picture content, and associated with the corresponding product to form tagged data for the product. Structured data is mined automatically, meeting the business need for structured attribute information in every dimension. For big data applications, this is a most basic and essential step.
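The mapping between identifying information and extracted content might be kept in a structure like the one sketched below. This is an in-memory illustration under assumed names (`ProductRecord`, `ContentStore`, `find_by_attraction`); the source does not say how the mapping is stored, and a real system would presumably use a database.

```python
from dataclasses import dataclass, field


@dataclass
class ProductRecord:
    """Identifying information plus extracted content for one travel product."""
    product_id: int
    title: str
    attraction: str
    departure: str
    destination: str
    content: dict = field(default_factory=dict)  # extracted business content


class ContentStore:
    """Hypothetical in-memory store, indexed by product ID and by attraction."""

    def __init__(self):
        self._by_id = {}
        self._by_attraction = {}

    def save(self, record):
        self._by_id[record.product_id] = record
        ids = self._by_attraction.setdefault(record.attraction.lower(), [])
        if record.product_id not in ids:
            ids.append(record.product_id)

    def find_by_attraction(self, attraction):
        ids = self._by_attraction.get(attraction.lower(), [])
        return [self._by_id[product_id] for product_id in ids]


store = ContentStore()
store.save(ProductRecord(
    product_id=101,
    title="Beijing 5-day 4-night parent-child tour",
    attraction="Beijing",
    departure="Shanghai",
    destination="Beijing",
    content={"slogan": "pure sightseeing with no shopping"},
))
beijing_products = store.find_by_attraction("beijing")
```

Indexing by attraction mirrors the example in the text, where a search for Beijing tourism retrieves the products whose attraction is Beijing.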
In a specific application embodiment, optionally, after the extracted business content is associated with the business object to be analyzed and stored, the method can further include S104 (not shown): controlling the publication or recommendation of associated business objects based on the extracted business content.
Here, the associated business objects include the business object to be analyzed, or other business objects with the same service item as the business object to be analyzed, or both.
In one application mode, controlling the publication of associated business objects based on the extracted business content includes:
detecting the legality of the business object to be analyzed based on the extracted business content and preset illegal keyword information, to confirm whether the business object to be analyzed may be published.
In this application mode, the extracted business content is checked for illegal keyword information, for example a travel product whose picture states information related to forced shopping in a very small font. If such information is found, it is confirmed that the business object to be analyzed may not be published; publication of the business object can then be refused, or an already published business object can be forcibly taken offline.
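The legality check can be sketched as a scan of the extracted text against a list of illegal keywords. The keyword list and the function name `check_publishable` are hypothetical; the source states only that preset illegal keyword information is used.

```python
# Hypothetical illegal keyword list, for illustration only.
ILLEGAL_KEYWORDS = ["forced shopping", "mandatory shopping"]


def check_publishable(extracted_text, illegal_keywords):
    """Return (publishable, hits): hits lists the illegal keywords found."""
    lowered = extracted_text.lower()
    hits = [keyword for keyword in illegal_keywords if keyword in lowered]
    return len(hits) == 0, hits


ok, hits = check_publishable(
    "Day 2 includes a mandatory shopping stop at the jade market",
    ILLEGAL_KEYWORDS,
)
clean_ok, clean_hits = check_publishable(
    "Pure sightseeing with no shopping stops",
    ILLEGAL_KEYWORDS,
)
```

A caller could refuse publication, or take an already published product offline, whenever the first return value is `False`.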
In another application mode, controlling the recommendation of associated business objects based on the extracted business content includes:
on receiving a business object search request from a user, determining at least one business object that matches the business content keyword information carried in the search request, according to that keyword information and the previously extracted and stored business content of each business object; and recommending the determined at least one business object to the user.
In this mode, the extracted business content of the business object to be analyzed is stored as recommendation information. After a search request carrying business content keyword information is received from a user, the business content of the business objects that match the keyword information is determined from the stored recommendation information and recommended to the user. For example, for the travel product whose picture title is "Beijing 5-day 4-night parent-child tour", after the textual information in its picture files and its text content have been extracted and stored, if a search request carrying keyword information such as "Beijing 5-day tour" is received from a user, the related travel information of this travel product (such as the travel information in the pictures and in the text content) can be recommended to the user.
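A keyword-based lookup over the stored content could look like the sketch below. The stored content is modelled as a dictionary from product title to extracted text, and ranking results by the number of matched keywords is an added assumption; the source requires only that at least one matching business object be determined. The function name `recommend` and the sample data are hypothetical.

```python
def recommend(query_keywords, stored_content):
    """Return product titles matching at least one keyword, best match first."""
    scored = []
    for title, text in stored_content.items():
        haystack = (title + " " + text).lower()
        score = sum(1 for keyword in query_keywords if keyword.lower() in haystack)
        if score > 0:
            scored.append((score, title))
    # Higher score first; ties are broken alphabetically by title.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [title for _, title in scored]


# Previously extracted and stored content (made-up sample data).
stored = {
    "Beijing 5-day 4-night parent-child tour": "Departure: Shanghai. No shopping stops.",
    "Zhaoqing, Guangdong 2-day group tour": "Local food: stuffed glutinous rice dumplings.",
}
results = recommend(["Beijing", "5-day"], stored)
no_results = recommend(["Seoul"], stored)
```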
In yet another application mode, controlling the publication of associated business objects based on the extracted business content includes:
after extracting the business content of the business object to be analyzed, establishing a mapping between the extracted business content and the service item of the business object to be analyzed, and saving the mapping; after obtaining a business object to be published, looking up, in the pre-stored mappings, the business content associated with the service item corresponding to the business object to be published; and, according to the business content found, pushing to the user copywriting suggestions for publishing the business object.
This application mode is aimed at service provider users. For example, in the travel product scenario, the service item corresponding to the business object to be analyzed is the tourist attraction corresponding to the travel product to be analyzed. Here, by establishing a mapping between a tourist attraction and the copy (picture content and text content) of the corresponding travel products, when it is determined that a merchant is about to publish a travel product for a certain tourist attraction, related copywriting suggestions can be pushed to the merchant based on the stored copy of the travel products related to that attraction, for example suggestions on the layout of content within pictures and on the key information to show in pictures.
As shown in Fig. 2, for the travel product "Zhaoqing, Guangdong 2-day group tour", the scenic area food information "stuffed glutinous rice dumplings" can be extracted from the acquired picture file of the travel product, and the travel product title "Zhaoqing, Guangdong 2-day group tour" and the itinerary "Day 1: take a bus from Shenzhen to Zhaoqing and visit the Qixingyan scenic area, the Song Dynasty ancient city wall, and so on; Day 2: visit Dinghu Mountain, Qingyun Temple, the tripod garden, the Feishui Pool waterfall, and other attractions, then return to Shenzhen by bus" can be extracted from the text content of the travel product. In this way, a mapping is established between the travel product title "Zhaoqing, Guangdong 2-day group tour" and the scenic area food information, itinerary, and so on, and saved. Later, when other merchants need to publish travel product information related to Zhaoqing, Guangdong, copywriting suggestions can be pushed to them, for example a suggestion to show the travel product title, scenic area food information, and itinerary in pictures or text content, or a suggestion to show the related attractions and food information in sequence in pictures and to describe the itinerary in the text content.
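The copywriting-suggestion lookup can be illustrated with a small index keyed by service item (here, the tourist attraction). The class name `CopySuggestionIndex`, its methods, and the wording of the generated suggestions are hypothetical; the source does not describe how suggestions are phrased or ranked.

```python
from collections import defaultdict


class CopySuggestionIndex:
    """Hypothetical mapping from a service item to extracted copy content."""

    def __init__(self):
        self._content_by_item = defaultdict(list)

    def add(self, service_item, content_type, value):
        """Record one piece of extracted content for a service item."""
        self._content_by_item[service_item.lower()].append((content_type, value))

    def suggest(self, service_item):
        """Return suggestion strings for a product about to be published."""
        entries = self._content_by_item.get(service_item.lower(), [])
        return [f"Consider showing {content_type}: {value}"
                for content_type, value in entries]


index = CopySuggestionIndex()
index.add("Zhaoqing", "local food", "stuffed glutinous rice dumplings")
index.add("Zhaoqing", "itinerary", "Day 1 Qixingyan scenic area; Day 2 Dinghu Mountain")
suggestions = index.suggest("zhaoqing")
```

When another merchant prepares a Zhaoqing product, the list returned by `suggest` stands in for the pushed copywriting suggestions.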
With the embodiments of the present application, the detail page content storage file of the business object to be analyzed can be obtained from the service platform, and the picture access addresses and text content extracted from it. Business content that matches the business knowledge base is then extracted from the pictures corresponding to the extracted picture access addresses and from the text content, respectively, and the publication and recommendation of associated business objects are controlled based on the extracted business content. The scheme of the present application thus saves the labor cost of filling in the product information contained in pictures, and allows the content published by merchants to be fully mined and used.
In actual implementation, the picture URLs contained in the content storage file of a detail page may include pictures of other, unrelated business objects besides the business object currently being analyzed. To keep the real information of the business object from being interfered with, this useless information needs to be filtered out. Taking travel products as an example, see the description of Embodiment two for details.
Embodiment two
As shown in Fig. 3, the business object information processing method provided by Embodiment two of the present application includes:
S301: Obtain the detail page content storage file of the travel product to be analyzed from the service platform; the detail page content storage file includes picture access addresses and text content.
Here, based on the identification information corresponding to the travel product to be analyzed, the content storage file path of the detail page of the travel product to be analyzed is found in the catalog of the detail page information library. The file access toolkit is called to obtain the content storage file from the detail page information library according to the content storage file path. The text content and picture access addresses in the content storage file are then parsed out.
Here, because the file access tool returns an HTML file, the HTML analysis tool HTML PARSER can be used to parse the HTML file. Specifically, HTML PARSER can be used to parse out the text content and the picture URLs. When parsing the text content, HTML PARSER can first be used to extract all of the text content in the content storage file, and then S202 is executed to further extract, from the text content parsed by HTML PARSER, the travel information that matches the travel knowledge base. Alternatively, HTML PARSER can be used to directly parse out the travel information that matches the travel knowledge base.
S302: According to the travel product identification information corresponding to each picture access address in the detail page content storage file, and the identification information of the travel product to be analyzed, delete from the picture access addresses in the detail page content storage file those that do not belong to the travel product to be analyzed.
S303: According to the preset travel knowledge base, extract travel information that matches the travel knowledge base from the pictures corresponding to the remaining picture access addresses.
Here, pictures that do not belong to the travel product currently being analyzed are filtered out of the obtained picture URL queue, removing the interference of pictures of other products. HttpClient is then used to obtain the picture files according to the remaining URL or URLs. Next, OCR is used to extract the textual information in the picture files, and, according to the travel knowledge base, travel information such as attraction shopping, scenic area food, travel activities, and visiting tips is further filtered out of the extracted textual information.
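The filtering in S302 can be sketched as below. The sketch assumes that a mapping from each picture URL to the ID of the product it belongs to is already available; how that identification information is attached to each picture access address is not specified in the source, so the dictionary `url_to_product_id`, the function name, and the sample URLs are hypothetical.

```python
def filter_picture_urls(picture_urls, url_to_product_id, target_product_id):
    """Keep only the picture URLs that belong to the target product."""
    return [
        url for url in picture_urls
        if url_to_product_id.get(url) == target_product_id
    ]


# Hypothetical picture URL queue parsed from one detail page.
picture_urls = [
    "https://example.com/img/101_1.jpg",
    "https://example.com/img/205_banner.jpg",  # picture of another product
    "https://example.com/img/101_2.jpg",
]
# Hypothetical identification information for each picture access address.
url_to_product_id = {
    "https://example.com/img/101_1.jpg": 101,
    "https://example.com/img/205_banner.jpg": 205,
    "https://example.com/img/101_2.jpg": 101,
}
remaining = filter_picture_urls(picture_urls, url_to_product_id, 101)
```

URLs with no known product ID are dropped as well, which is a conservative choice made for this sketch rather than something the source prescribes.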
After obtaining the content storage file of a travel product's detail page, the server of the embodiment of the present application can obtain from it the image files and text content related to that travel product. For the sake of page aesthetics, the images that merchants place on a product detail page often contain a large amount of useful information, such as the departure place, itinerary, explanation of additional charges, local activities, and attraction introductions. The present application needs to extract this key textual information from the image content of the product detail page, as further illustrated below with reference to specific images.
As shown in Fig. 4(a), the departure place information "Beijing, Shanghai, Guangzhou, Shenzhen", the trip duration "4 days 3 nights", and the travel product title "Seoul 4-day 3-night independent tour" can be extracted from the obtained image file of the travel product. An association between the travel product title "Seoul 4-day 3-night independent tour" and the above departure places and trip duration is thereby established and saved.
As shown in Fig. 4(b), the departure place information "Hangzhou" and the travel product title "Vietnam Nha Trang air ticket" can be extracted from the obtained image file of the travel product, and price information such as "2490" can also be extracted from the text content of the travel product. An association between the travel product title "Vietnam Nha Trang air ticket" and the above departure place and price information is thereby established and saved.
As shown in Fig. 4(c), an image file about travel to Nha Trang contains information related to travel activities and additional charges; for example, visa and entry information and information about service tips can be extracted from the image. An association between the tourist city "Nha Trang" and the above travel information can thereby be established and saved.
S304: Associate the extracted travel information with the identifying information of the travel product to be analyzed, and store the association.
Here, a mapping relationship between the extracted travel information and the identifying information of the travel product to be analyzed may be established and stored. The identifying information of the travel product to be analyzed is one or more of the following: tourist attraction information, title information, departure place information, destination information, and ID information preset for the travel product to be analyzed.
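The mapping relationship of S304 can be pictured as a small keyed index. The following is a minimal in-memory sketch; the class name `TravelInfoStore`, its methods, and the sample keys are hypothetical, and a real system would presumably persist the mapping in a database rather than in a dictionary.

```python
from collections import defaultdict


class TravelInfoStore:
    """Maps identifying information of a product to extracted travel info."""

    def __init__(self):
        # (key_type, key_value) -> list of extracted travel information
        self._by_key = defaultdict(list)

    def associate(self, identifying_info, travel_info):
        """Index travel_info under every piece of identifying information."""
        for key_type, key_value in identifying_info.items():
            self._by_key[(key_type, key_value)].append(travel_info)

    def lookup(self, key_type, key_value):
        """Return all travel information stored under one identifying key."""
        return list(self._by_key.get((key_type, key_value), []))


store = TravelInfoStore()
store.associate(
    {"title": "Seoul 4-day 3-night independent tour", "product_id": "1001"},
    {"departure": ["Beijing", "Shanghai", "Guangzhou", "Shenzhen"],
     "duration": "4 days 3 nights"},
)
```

Indexing the same extracted record under several keys (title, attraction, product ID) lets later steps look it up by whichever identifier they have available.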
S305: Based on the extracted travel information, control the publication or recommendation of related travel products.
In a specific implementation, the legality of the travel product to be analyzed may be checked against the extracted travel information and preset illegal keyword information, to confirm whether the travel product to be analyzed can be published. Alternatively, when a travel product search request is received from a user, at least one travel product matching the travel content keyword information carried in the request may be determined from that keyword information and the stored, previously extracted travel information of each travel product, and the determined travel product or products are recommended to the user. As a further alternative, after the travel information of the travel product to be analyzed is extracted, a mapping relationship between the extracted travel information and the tourist attraction may be established; after a travel product to be published is obtained, the travel information associated with the tourist attraction corresponding to that product is looked up in the pre-stored mapping relationships, and copywriting suggestions for the publication are pushed to the user according to the travel information found.
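The three uses of the extracted information described for S305 can be sketched as three small functions. The function names, the plain substring matching, and the dictionary-based data layout are all assumptions made for illustration; the application does not specify how matching is performed.

```python
def is_publishable(extracted_text, illegal_keywords):
    """Legality check: reject if any preset illegal keyword appears."""
    return not any(kw in extracted_text for kw in illegal_keywords)


def recommend(query_keywords, extracted_by_product):
    """Search: return IDs of products whose extracted text matches a keyword.

    extracted_by_product: {product_id: extracted travel information text}
    """
    return [
        product_id
        for product_id, text in extracted_by_product.items()
        if any(kw in text for kw in query_keywords)
    ]


def copy_suggestions(attraction, info_by_attraction):
    """Copywriting aid: look up stored travel info for an attraction.

    info_by_attraction: {attraction name: [travel information strings]}
    """
    return list(info_by_attraction.get(attraction, []))
```

For example, a merchant about to publish a Nha Trang product could be shown the visa and service-tip notes previously extracted from other Nha Trang listings.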
With the embodiment of the present application, the detail page content storage file of the travel product to be analyzed can be obtained from the service platform, and the image access addresses and text content extracted from it. Travel information matching the travel knowledge base is then extracted from the images corresponding to the extracted image access addresses and from the text content, respectively, and the publication and recommendation of related travel products are controlled based on the extracted travel information. The scheme of the present application thus saves the labor cost of manually entering the travel information contained in travel product images, and allows the travel product information published by merchants to be fully mined and utilized.
Based on the same inventive concept, the embodiment of the present application further provides a business object information processing apparatus corresponding to the business object information processing method. Because the principle by which the apparatus solves the problem is similar to that of the business object information processing method of the embodiment of the present application, the implementation of the apparatus may refer to the implementation of the method, and repeated description is omitted.
Fig. 5 is a schematic structural diagram of the business object information processing apparatus provided by the embodiment of the present application. The apparatus includes:
An acquisition module 51, configured to obtain a detail page content storage file of a business object to be analyzed from a service platform, the detail page content storage file including image access addresses and text content;
An extraction module 52, configured to extract, according to a preset business knowledge base, business content matching the business knowledge base from the images obtained based on the image access addresses and from the text content, respectively;
A storage module 53, configured to associate the extracted business content with the business object to be analyzed and store the association.
Optionally, the business object is a travel product, the business knowledge base is a travel knowledge base, and the service item corresponding to the business object to be analyzed is the tourist attraction corresponding to the travel product to be analyzed.
Optionally, the travel knowledge base includes one or more of the following:
Departure place information, destination information, travel activities, sightseeing tips, attraction introductions, shopping information, cuisine information, and price information.
Optionally, the storage module 53 is specifically configured to:
Establish a mapping relationship between the extracted business content and the identifying information of the travel product to be analyzed, and store the established mapping relationship;
Wherein the identifying information of the travel product to be analyzed is one or more of the following:
Tourist attraction information, title information, departure place information, destination information, and ID information preset for the travel product to be analyzed.
Optionally, the apparatus further includes:
A processing module 54, configured to control the publication or recommendation of related business objects based on the extracted business content; wherein the related business objects include the business object to be analyzed, or other business objects having the same service item as the business object to be analyzed, or both the business object to be analyzed and other business objects having the same service item as the business object to be analyzed.
Optionally, the processing module 54 is specifically configured to:
Check the legality of the business object to be analyzed based on the extracted business content and preset illegal keyword information, to confirm whether the business object to be analyzed can be published.
Optionally, the processing module 54 is specifically configured to:
When a business object search request is received from a user, determine at least one business object matching the business content keyword information carried in the request, according to that keyword information and the stored, previously extracted business content of each business object; and recommend the determined business object or objects to the user.
Optionally, the processing module 54 is specifically configured to:
After the business content of the business object to be analyzed is extracted, establish a mapping relationship between the extracted business content and the service item of the business object to be analyzed, and save the mapping relationship; after a business object to be published is obtained, look up, in the pre-stored mapping relationships, the business content associated with the service item corresponding to the business object to be published; and, according to the business content found, push to the user copywriting suggestions for publishing the business object.
Optionally, the extraction module 52 is specifically configured to:
According to the business object identification information corresponding to each image access address in the detail page content storage file, and the identification information of the business object to be analyzed, delete from the image access addresses in the detail page content storage file those that do not belong to the business object to be analyzed; and extract business content matching the business knowledge base from the images corresponding to the remaining image access addresses.
With the embodiment of the present application, the business object information processing apparatus can obtain the detail page content storage file of the business object to be analyzed from the service platform and extract the image access addresses and text content from it. Business content matching the business knowledge base is then extracted from the images corresponding to the extracted image access addresses and from the text content, respectively, and the publication or recommendation of related business objects is controlled based on the extracted business content. The business object information processing apparatus of the present application thus saves the labor cost of manually entering the product information contained in images, and allows the content published by merchants to be fully mined and utilized.
Those skilled in the art should understand that embodiments of the present application may be provided as a method, a system, or a computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present application may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM, and optical storage) containing computer-usable program code.
The present application is described with reference to flowcharts and/or block diagrams of the method, apparatus (system), and computer program product according to the embodiments of the present application. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means that implement the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, causing a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented process, such that the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
Although preferred embodiments of the present application have been described, those skilled in the art can make additional changes and modifications to these embodiments once they learn of the basic inventive concept. The appended claims are therefore intended to be construed as covering the preferred embodiments and all changes and modifications that fall within the scope of the present application.
Obviously, those skilled in the art can make various changes and variations to the present application without departing from its spirit and scope. If these modifications and variations fall within the scope of the claims of the present application and their technical equivalents, the present application is intended to cover them as well.

Claims (10)

1. A business object information processing method, characterized in that the method comprises:
Obtaining a detail page content storage file of a business object to be analyzed from a service platform, the detail page content storage file including image access addresses and text content;
Extracting, according to a preset business knowledge base, business content matching the business knowledge base from the images obtained based on the image access addresses and from the text content, respectively;
Associating the extracted business content with the business object to be analyzed and storing the association.
2. The method according to claim 1, characterized in that the business object is a travel product, the business knowledge base is a travel knowledge base, and the service item corresponding to the business object to be analyzed is the tourist attraction corresponding to the travel product to be analyzed.
3. The method according to claim 2, characterized in that the travel knowledge base includes one or more of the following:
Departure place information, destination information, travel activities, sightseeing tips, attraction introductions, shopping information, cuisine information, and price information.
4. The method according to claim 2 or 3, characterized in that associating the extracted business content with the business object to be analyzed and storing the association comprises:
Establishing a mapping relationship between the extracted business content and the identifying information of the travel product to be analyzed, and storing the established mapping relationship;
Wherein the identifying information of the travel product to be analyzed is one or more of the following:
Tourist attraction information, title information, departure place information, destination information, and ID information preset for the travel product to be analyzed.
5. The method according to claim 1, characterized in that, after associating the extracted business content with the business object to be analyzed and storing the association, the method further comprises:
Controlling the publication or recommendation of related business objects based on the extracted business content;
Wherein the related business objects include the business object to be analyzed, or other business objects having the same service item as the business object to be analyzed, or both the business object to be analyzed and other business objects having the same service item as the business object to be analyzed.
6. The method according to claim 5, characterized in that controlling the publication of related business objects based on the extracted business content comprises:
Checking the legality of the business object to be analyzed based on the extracted business content and preset illegal keyword information, to confirm whether the business object to be analyzed can be published.
7. The method according to claim 5, characterized in that controlling the recommendation of related business objects based on the extracted business content comprises:
When a business object search request is received from a user, determining at least one business object matching the business content keyword information carried in the request, according to that keyword information and the stored, previously extracted business content of each business object; and recommending the determined business object or objects to the user.
8. The method according to claim 5, characterized in that controlling the publication of related business objects based on the extracted business content comprises:
After the business content of the business object to be analyzed is extracted, establishing a mapping relationship between the extracted business content and the service item of the business object to be analyzed, and saving the mapping relationship;
After a business object to be published is obtained, looking up, in the pre-stored mapping relationships, the business content associated with the service item corresponding to the business object to be published;
According to the business content found, pushing to the user copywriting suggestions for publishing the business object.
9. The method according to claim 1, characterized in that extracting business content matching the business knowledge base from the images corresponding to the image access addresses comprises:
According to the business object identification information corresponding to each image access address in the detail page content storage file, and the identification information of the business object to be analyzed, deleting from the image access addresses in the detail page content storage file those that do not belong to the business object to be analyzed;
Extracting business content matching the business knowledge base from the images corresponding to the remaining image access addresses.
10. A business object information processing apparatus, characterized in that the apparatus comprises:
An acquisition module, configured to obtain a detail page content storage file of a business object to be analyzed from a service platform, the detail page content storage file including image access addresses and text content;
An extraction module, configured to extract, according to a preset business knowledge base, business content matching the business knowledge base from the images obtained based on the image access addresses and from the text content, respectively;
A storage module, configured to associate the extracted business content with the business object to be analyzed and store the association.
CN201710099966.9A 2017-02-23 2017-02-23 Business object information processing method and device Active CN108470296B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710099966.9A CN108470296B (en) 2017-02-23 2017-02-23 Business object information processing method and device

Publications (2)

Publication Number Publication Date
CN108470296A true CN108470296A (en) 2018-08-31
CN108470296B CN108470296B (en) 2022-02-25

Family

ID=63266698

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710099966.9A Active CN108470296B (en) 2017-02-23 2017-02-23 Business object information processing method and device

Country Status (1)

Country Link
CN (1) CN108470296B (en)

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080295028A1 (en) * 2007-05-21 2008-11-27 Sony Corporation Content display method, content display apparatus, recording medium, and server apparatus
CN102779140A (en) * 2011-05-13 2012-11-14 富士通株式会社 Keyword acquiring method and device
CN104424485A (en) * 2013-08-22 2015-03-18 北京卓易讯畅科技有限公司 Method and device for obtaining specific information based on image recognition
CN103927370A (en) * 2014-04-23 2014-07-16 焦点科技股份有限公司 Network information batch acquisition method of combined text and picture information

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109284795A (en) * 2018-09-04 2019-01-29 西安艾润物联网技术服务有限责任公司 A kind of data processing method and terminal device
CN110532449A (en) * 2019-08-30 2019-12-03 盈盛智创科技(广州)有限公司 A kind of processing method of service profile, device, equipment and storage medium
CN112445955A (en) * 2019-08-30 2021-03-05 珠海格力电器股份有限公司 Business opportunity information management method, system and storage medium
CN110532449B (en) * 2019-08-30 2022-05-31 盈盛智创科技(广州)有限公司 Method, device, equipment and storage medium for processing service document
CN112445955B (en) * 2019-08-30 2023-10-13 珠海格力电器股份有限公司 Business opportunity information management method, system and storage medium
CN111325607A (en) * 2020-02-26 2020-06-23 上海携程商务有限公司 Marketing page construction method, system, equipment and medium
CN113298595A (en) * 2020-07-30 2021-08-24 阿里巴巴集团控股有限公司 Method and device for providing data object information and electronic equipment

Also Published As

Publication number Publication date
CN108470296B (en) 2022-02-25

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant