CN108470296A - A kind of business object information processing method and processing device - Google Patents
A kind of business object information processing method and processing device Download PDFInfo
- Publication number
- CN108470296A CN108470296A CN201710099966.9A CN201710099966A CN108470296A CN 108470296 A CN108470296 A CN 108470296A CN 201710099966 A CN201710099966 A CN 201710099966A CN 108470296 A CN108470296 A CN 108470296A
- Authority
- CN
- China
- Prior art keywords
- business
- analyzed
- business object
- information
- extraction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0601—Electronic shopping [e-shopping]
- G06Q30/0641—Shopping interfaces
- G06Q30/0643—Graphical representation of items or shoppers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
- G06F16/90335—Query processing
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Accounting & Taxation (AREA)
- Finance (AREA)
- Databases & Information Systems (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- Strategic Management (AREA)
- Marketing (AREA)
- Economics (AREA)
- Computational Linguistics (AREA)
- Development Economics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
This application involves Internet technical field more particularly to a kind of business object information processing method and processing device, to realize in the case where not increasing businessman and filling in cost, abundant digging utilization is carried out to the content of businessman's publication.The embodiment of the present application provides a kind of business object information processing method:The details page content storage files of business object to be analyzed are obtained from service platform;Include picture access address and content of text in the details page content storage files;The business tine with professional knowledge storehouse matching is extracted respectively from the picture obtained based on the picture access address and in content of text according to preset professional knowledge library;The business tine of extraction is associated and is stored with business object to be analyzed.
Description
Technical field
This application involves Internet technical field more particularly to a kind of business object information processing method and processing devices.
Background technology
In electric business platform, businessman can issue a large amount of commodity details page content.In commodity details page, for page U.S.
It sees and typesetting is convenient, often place some pictures, can include departure place, journey routing, surcharge on these pictures
With information such as explanation, local playing method, sight spot introductions.
In order to analyze merchandise news, usually it is required for that commodity details page content is arranged, analyzed and handled.
But for the picture in details page, the information content that can not be directly obtained on picture.At present, if it is desired to make full use of figure
The information content of on piece, it is necessary to when businessman issues picture, it is desirable to which businessman is by the information content on picture with structuring
Form is stored.For example, when issuing picture shown in FIG. 1, it is necessary to which businessman fills title " 5 days 4, Beijing in storage
Late parent-offspring trip ", slogan " pure to play without shopping, the Spring Festival general do not appreciate ", neighbouring hotel name " ten thousand persons of outstanding talent/Westinghouse/Kai Binsi
Base/Fu Peng Sheraton Corp.s ".Obviously, this mode can increase the cost that businessman's progress commodity details page content is filled in, and reduce commodity hair
Cloth efficiency, if but businessman do not fill in the data of these structurings in publishing commodity information, just will be unable to extract one in picture
A little valuable merchandise newss, can not will so make full use of these valuable merchandise newss.
As it can be seen that in the case where not increasing businessman and filling in cost, abundant digging utilization is carried out to the content of businessman's publication, is
The direction realized is needed at present.
Invention content
The embodiment of the present application provides a kind of business object information processing method and processing device, is filled out to realize not increasing businessman
In the case of being write as this, abundant digging utilization is carried out to the business object information of businessman's publication.
The embodiment of the present application provides a kind of information extracting method, including:
The details page content storage files of business object to be analyzed are obtained from service platform;The details page content storage text
Include picture access address and content of text in part;
According to preset professional knowledge library, from the picture obtained based on the picture access address and in the text
Rong Zhong extracts the business tine with the professional knowledge storehouse matching respectively;
The business tine of extraction is associated and is stored with the business object to be analyzed.
Optionally, the business object is travelling products, and the professional knowledge library is travelling knowledge base, the industry to be analyzed
The corresponding service item of object of being engaged in refers to the corresponding tourist attractions of the travelling products to be analyzed.
Optionally, the travelling knowledge base includes one or more in following information:
Departure place information, destination information, playing method of travelling, prompt message of playing, sight spot introduction, shopping information, cuisines letter
Breath, pricing information.
Optionally, the business tine of extraction is associated and is stored with the business object to be analyzed, including:
The mapping relations between the business tine and the significant information of the travelling products to be analyzed of extraction are established,
And store the mapping relations of foundation;
Wherein, the significant information of the travelling products to be analyzed is one or more in following information:
Tourist attractions information, departure place information, destination information, is that the travelling products to be analyzed are advance at heading message
The mark id information of setting.
Optionally, after the business tine of extraction and the business object to be analyzed being associated and stored, also
Including:
The business tine based on extraction controls the publication or recommendation of associated business objects;
Wherein, the associated business objects include the business object to be analyzed, or including with the business to be analyzed
The identical other business objects of service item corresponding to object, or waited for point including the business object to be analyzed and with described
Analyse the identical other business objects of service item corresponding to business object.
Optionally, the business tine based on extraction controls the publication of associated business objects, including:
The business tine based on extraction and preset illegal keyword message detect the business pair to be analyzed
The legitimacy of elephant is to confirm that whether the business object to be analyzed can be issued.
Optionally, the business tine based on extraction controls the recommendation of associated business objects, including:
In the business object searching request for receiving user, according in the business carried in the business object searching request
Hold keyword message, and the business tine for each business object of storage extracted in advance, determines and the business tine keyword
At least one business object of information matches;Determining at least one business object is recommended into user.
Optionally, the business tine based on extraction controls the publication of associated business objects, including:
After the business tine for extracting the business object to be analyzed, the business tine of extraction and the industry to be analyzed are established
Mapping relations between the service item of object of being engaged in, and preserve the mapping relations;
After obtaining business object to be released, according to the corresponding service item of business object to be released, from advance
In the mapping relations of storage, search and the associated business tine of the service item;
According to the business tine found, official documents and correspondence suggestion when issuing the business object is pushed to user.
Optionally, from the corresponding picture in the picture access address, extraction matches in the business in the professional knowledge library
Hold, including:
According to the corresponding business object identification information in each picture access address in the details page content storage files,
And the identification information of the business object to be analyzed, from the picture access address in the details page content storage files,
Delete the picture access address for being not belonging to the business object to be analyzed;
From the corresponding picture in remaining picture access address, extraction matches the business tine in the professional knowledge library.
The embodiment of the present application provides a kind of information extracting device, including:
Acquisition module, the details page content storage files for obtaining business object to be analyzed from service platform;It is described detailed
Include picture access address and content of text in feelings page content storage files;
Extraction module is used for according to preset professional knowledge library, from the picture obtained based on the picture access address,
In the content of text, the business tine with the professional knowledge storehouse matching is extracted respectively;
Memory module, for the business tine of extraction to be associated and store with the business object to be analyzed.
Using the embodiment of the present application, the details page content storage text of business object to be analyzed can be obtained from service platform
Part therefrom extracts picture access address and content of text, then out of, extraction the corresponding picture in picture access address and text
Rong Zhong, extraction respectively matches the business tine in professional knowledge library, then by the business tine of extraction and the industry to be analyzed
Business object is associated and stores;Based on these business tines of storage, can control later associated business objects publication and
Recommend.To use application scheme, the cost that manpower fills in merchandise news in picture is saved, and businessman can be issued
Content carries out abundant digging utilization.
Description of the drawings
Fig. 1 is the business object information process flow figure that the embodiment of the present application one provides;
Fig. 2 is the picture schematic diagram of travelling products " Zhaoqing Guangdong 2 days with trip ";
Fig. 3 is the business object information process flow figure that the embodiment of the present application two provides;
Fig. 4 (a) is the picture schematic diagram of travelling products " Yi Ou comes Suzhou shopping village ";
Fig. 4 (b) is the picture schematic diagram of travelling products " visit of Beijing museum ";
Fig. 4 (c) is the picture schematic diagram of travelling products " Yanqi Lake one-day tour ";
Fig. 5 is business object information processing device structure diagram provided by the embodiments of the present application.
Specific implementation mode
The embodiment of the present application is applied to that the business object information that service side issues is automatically analyzed and excavated.Preparing
For analyze and excavate data when, other than the data in the content of text of publication, the text in the picture of business object to be analyzed
Word information is often more crucial, how (to be such as embodied in figure to the business object information of the various forms of expression of service side's publication
Business object information in piece) to carry out abundant digging utilization be the application content to be illustrated.
The embodiment of the present application is described in further detail with reference to the accompanying drawings of the specification.
Embodiment one
As shown in Figure 1, for the business object information process flow figure that the embodiment of the present application one provides, including following step
Suddenly:
S101:The details page content storage files of business object to be analyzed are obtained from service platform;The details page content
Include picture access address and content of text in storage file.
Business object to be analyzed can refer to it is any be suitable for recommending, show the object of user, including various entity products,
Virtual product etc..As a kind of application scenarios, business object to be analyzed here can refer to travelling products, such as 5 days 4 evenings of Beijing
Parent-offspring swims, and correspondingly, details page content generally comprises the picture of tourist attractions and introduces the content of text of corresponding travelling products.
In specific implementation, the details page of business object to be analyzed can be obtained from the catalogue in details page information library first
Content storage files path.Then, according to the content storage files path of acquisition, details page is obtained from details page information library
Content storage files.
Here, there are each business object details page (such as commodity in the catalogue in details page information library (such as commodity library)
Details page) content storage files path, can be got from the catalogue based on business object identification information above-mentioned to be analyzed
The details page content storage files path of business object.It is then possible to using dedicated file access kit, according to acquisition
Content storage files path obtains the details page content storage files of business object to be analyzed from details page information library.
After executing S101 and obtaining details page content storage files, it is also necessary to therefrom extract picture access address and text
Content.
Here, since picture file is all bigger, the access of the only picture generally preserved in content storage files
Location, such as the uniform resource locator (Uniform Resource Locator, URL) of picture.It is obtained from details page information library
Content storage files be hypertext markup language (Hyper Text MarkupLanguage, HTML) format, first can be with
Using the content in HTML PARSER (a kind of HTML parsing and analysis tool) parsing the above storage files, Ye Jicong
Picture access address is parsed in the content storage files of html format.Later, picture can be obtained according to picture access address
File.Here it is possible to according to the URL of picture, picture file is obtained using HttpClient (a kind of client programming kit).
S102:According to preset professional knowledge library, from the picture obtained based on the picture access address and the text
In this content, the business tine with the professional knowledge storehouse matching is extracted respectively.
When extracting business tine in picture, optical character identification (Optical may be used
CharacterRecognition, OCR) picture recognition technology, extract the text information in picture.
Here, include suitable for some of data mining and analysis business tine keywords in preset professional knowledge library
Information.According to specific business scenario difference, keyword message here is also different.For example, for this scene of travelling products,
Here professional knowledge library can refer to travelling knowledge base, wherein various travelling keywords are had collected, such as departure place, journey stroke
Arrangement, extra charge explanation, local playing method, sight spot introduction etc..In this way, can be extracted from the text information of picture with it is above-mentioned
Information that the corresponding departure place information of travelling keyword, journey routing information, extra charge illustrate, local playing method correlation letter
The information etc. that breath, sight spot are introduced.
S103:The business tine of extraction is associated and is stored with the business object to be analyzed.
It here, can be by matched business tine according to quotient after the business tine for extracting matching professional knowledge library
Product dimension is put in storage, and to carry out data analysis to these matched datas, to excavate, and is executed based on the result of data analysis, excavation
Follow-up business processing.Specifically, for travelling products, the mark of the business tine and travelling products to be analyzed of extraction can be established
Mapping relations between property information, and store the mapping relations of foundation;Wherein, the significant information of the travelling products to be analyzed
It is one or more in following information:Tourist attractions information, departure place information, destination information, waits for heading message for described in
Analyze the pre-set mark id information of travelling products.
For example, swimming this product for the 5 days 4 late parent-offsprings in Beijing, wrapped in the details page content of this travelling products of extraction
It includes the picture of tourist attractions and introduces the content of text of corresponding travelling products, wherein all include mark in picture and content of text
Inscribe information (or title of travelling products) " Beijing 5 days 4 late parent-offspring trip ", tourist attractions information " Beijing ", in content of text also
Include departure place " Shanghai " and destination " Beijing ".It, can be by title " Beijing 5 days 4 evenings parent when establishing above-mentioned mapping relations
Significant information of the son trip " as the travelling products, or can be by tourist attractions " Beijing " as the travelling products mark
Property information, departure place and destination " Beijing-Shanghai " can also be used as to significant information, alternatively, being tourism production by system
The mark ID (for example being identified using using number 101) of product setting is used as significant information.In this way, the tourism can be set up
The significant information of product and from the mapping relations between the other travel informations extracted in picture and in content of text, then into
Row storage.Above-mentioned significant information can as the key message that business object to be analyzed is retrieved and recommended, for example,
When user searches for tourism of Beijing, from the tourist attractions information of the travelling products of storage, it is north to extract corresponding tourist attractions
Capital or tourist famous-city are Pekinese's travelling products, and recommend user.
In general, tourism electric business platform commodity details page content is issued by businessman, in order to which the page is beautiful and typesetting
Convenient, businessman often will place a large amount of picture in commodity details page, can include a large amount of useful informations on picture;Such as set out
Ground, journey routing, extra charge explanation, local playing method, sight spot introduction etc..Pass through method provided by the present application, Ke Yicong
These details page contents are extracted crucial literal information, and are associated with corresponding commodity especially in image content, form quotient
The marking data of product, automatic excavating structural data meet the needs of each dimensional attribute information of structuring business.This is for big
Data application is a most basic and essential ring.
In the specific embodiment applied at one, optionally, in the business tine that will be extracted and the industry to be analyzed
After business object is associated and stores, the method can also include:S104 (not shown):Business tine based on extraction,
Control the publication or recommendation of associated business objects.
Here associated business objects include business object to be analyzed, or include with corresponding to business object to be analyzed
The identical other business objects of service item, or including business object to be analyzed and with the clothes corresponding to business object to be analyzed
The identical other business objects of business project.
Under a kind of application mode, the publication of the business tine control associated business objects based on extraction, including:
Business tine based on extraction and preset illegal keyword message, detect the legal of business object to be analyzed
Property is to confirm that whether the business object to be analyzed can be issued.
Whether include illegal keyword message in the business tine of Detection and Extraction under this application mode, such as certain
Travelling products illustrated with the font of very little in picture with force relevant information etc. of doing shopping, if so, then confirming that this is to be analyzed
Business object can not be issued, and can refuse to issue the business object to be analyzed at this time, or to issued industry to be analyzed
Business object is forced offline.
Under another application mode, the recommendation of the business tine control associated business objects based on extraction, including:
In the business object searching request for receiving user, according in the business carried in the business object searching request
Hold keyword message, and the business tine for each business object of storage extracted in advance, determines and the business tine keyword
At least one business object of information matches;Determining at least one business object is recommended into user.
In this manner, the business tine of the business object to be analyzed of extraction is stored as recommendation information,
After receiving the searching request that user carries business tine keyword message, from the recommendation information of storage, determine and the business
The business tine of the business object of content-keyword information matches recommends user.For example, being " 5 days 4, Beijing for picture header
This travelling products of late parent-offspring's trip ", after the text information and content of text in extracting and storing its corresponding picture file,
It, can be by the phase of this travelling products if receiving the searching request that user carries the such keyword message of Beijing tourism in 5 days
Close travel information (such as in picture and content of text in travel information) recommend user.
Under another application mode, the publication of the business tine control associated business objects based on extraction, including:
After the business tine for extracting business object to be analyzed, the business tine of extraction and business object to be analyzed are established
Mapping relations between service item, and preserve the mapping relations;It is to be released according to this after obtaining business object to be released
The corresponding service item of business object, from pre-stored mapping relations, search with the associated business of the service item in
Hold;According to the business tine found, official documents and correspondence suggestion when issuing the business object is pushed to user.
This application mode is for service side user, for example, for travelling products this scenes, business pair to be analyzed
As corresponding service item refers to the corresponding tourist attractions of travelling products to be analyzed.Here, by establish tourist attractions with it is corresponding
Travelling products official documents and correspondence content (image content and content of text) between mapping relations, when determining that businessman will issue certain tourism
When the travelling products at sight spot, it can be pushed to it based on the official documents and correspondence content with the relevant travelling products in the tourist attractions of storage
Related official documents and correspondence suggestion, for example suggest the key message etc. of displaying in picture in content arrangement pattern, picture.
As shown in Fig. 2, for travelling products " Zhaoqing Guangdong 2 days with trip ", figure of the user from the travelling products of acquisition
Scenic spot cuisines information " stuffed glutinous rice dumpling " is extracted in piece file, and travelling products title can be extracted from the content of text of the travelling products
" Zhaoqing Guangdong 2 days with trip ", " first day Shenzhen multiplies bus and goes to Zhaoqing, trip Qixingyan scenic spot, ancient Sung Dynasty Village wall for routing
Deng;Second day trip Dinghu Hill, Qingyun temple, flies many places sight spots such as puddle waterfall at tripod garden, rear to return to Shenzhen by bus ", in this way, building
Erect mapping of the title " Zhaoqing Guangdong 2 days with trip " of travelling products between above-mentioned scenic spot cuisines information and routing etc.
Relationship, and save.Later, when other businessmans need to issue travelling products information relevant with Zhaoqing Guangdong, Ke Yixiang
It pushes official documents and correspondence suggestion, for example travelling products title, scenic spot cuisines information, row are shown in picture or content of text to its suggestion
Journey arrangement can also show related sight spot, cuisines information successively to its suggestion in picture, illustrate that stroke is pacified in content of text
Row etc..
Using the embodiment of the present application, the details page content storage text of business object to be analyzed can be obtained from service platform
Part therefrom extracts picture access address and content of text, then from the corresponding picture in picture access address of extraction and text
In content, the business tine in extraction matching professional knowledge library, the business tine based on extraction control associated business objects respectively
Publication and recommendation save the cost that manpower fills in merchandise news in picture, and can be to businessman to use application scheme
The content of publication carries out abundant digging utilization.
In actual implementation, the picture URL that includes in the content storage files of details page may includes current to be analyzed
The picture of other unrelated business objects except business object, is interfered in order to avoid the real information of business object, needs
Garbage is filtered, by taking travelling products as an example, the description of detailed in Example two.
Embodiment two
As shown in figure 3, for the business object information process flow figure that the embodiment of the present application two provides, including:
S301:The details page content storage files of travelling products to be analyzed are obtained from service platform, are stored in the details page
It includes picture access address and content of text to store up in file.
Here, it is based on the corresponding identification information of travelling products to be analyzed, is found from the catalogue in the details page information library
The content storage files path of the details page of travelling products to be analyzed.File access kit is called, text is stored according to the content
Part path obtains content storage files from details page information library.Then, parse content of text in content storage files and
Picture access address.
Here, since the return of file access tool is html file, HTML analytical tools-HTML may be used
PARSER parses html file.Specifically, HTML PARSER may be used and parse content of text and picture URL therein,
When parsing content of text, content of text all in wherein content storage files all can be carried using HTML PARSER first
It takes out, then executes S202 again, further extraction matches travelling knowledge base in the content of text of HTML PARSER parsings
Travel information.HTML PARSER can also be used directly to parse the travel information of matching travelling knowledge base.
S302:According to the corresponding travelling products mark letter in each picture access address in details page content storage files
The identification information of breath and travelling products to be analyzed is deleted not from the picture access address in details page content storage files
Belong to the picture access address of travelling products to be analyzed.
S303:According to preset travelling knowledge base, from the corresponding picture in remaining picture access address, extraction matching trip
The travel information of row knowledge base.
Here, the picture that non-present travelling products to be analyzed are filtered out from the picture URL queues of acquisition, rejects other productions
The interference of the picture of product.Then HttpClient is used to obtain picture file according to remaining at least one URL.Then, it uses
OCR identification technologies, extract the text information in picture file, and according to travelling knowledge base, from the text information of extraction into
One step filters out sight spot shopping, scenic spot cuisines, tourism playing method and sight spot and plays the travel informations such as warm tip.
The server of the embodiment of the present application can be obtained therefrom after the content storage files for obtaining certain travelling products details page
It takes and the relevant picture file of the travelling products and content of text.Here, for page beauty, businessman places in commodity details page
Picture on can include a large amount of useful informations, such as departure place, journey routing, extra charge explanation, local playing method, sight spot
Introduce etc..The application needs to extract crucial literal information from the image content of commodity details page, makees with reference to specific picture
It further illustrates.
As shown in Fig. 4 (a), can be extracted from the picture file of the travelling products of acquisition departure place information " it is northern, upper, wide,
It is deep ", journey stroke number of days " 4 days 3 evenings ", the title " the 4 days 3 late free walkers in Soul " of travelling products.In this way, it is established that travelling products
Incidence relation of the title " Soul 4 days 3 late free walker " between above-mentioned departure place and journey stroke number of days, and save.
As shown in Fig. 4 (b), departure place information " Hangzhou " and trip can be extracted from the picture file of the travelling products of acquisition
The title " Vietnam's Nha Trang air ticket " for swimming product, can also extract pricing information " 2490 " from the content of text of the travelling products
Deng in this way, it is established that title " Vietnam's Nha Trang air ticket " being associated between above-mentioned departure place and pricing information of travelling products
System, and save.
As shown in Fig. 4 (c), contains in the picture file of certain Zhang Youguan Nha Trang tourism and said with tourism playing method, extra charge
Bright relevant information, for example can extract visa and inbound information from picture and illustrate information about service tip.This
Sample can set up the incidence relation between tourist famous-city " Nha Trang " and above-mentioned travel information, and save.
S304:The significant information of the travel information of extraction and travelling products to be analyzed is associated and is stored.
Here it is possible to which the mapping established between the travel information of extraction and the significant information of travelling products to be analyzed is closed
System, and store the mapping relations of foundation;Wherein, the significant information of the travelling products to be analyzed is one kind in following information
Or it is a variety of:Tourist attractions information, departure place information, destination information, is that the travelling products to be analyzed are set in advance at heading message
The mark id information set.
S305:Travel information based on extraction controls the publication or recommendation of related travelling products.
In specific implementation, travel information that can be based on extraction and preset illegal keyword message are detected and are waited for point
The legitimacy of travelling products is analysed to confirm that whether the travelling products to be analyzed can be issued.It can also receive user's
When travelling products searching request, according to the tourism content-keyword information carried in the travelling products searching request, and storage
Each travelling products extracted in advance travel information, determine and at least one tourisms of the tourism content-keyword information matches
Product;Determining at least one travelling products are recommended into user.It can also be in the travel information for extracting travelling products to be analyzed
Afterwards, the mapping relations between the travel information of extraction and tourist attractions are established, after obtaining travelling products to be released, according to this
It is searched associated with the tourist attractions from pre-stored mapping relations the corresponding tourist attractions of travelling products to be released
Travel information;According to the travel information found, official documents and correspondence suggestion when issuing the travel information is pushed to user.
Using the embodiment of the present application, the details page content storage text of travelling products to be analyzed can be obtained from service platform
Part therefrom extracts picture access address and content of text, then out of, extraction the corresponding picture in picture access address and text
Rong Zhong, the travel information of extraction matching travelling knowledge base, the travel information based on extraction control the hair of related travelling products respectively
Cloth and recommendation save the cost that manpower fills in travel information in the pictures of travelling products, and can to use application scheme
Abundant digging utilization is carried out with the travelling products information issued to businessman.
Based on same inventive concept, additionally provided in the embodiment of the present application a kind of corresponding with business object information processing method
Business object information processing unit, the principle solved the problems, such as due to the device handled with the embodiment of the present application business object information
Method is similar, therefore the implementation of the device may refer to the implementation of method, and overlaps will not be repeated.
As shown in figure 5, be business object information processing device structure diagram provided by the embodiments of the present application, including:
Acquisition module 51, the details page content storage files for obtaining business object to be analyzed from service platform;It is described
Include picture access address and content of text in details page content storage files;
Extraction module 52 is used for according to preset professional knowledge library, from the picture obtained based on the picture access address
In and the content of text in, extraction and the business tine of the professional knowledge storehouse matching respectively;
Memory module 53, for the business tine of extraction to be associated and deposit with the business object to be analyzed
Storage.
Optionally, the business object is travelling products, and the professional knowledge library is travelling knowledge base, the industry to be analyzed
The corresponding service item of object of being engaged in refers to the corresponding tourist attractions of the travelling products to be analyzed.
Optionally, the travelling knowledge base includes one or more in following information:
Departure place information, destination information, playing method of travelling, prompt message of playing, sight spot introduction, shopping information, cuisines letter
Breath, pricing information.
Optionally, the memory module 53 is specifically used for:
The mapping relations between the business tine and the significant information of the travelling products to be analyzed of extraction are established,
And store the mapping relations of foundation;
Wherein, the significant information of the travelling products to be analyzed is one or more in following information:
Tourist attractions information, departure place information, destination information, is that the travelling products to be analyzed are advance at heading message
The mark id information of setting.
Optionally, described device further includes:
Processing module 54 is used for the business tine based on extraction, controls the publication or recommendation of associated business objects;Its
In, the associated business objects include the business object to be analyzed, or including with corresponding to the business object to be analyzed
The identical other business objects of service item, or including the business object to be analyzed and with the business object to be analyzed
The identical other business objects of corresponding service item.
Optionally, the processing module 54 is specifically used for:
The business tine based on extraction and preset illegal keyword message detect the business pair to be analyzed
The legitimacy of elephant is to confirm that whether the business object to be analyzed can be issued.
Optionally, the processing module 54 is specifically used for:
In the business object searching request for receiving user, according in the business carried in the business object searching request
Hold keyword message, and the business tine for each business object of storage extracted in advance, determines and the business tine keyword
At least one business object of information matches;Determining at least one business object is recommended into user.
Optionally, the processing module 54 is specifically used for:
After the business tine for extracting the business object to be analyzed, the business tine of extraction and the industry to be analyzed are established
Mapping relations between the service item of object of being engaged in, and preserve the mapping relations;After obtaining business object to be released, according to
The corresponding service item of business object to be released, from pre-stored mapping relations, lookup is associated with the service item
Business tine;According to the business tine found, official documents and correspondence suggestion when issuing the business object is pushed to user.
Optionally, the extraction module 52 is specifically used for:
According to the corresponding business object identification information in each picture access address in the details page content storage files,
And the identification information of the business object to be analyzed, from the picture access address in the details page content storage files,
Delete the picture access address for being not belonging to the business object to be analyzed;From the corresponding picture in remaining picture access address,
Extraction matches the business tine in the professional knowledge library.
Using the embodiment of the present application, business object information processing unit can obtain business object to be analyzed from service platform
Details page content storage files, picture access address and content of text are therefrom extracted, then from the picture access address of extraction
In corresponding picture and in content of text, the business tine in extraction matching professional knowledge library respectively, the business tine based on extraction,
The publication or recommendation for controlling associated business objects, to using the business object information processing unit of the application, save manpower
The cost of merchandise news in picture is filled in, and abundant digging utilization can be carried out to the content that businessman issues.
It should be understood by those skilled in the art that, embodiments herein can be provided as method, system or computer program
Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the application
Apply the form of example.Moreover, the application can be used in one or more wherein include computer usable program code computer
The computer program production implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.)
The form of product.
The application is flow of the reference according to method, apparatus (system) and computer program product of the embodiment of the present application
Figure and/or block diagram describe.It should be understood that can be realized by computer program instructions every first-class in flowchart and/or the block diagram
The combination of flow and/or box in journey and/or box and flowchart and/or the block diagram.These computer programs can be provided
Instruct the processor of all-purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce
A raw machine so that the instruction executed by computer or the processor of other programmable data processing devices is generated for real
The device for the function of being specified in present one flow of flow chart or one box of multiple flows and/or block diagram or multiple boxes.
These computer program instructions, which may also be stored in, can guide computer or other programmable data processing devices with spy
Determine in the computer-readable memory that mode works so that instruction generation stored in the computer readable memory includes referring to
Enable the manufacture of device, the command device realize in one flow of flow chart or multiple flows and/or one box of block diagram or
The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device so that count
Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, in computer or
The instruction executed on other programmable devices is provided for realizing in one flow of flow chart or multiple flows and/or block diagram one
The step of function of being specified in a box or multiple boxes.
Although the preferred embodiment of the application has been described, created once a person skilled in the art knows basic
Property concept, then additional changes and modifications may be made to these embodiments.So it includes excellent that the following claims are intended to be interpreted as
It selects embodiment and falls into all change and modification of the application range.
Obviously, those skilled in the art can carry out the application essence of the various modification and variations without departing from the application
God and range.In this way, if these modifications and variations of the application belong to the range of the application claim and its equivalent technologies
Within, then the application is also intended to include these modifications and variations.
Claims (10)
1. a kind of business object information processing method, which is characterized in that this method includes:
The details page content storage files of business object to be analyzed are obtained from service platform;In the details page content storage files
Include picture access address and content of text;
According to preset professional knowledge library, from the picture obtained based on the picture access address and in the content of text,
The business tine of extraction and the professional knowledge storehouse matching respectively;
The business tine of extraction is associated and is stored with the business object to be analyzed.
2. the method as described in claim 1, which is characterized in that the business object is travelling products, the professional knowledge library
For knowledge base of travelling, the corresponding service item of the business object to be analyzed refers to the corresponding tourism of the travelling products to be analyzed
Sight spot.
3. method as claimed in claim 2, which is characterized in that the travelling knowledge base include one kind in following information or
It is a variety of:
Departure place information, destination information, playing method of travelling, prompt message of playing, sight spot introduction, shopping information, cuisines information, valence
Lattice information.
4. method as claimed in claim 2 or claim 3, which is characterized in that by the business tine of extraction and the industry to be analyzed
Business object is associated and stores, including:
The mapping relations between the business tine and the significant information of the travelling products to be analyzed of extraction are established, and are deposited
Store up the mapping relations established;
Wherein, the significant information of the travelling products to be analyzed is one or more in following information:
Tourist attractions information, departure place information, destination information, is that the travelling products to be analyzed are pre-set at heading message
Mark id information.
5. the method as described in claim 1, which is characterized in that by the business tine of extraction and the business pair to be analyzed
After being associated and storing, further include:
The business tine based on extraction controls the publication or recommendation of associated business objects;
Wherein, the associated business objects include the business object to be analyzed, or including with the business object to be analyzed
The identical other business objects of corresponding service item, or including the business object to be analyzed and with the industry to be analyzed
The identical other business objects of service item being engaged in corresponding to object.
6. method as claimed in claim 5, which is characterized in that the business tine based on extraction controls related service pair
The publication of elephant, including:
The business tine based on extraction and preset illegal keyword message detect the business object to be analyzed
Legitimacy is to confirm that whether the business object to be analyzed can be issued.
7. method as claimed in claim 5, which is characterized in that the business tine based on extraction controls related service pair
The recommendation of elephant, including:
In the business object searching request for receiving user, closed according to the business tine carried in the business object searching request
Key word information, and storage each business object extracted in advance business tine, determine with the business tine keyword message
Matched at least one business object;Determining at least one business object is recommended into user.
8. method as claimed in claim 5, which is characterized in that the business tine based on extraction controls related service pair
The publication of elephant, including:
After the business tine for extracting the business object to be analyzed, the business tine of extraction and the business pair to be analyzed are established
Mapping relations between the service item of elephant, and preserve the mapping relations;
After obtaining business object to be released, according to the corresponding service item of business object to be released, from prestoring
Mapping relations in, search with the associated business tine of the service item;
According to the business tine found, official documents and correspondence suggestion when issuing the business object is pushed to user.
9. the method as described in claim 1, which is characterized in that from the corresponding picture in the picture access address, extraction
Business tine with the professional knowledge library, including:
According to the corresponding business object identification information in each picture access address in the details page content storage files, and
The identification information of the business object to be analyzed is deleted from the picture access address in the details page content storage files
It is not belonging to the picture access address of the business object to be analyzed;
From the corresponding picture in remaining picture access address, extraction matches the business tine in the professional knowledge library.
10. a kind of business object information processing unit, which is characterized in that the device includes:
Acquisition module, the details page content storage files for obtaining business object to be analyzed from service platform;The details page
Include picture access address and content of text in content storage files;
Extraction module, for according to preset professional knowledge library, from the picture obtained based on the picture access address and institute
It states in content of text, respectively the business tine of extraction and the professional knowledge storehouse matching;
Memory module, for the business tine of extraction to be associated and store with the business object to be analyzed.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710099966.9A CN108470296B (en) | 2017-02-23 | 2017-02-23 | Business object information processing method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710099966.9A CN108470296B (en) | 2017-02-23 | 2017-02-23 | Business object information processing method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108470296A true CN108470296A (en) | 2018-08-31 |
CN108470296B CN108470296B (en) | 2022-02-25 |
Family
ID=63266698
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710099966.9A Active CN108470296B (en) | 2017-02-23 | 2017-02-23 | Business object information processing method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108470296B (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109284795A (en) * | 2018-09-04 | 2019-01-29 | 西安艾润物联网技术服务有限责任公司 | A kind of data processing method and terminal device |
CN110532449A (en) * | 2019-08-30 | 2019-12-03 | 盈盛智创科技(广州)有限公司 | A kind of processing method of service profile, device, equipment and storage medium |
CN111325607A (en) * | 2020-02-26 | 2020-06-23 | 上海携程商务有限公司 | Marketing page construction method, system, equipment and medium |
CN112445955A (en) * | 2019-08-30 | 2021-03-05 | 珠海格力电器股份有限公司 | Business opportunity information management method, system and storage medium |
CN113298595A (en) * | 2020-07-30 | 2021-08-24 | 阿里巴巴集团控股有限公司 | Method and device for providing data object information and electronic equipment |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080295028A1 (en) * | 2007-05-21 | 2008-11-27 | Sony Corporation | Content display method, content display apparatus, recording medium, and server apparatus |
CN102779140A (en) * | 2011-05-13 | 2012-11-14 | 富士通株式会社 | Keyword acquiring method and device |
CN103927370A (en) * | 2014-04-23 | 2014-07-16 | 焦点科技股份有限公司 | Network information batch acquisition method of combined text and picture information |
CN104424485A (en) * | 2013-08-22 | 2015-03-18 | 北京卓易讯畅科技有限公司 | Method and device for obtaining specific information based on image recognition |
-
2017
- 2017-02-23 CN CN201710099966.9A patent/CN108470296B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080295028A1 (en) * | 2007-05-21 | 2008-11-27 | Sony Corporation | Content display method, content display apparatus, recording medium, and server apparatus |
CN102779140A (en) * | 2011-05-13 | 2012-11-14 | 富士通株式会社 | Keyword acquiring method and device |
CN104424485A (en) * | 2013-08-22 | 2015-03-18 | 北京卓易讯畅科技有限公司 | Method and device for obtaining specific information based on image recognition |
CN103927370A (en) * | 2014-04-23 | 2014-07-16 | 焦点科技股份有限公司 | Network information batch acquisition method of combined text and picture information |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109284795A (en) * | 2018-09-04 | 2019-01-29 | 西安艾润物联网技术服务有限责任公司 | A kind of data processing method and terminal device |
CN110532449A (en) * | 2019-08-30 | 2019-12-03 | 盈盛智创科技(广州)有限公司 | A kind of processing method of service profile, device, equipment and storage medium |
CN112445955A (en) * | 2019-08-30 | 2021-03-05 | 珠海格力电器股份有限公司 | Business opportunity information management method, system and storage medium |
CN110532449B (en) * | 2019-08-30 | 2022-05-31 | 盈盛智创科技(广州)有限公司 | Method, device, equipment and storage medium for processing service document |
CN112445955B (en) * | 2019-08-30 | 2023-10-13 | 珠海格力电器股份有限公司 | Business opportunity information management method, system and storage medium |
CN111325607A (en) * | 2020-02-26 | 2020-06-23 | 上海携程商务有限公司 | Marketing page construction method, system, equipment and medium |
CN113298595A (en) * | 2020-07-30 | 2021-08-24 | 阿里巴巴集团控股有限公司 | Method and device for providing data object information and electronic equipment |
Also Published As
Publication number | Publication date |
---|---|
CN108470296B (en) | 2022-02-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108470296A (en) | A kind of business object information processing method and processing device | |
JP5269598B2 (en) | System and method for image processing | |
CN103620588B (en) | Based on browsing activity recognition matching application | |
KR101191172B1 (en) | Method, apparatus and computer-readable recording medium for managing images in image database | |
US8204950B2 (en) | Webpage search | |
CN105162627B (en) | It was found that the method and system with presentation network application access information | |
KR101859050B1 (en) | Method and system for searching map image using context of image | |
CN101097578A (en) | Network resource searching method and system | |
JP2009134280A (en) | Method for generating and providing map image for creating virtual space representing real world, server computer, and three-dimensional map image generating device | |
KR101062929B1 (en) | Method, terminal, and computer-readable recording medium for supporting collection of object included in the image which is taken | |
CN106250129B (en) | Vector quantization symbol dynamic drawing method based on meta graph recognition model | |
JP6351219B2 (en) | Image search apparatus, image search method and program | |
CN103246678A (en) | Method and device for previewing web page contents | |
CN103838862B (en) | Video searching method, device and terminal | |
CN104320848B (en) | The system and method for indoor positioning is realized based on cloud computing | |
US20030018789A1 (en) | Information providing method and information providing system and terminal therefor | |
CN106294885A (en) | A kind of data collection towards isomery webpage and mask method | |
KR20100089339A (en) | Method and apparatus for generating and displaying image | |
JP2007109221A (en) | Part management system, part management method, program and recording medium | |
CN112015845B (en) | Method, device, equipment and storage medium for map retrieval test | |
WO2024164589A9 (en) | Information display method and apparatus, electronic device, computer readable storage medium, and computer program product | |
CN104166660A (en) | Search system and method based on range selection | |
KR101843585B1 (en) | Service server and method by object recognition | |
JP2006065467A (en) | Device for creating data extraction definition information and method for creating data extraction definition information | |
JP2006065467A5 (en) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |