CN106033298A - Data extracting method and system - Google Patents

Data extracting method and system Download PDF

Info

Publication number
CN106033298A
CN106033298A CN201610287295.4A CN201610287295A CN106033298A CN 106033298 A CN106033298 A CN 106033298A CN 201610287295 A CN201610287295 A CN 201610287295A CN 106033298 A CN106033298 A CN 106033298A
Authority
CN
China
Prior art keywords
data
data items
signature
extracted
extract
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610287295.4A
Other languages
Chinese (zh)
Inventor
胡嵩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Marine Network Technology (beijing) Co Ltd
Original Assignee
Marine Network Technology (beijing) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Marine Network Technology (beijing) Co Ltd filed Critical Marine Network Technology (beijing) Co Ltd
Priority to CN201610287295.4A priority Critical patent/CN106033298A/en
Publication of CN106033298A publication Critical patent/CN106033298A/en
Priority to PCT/CN2017/082844 priority patent/WO2017190654A1/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/0483Interaction with page-structured environments, e.g. book metaphor

Abstract

The invention aims at providing a data extracting method and system. When a data extraction trigger operation is detected, a data extraction area corresponding to the data extraction trigger operation is determined from the current conversation; according to target task schedules, item content of all data items is extracted from the data extraction area according to feature symbols corresponding to the data items contained in the target task schedules. By means of the method, the item content corresponding to all the data items contained in the target task schedules can be directly extracted from the data extraction area according to the target task schedules. The method and system are beneficial to group conversations and office scenes. By means of the corresponding relation of the data items and the feature symbols, the item content of all data items can be automatically extracted from the conversation content of users, tedious manual operation of users is reduced, and user experience is enhanced.

Description

A kind of method and system extracted for data
Technical field
The present invention relates to areas of information technology, particularly relate to a kind of technology extracted for data.
Background technology
In the interactive function that current various JICQs are provided, the most do not include session Content is extracted targetedly.Such as, user is it desired to extract the certain content in session, Can only slide at this session content, and then long by this session content, by " duplication " of ejecting, This session content is processed by option of operation such as accordingly " choosings entirely ".
When being applied to group session scene, the mode that this user extracts manually is more difficult to be suitable for. Owing to the participant of now session is more, session content updates faster, so that user needs Just can navigate at its session content wishing operation by more loaded down with trivial details manual lookup, Jin Erye Manually this session content can only be carried out various operation process.
Summary of the invention
It is an object of the invention to provide a kind of method and system extracted for data.
According to an aspect of the invention, it is provided a kind of data extraction method, wherein, the method Including:
A, when detecting that data extract trigger action, determines that from current sessions described data are extracted Data corresponding to trigger action extract region;
B, according to goal task arrangement, is comprised each data items according to described goal task arrangement Characteristic of correspondence labelling respectively, extracts the item of each data items described in extracted region from described data Mesh content.
According to another aspect of the present invention, additionally provide a kind of data extraction system, wherein, This system includes:
Extract area determining device, for when detecting that data extract trigger action, from current sessions Middle determine that described data are extracted data corresponding to trigger action and extracted region;
Data items extraction element, for according to goal task arrangement, pacifies according to described goal task Row is comprised each data items characteristic of correspondence labelling respectively, extracts extracted region institute from described data State the contents of a project of each data items.
Compared with prior art, the present invention directly can carry from data according to goal task arrangement Take extracted region goal task arrangement and comprised the contents of a project that each data items is the most corresponding.This It is favourable under group session and office scene.Corresponding with signature by data items Relation, the present invention can extract in the project of each data items automatically from the session content of user Hold, thus decrease the manual operation that user is loaded down with trivial details, enhance the experience of user.
Accompanying drawing explanation
By reading retouching in detail with reference to made non-limiting example is made of the following drawings Stating, the other features, objects and advantages of the present invention will become more apparent upon:
Fig. 1 illustrates the method flow diagram extracted according to an embodiment of the invention for data;
Fig. 2 illustrates that the system for data extraction according to another preferred embodiment of the present invention is shown It is intended to.
In accompanying drawing, same or analogous reference represents same or analogous parts.
Detailed description of the invention
It should be mentioned that, some are exemplary before being discussed in greater detail exemplary embodiment Embodiment is described as process or the method described as flow chart.Although flow chart is by every behaviour Be described into order process, but many of which operation can by concurrently, concomitantly or Person implements simultaneously.Additionally, the order of operations can be rearranged.When it has operated Shi Suoshu process can be terminated, it is also possible to have the additional step being not included in accompanying drawing. Described process can correspond to method, function, code, subroutine, subprogram etc..
Alleged " computer equipment " within a context, also referred to as " computer ", referring to can be by fortune Row preset program or instruction perform the predetermined process process such as numerical computations and/or logical calculated Intelligent electronic device, it can include processor and memorizer, processor perform at memorizer In the programmed instruction that prestores to perform predetermined process process, or by ASIC, FPGA, DSP Perform predetermined process process on hardware, or combined by said two devices and realize.Computer equipment Include but not limited to server, PC, notebook computer, panel computer, smart mobile phone Deng.
Described computer equipment such as includes subscriber equipment and the network equipment.Wherein, described user Equipment includes but not limited to smart mobile phone, PDA, PC, notebook computer etc.;Described network sets Standby include but not limited to server group that single network server, multiple webserver form or Based on cloud computing (Cloud Computing) it is made up of a large amount of computers or the webserver Cloud, wherein, cloud computing is the one of Distributed Calculation, by a group loosely-coupled computer collection One super virtual machine of composition.Wherein, described computer equipment can isolated operation come real The existing present invention, it is possible to access network by the mutual behaviour with other computer equipments in network Make to realize the present invention.Wherein, the network residing for described computer equipment includes but not limited to mutually Networking, wide area network, Metropolitan Area Network (MAN), LAN, VPN etc..
It should be noted that described subscriber equipment, the network equipment and network etc. are only for example, its He is such as applicable to the present invention, also at existing or that be likely to occur from now on computer equipment or network Within should being included in scope, and it is incorporated herein with way of reference.
The method (some of them are illustrated by flow chart) discussed herein below can be by hard Part, software, firmware, middleware, microcode, hardware description language or its combination in any are come Implement.When implementing by software, firmware, middleware or microcode, in order to implement necessary appointing Business program code or code segment can be stored in machine or computer-readable medium (is such as deposited Storage media) in.(one or more) processor can implement the task of necessity.
Concrete structure disclosed herein and function detail are the most representational, and be for The purpose of the exemplary embodiment of the present invention is described.But the present invention can replace shape by many Formula implements, and is not interpreted as being limited only by the embodiments set forth herein.
Although it should be appreciated that here may have been used term " first ", " second " etc. Describe unit, but these unit should not be limited by these terms.Use these arts Language is only used to make a distinction a unit with another unit.For example, do not carrying on the back In the case of the scope of exemplary embodiment, first module can be referred to as second unit, and And second unit can be referred to as first module similarly.Term "and/or" used herein above Including one of them or any and all combination of more listed associated item.
Term used herein above is only used to describe specific embodiment and be not intended to limit and show Example embodiment.Unless the context clearly dictates otherwise, odd number shape the most used herein above Formula " one ", " one " also attempt to include plural number.It is to be further understood that used herein above Term " include " and/or " comprising " specify stated feature, integer, step, operation, unit And/or the existence of assembly, and do not preclude the presence or addition of other features one or more, integer, Step, operation, unit, assembly and/or a combination thereof.
It should further be mentioned that in some replace implementation, the function/action being previously mentioned can With according to being different from accompanying drawing the order generation indicated.For example, involved merit is depended on Energy/action, the two width figures in succession illustrated can essentially substantially simultaneously perform or the most permissible Perform in a reverse order.
The invention discloses a kind of scheme extracting data items from the session content of user.Should Data extraction scheme can perform at subscriber equipment end, it is also possible to performs at network equipment end, or Can be coordinated with the network equipment from subscriber equipment and perform.
Such as, with subscriber equipment end carry out data extract illustrate, specifically, when detecting Data extract trigger action, and subscriber equipment determines that from current sessions described data are extracted and triggers behaviour Data corresponding to work extract region;Subsequently, according to goal task arrangement, subscriber equipment according to Described goal task arrangement is comprised each data items characteristic of correspondence labelling respectively, from described number According to extracting the contents of a project of each data items described in extracted region.
When being carried out data extraction by the network equipment, concrete steps are with above-mentioned at subscriber equipment end The step carrying out data extraction is identical, does not repeats them here.When by subscriber equipment and the network equipment When cooperation carries out data extraction, the above-mentioned step determining that data extract region and extraction data items Suddenly can be performed with the network equipment by subscriber equipment respectively, and both can arbitrarily perform aforementioned One of two steps.
For purposes of illustration only, following many data with the subscriber equipment execution present invention carry in this specification The scheme of taking carries out citing and illustrates, but, those skilled in the art will be understood that this kind of citing It is only used for illustrating the purpose of the present invention, and is understood not to any limitation of the invention, The present invention is equally performed with coordinating of the network equipment from the network equipment or subscriber equipment.
Further, the present invention is applicable to user conversation scene, the most various based on instant messaging The session context of application.Wherein, instant messaging application provides a kind of real-time communication system, generally The transmission word message, file, voice and the video that allow two people or many people to use network real-time are handed over Stream.When being applied to cluster conversation scene, the present invention can be from the session content of multiple users The data items that middle extraction goal task arranges, this is favourable under office scene.
Below in conjunction with the accompanying drawings the present invention is described in further detail.
Fig. 1 illustrates method flow diagram according to an embodiment of the invention, specifically illustrates a kind of number According to the process of extraction.
As it is shown in figure 1, in step sl, when detecting that data extract trigger action, subscriber equipment From current sessions, determine that described data are extracted the data corresponding to trigger action and extracted region;In step In rapid S2, according to goal task arrangement, subscriber equipment is comprised according to described goal task arrangement Each data items characteristic of correspondence labelling respectively, extracts each data described in extracted region from described data The contents of a project of project.
Specifically, in step sl, when detecting that data extract trigger action, subscriber equipment is from working as Front session determining, data are extracted the data corresponding to trigger action and extracted region.
Here, data extract trigger action include any present invention of being applicable to, can be set Come trigger data extract operation, the most various on screen along the slide of certain orientation, Such as go up pulling process, glide operation, slide etc. from left to right, or various on screen Press operation, such as double click operation, long by operation etc..
Additionally, it can also be the various operations triggered by button that data extract trigger action, wherein press Key can be physical button, it is also possible to be virtual key.Such as, session interface provides a void Intend button, when user clicks on this virtual key, i.e. start the data extraction procedure of the present invention.Or, Volume key is set to triggering key, when user presses set volume key, i.e. it is believed that detect Trigger action is extracted to data.
When detecting that data extract trigger action, subscriber equipment can determine according in the following manner Data extraction region:
1) directly region is extracted as data in the session content region in current screen;
Such as, when detecting that data extract trigger action, the pull-up behaviour applied on screen such as user Make, subscriber equipment will current screen region, namely the session content district presented in current screen Territory, extracts the data corresponding to trigger action as data and extracts region.
2) region is extracted as data in the session content region occurred in predetermined amount of time;
Here, the time period carrying out data extraction can pre-set, as former hours in Or today etc., depend on concrete application demand.Such as, predetermined amount of time is arranged to today, then When detecting that data extract trigger action, the session content place that subscriber equipment will occur today Region is extracted the data corresponding to trigger action as data and is extracted region.
3) determine that data extract region according to user's appointment.
After detecting that data extract trigger action, subscriber equipment can also receive user further Data are extracted the appointment in region, namely is carried out the beginning and end in setting data extraction region by user. Such as, when detecting that data extract trigger action, subscriber equipment prompts the user with and need to select further Beginning and end, thus user is extracted by slip screen selected data from session content region The beginning and end in region.
Further, it can also be an operative combination that data extract trigger action, and operative combination includes At least two operates, to be respectively used to determine the beginning and end that data extract region.
Such as, the side length of user's first current sessions content area presses screen, slide downward subsequently This session content region after stopping sliding, then vice-minister presses screen, subscriber equipment will user two Vice-minister presses the starting point extracting region at session content region corresponding during screen respectively as data And terminal.Alternately, user, after first length presses screen, slides subsequently on screen, user Equipment can will extract region as data terminal at the stopping of user's slide.
Those skilled in the art will be understood that above-mentioned each item data extracts trigger action and corresponding Data are extracted the determination mode in region and are example, are only used for illustrating the purpose of the present invention, and Being not construed as any limitation of the invention, other are any existing or data in the future are extracted and touched Send out operation and corresponding data are extracted the determination mode in region and are such as applicable to the present invention, all should be by It is included in the scope of patent protection of the present invention, and is incorporated in this.
In step s 2, according to goal task arrangement, subscriber equipment arranges according to described goal task Comprised each data items characteristic of correspondence labelling respectively, extracted described in extracted region from described data The contents of a project of each data items.
Wherein, goal task arrangement includes any to be realized by the data extraction scheme of the present invention Task arrangement, it can include that multiple data items, each data items have the data specified Form/content, these data items can extract from the session content of user.Such as, target is appointed Business arrangement is specifically including but not limited to schedule, project is reported, submit an expense account examination & approval etc..These targets Task arrangement the most each has specific data items.Specifically, such as, schedule can be wrapped Including the data items such as such as time, place, item, participant, project is reported and can be included such as The data items such as project name, director, progress, report time, reimbursement examination & approval can include all Such as data items such as item, time, expense, applicant, approvers.
Here, goal task arrangement can be predetermined, it is also possible to be to detect that data extraction is touched Determine in real time after sending out operation.
Such as, the scene arranged for predeterminated target task, conversation group is based on goal task arrangement Setting up, now current sessions group is only used for discussing the content relevant to goal task arrangement, or Person is only capable of from current sessions group extracting the content relevant to goal task arrangement.Further, when The title of front conversation group or attribute i.e. may be configured as corresponding goal task arrangement, such as submit an expense account group, XX project cluster.
Such as, for determining the scene that goal task arranges in real time, goal task arrangement can be with number It is associated according to extracting trigger action, thus different data extraction trigger actions can be corresponding different Goal task arranges.Specifically, such as, on screen, upper pulling process from the bottom to top can associate Project is reported, and slide from left to right can associate reimbursement examination & approval.Preferably, aforementioned difference Data extract trigger action can corresponding different goal task arrangement, it is possible to be used for same Conversation group.
Here, the signature storehouse of each data items can pre-build.
The signature storehouse of data items can be an independent storehouse, such as, include that any task is pacified The labelling of the data items of row.Or, the signature storehouse of data items can also be each task Arrange institute specific, that is, the spy that task arrangement is to there being a data items included by it Levy signature library.
Wherein, features described above signature library can be set up by the network equipment, it is also possible to is that user sets Standby foundation.
Such as, the network equipment pre-builds the signature storehouse of a data items, including institute Have task arrange corresponding to the signature of data items, the most such as: data items " name " Signature be@, the signature of data items " theme " be #, the spy of data items " amount of money " Levy and be labeled as $, the signature of data items " mail " is mailto etc..
Such as, subscriber equipment is at locally created each number about task arrangement " project report " According to the signature storehouse of project, including such as: the signature of data items " project name " It is that " head ", data items " enter for the signature of " task ", data items " director " Degree " signature be " progress ", the signature of data items " report time " be " time " Deng.
It should be noted that the above-mentioned citing to each signature is only the mesh illustrating the present invention , and it is understood not to any limitation of the invention.If other existing or in the future each The representation planting signature is equally used for expressing the data items defined in the present invention, then Also belong to the signature of data items indicated by the present invention, therefore should be comprised in the present invention's Within scope of patent protection, it is possible to way of reference is incorporated herein.
For the signature storehouse pre-build, subscriber equipment can be inquired about this feature signature library and come really Each data items distinguished characteristic of correspondence labelling that the task that sets the goal arranges, and then extract from data Each detailed programs content that extracted region is corresponding.
Such as, goal task is arranged to " project report ", and subscriber equipment is according to its data items " item Mesh title " signature " task ", the signature " head " of data items " director ", The signature " progress " of data items " progress " and data items " report time " Signature " time " from determined by data extract region and extract corresponding detailed programs respectively Content, extracting region such as data is the session content of today, then subscriber equipment is according to signature " task " extracts the contents of a project of therefrom data items " project name " is note3bug, according to It is an XX that signature " head " therefrom extracts the contents of a project of data items " director ", The contents of a project therefrom extracting data items " progress " according to signature " progress " are final Test, therefrom extracts according to signature " time " in the project of data items " report time " Hold for 2015/xx/xx.
Preferably, each data items can be at least through following 3 kinds with the corresponding relation of signature Arrange:
1) session content of multiple user is added up, to determine that each data items is corresponding with signature Relation.
Specifically, statistics a large number of users session content, and by the way of machine learning constantly from The middle corresponding relation determining data items and signature.Such as, a large number of users can be defeated after@ Enter name, may determine that signature@is corresponding to data items " participant " accordingly.
2) each data items and the corresponding relation of signature that at least one user uploads are received.
Specifically, each user can arrange the corresponding relation of data items and signature voluntarily, Thus subscriber equipment, the even network equipment, each data item of each user setup can be obtained accordingly Mesh and the corresponding relation of signature, to be subsequently used for the extraction of the contents of a project to relative users.
Preferably, its each set data items and the feature mark that multiple users upload is being obtained After the corresponding relation of note, these corresponding relations can be collected/screening etc. processes, thus obtains Obtain the one-to-one relationship of data items finally and signature, and this one-to-one relationship is returned Back to each user, so that all users can use unified form of presentation to conversate input. Corresponding relation based on this kind of unified data items Yu signature, at the item carrying out data items It is favourable and efficient during mesh contents extraction, and may be used without identical rule between user and enter Row exchange.
3) according to the language convention of participating user each in current sessions, each data items and feature are determined The corresponding relation of labelling.
Specifically, the angle from the language convention of each user accounts for determining that it is the most preferred The corresponding relation of data items and signature, the input of this more convenient user and new user's Experience.Such as, some user habit carrys out presentation data project " time " with " time ", Some user habit carrys out presentation data project " time " with " time ", then can use for concrete Family come according to its each language convention by the signature identification of data items " time " corresponding to " time " Or " time ".
It should be noted that those skilled in the art will be understood that above-mentioned 3) plant set-up mode also Non-mutually exclusive, but can combine to for determining the right of each data items and signature Should be related to.Further, above-mentioned 3) plant set-up mode to realize at subscriber equipment end, it is also possible to Network equipment end realizes.Certainly, when needing the setting to multiple users or language convention etc. to locate During reason, network equipment end carry out the determination of final data items and the corresponding relation of signature Advantageously and efficiently.
Preferably, subscriber equipment extract goal task arrange each data items the contents of a project it After, projects content can also be edited by user, to supplement/to revise the contents of a project extracted. Further, subscriber equipment can also using sectional drawing at the session content belonging to relevant item content as The supplemental content that goal task arranges, and present to user.
Fig. 2 illustrates system schematic according to an embodiment of the invention, specifically illustrates a kind of number According to the system extracted.As in figure 2 it is shown, data extraction system includes extracting area determining device 21 With data item extraction device 22.
It should be noted that extraction area determining device 21 and data item extraction device 22 are permissible All it is arranged in subscriber equipment, namely data extraction system is arranged in subscriber equipment;Or, Extract area determining device 21 and data item extraction device 22 be arranged separately in subscriber equipment and In the network equipment, thus both collectively constitute data extraction system;Or, extract region and determine dress Put 21 and data item extraction device 22 can all be arranged in the network equipment, namely data carry The system of taking is arranged in the network equipment.
For purposes of illustration only, following being arranged in subscriber equipment with data extraction system is described, Those skilled in the art will be understood that this kind of description is not only used for illustrating the purpose of the present invention, and not Should be understood any limitation of the invention.
Specifically, when detecting that data extract trigger action, area determining device 21 is extracted from currently Session determining, described data are extracted the data corresponding to trigger action and extracted region;Subsequently, according to Goal task arranges, and data items extraction element 22 is comprised respectively according to described goal task arrangement Data items characteristic of correspondence labelling respectively, extracts each data item described in extracted region from described data The purpose contents of a project.
Wherein, when detecting that data extract trigger action, extract area determining device 21 from current meeting Words determining, data are extracted the data corresponding to trigger action and extracted region.
Here, data extract trigger action include any present invention of being applicable to, can be set Come trigger data extract operation, the most various on screen along the slide of certain orientation, Such as go up pulling process, glide operation, slide etc. from left to right, or various on screen Press operation, such as double click operation, long by operation etc..
Additionally, it can also be the various operations triggered by button that data extract trigger action, wherein press Key can be physical button, it is also possible to be virtual key.Such as, session interface provides a void Intend button, when user clicks on this virtual key, i.e. start the data extraction procedure of the present invention.Or, Volume key is set to triggering key, when user presses set volume key, i.e. it is believed that detect Trigger action is extracted to data.
When detecting that data extract trigger action, extracting area determining device 21 can be according to following Mode determine data extract region:
1) directly region is extracted as data in the session content region in current screen;
Such as, when detecting that data extract trigger action, the pull-up behaviour applied on screen such as user Make, extract area determining device 21 will current screen region, namely current screen is presented Session content region, extract the data corresponding to trigger action as data and extract region.
2) region is extracted as data in the session content region occurred in predetermined amount of time;
Here, the time period carrying out data extraction can pre-set, as former hours in Or today etc., depend on concrete application demand.Such as, predetermined amount of time is arranged to today, then When detecting that data extract trigger action, extract the meeting that area determining device 21 will occur today The region at words content place is extracted the data corresponding to trigger action as data and is extracted region.
3) determine that data extract region according to user's appointment.
After detecting that data extract trigger action, extract area determining device 21 and can also enter one Step receives user and data is extracted the appointment in region, namely is come setting data extraction region by user Beginning and end.Such as, when detecting that data extract trigger action, area determining device 21 is extracted Prompt the user with and need to select beginning and end further, thus user by slip screen from session In content area, selected data extracts the beginning and end in region.
Further, it can also be an operative combination that data extract trigger action, and operative combination includes At least two operates, to be respectively used to determine the beginning and end that data extract region.
Such as, the side length of user's first current sessions content area presses screen, slide downward subsequently This session content region after stopping sliding, then vice-minister press screen, extraction area determining device 21 Will user two vice-minister press at session content region corresponding during screen respectively as data extraction The beginning and end in region.Alternately, user is after first length presses screen, subsequently on screen Sliding, extracting area determining device 21 can carry as data at the stopping of user's slide Take the terminal in region.
It should be noted that when extracting area determining device 21 and being arranged in the network equipment, right In user in the various operations of user equipment side, wait operation as the length of screen is pressed, can be by user Equipment Inspection also notifies to extract area determining device 21, extracts area determining device 21 and then permissible Determine that corresponding data extract region according to various pre-defined rules.Further, subscriber equipment also may be used To send other operation related datas, such as detect that screen length is by session in current screen during operation The beginning and end of content area, the screen length session content position etc. as corresponding to operation.
Those skilled in the art will be understood that above-mentioned each item data extracts trigger action and corresponding Data are extracted the determination mode in region and are example, are only used for illustrating the purpose of the present invention, and Being not construed as any limitation of the invention, other are any existing or data in the future are extracted and touched Send out operation and corresponding data are extracted the determination mode in region and are such as applicable to the present invention, all should be by It is included in the scope of patent protection of the present invention, and is incorporated in this.
According to goal task arrangement, data items extraction element 22 arranges institute according to described goal task Comprise each data items characteristic of correspondence labelling respectively, extract described in extracted region each from described data The contents of a project of data items.
Wherein, goal task arrangement includes any to be realized by the data extraction scheme of the present invention Task arrangement, it can include that multiple data items, each data items have the data specified Form/content, these data items can extract from the session content of user.Such as, target is appointed Business arrangement is specifically including but not limited to schedule, project is reported, submit an expense account examination & approval etc..These targets Task arrangement the most each has specific data items.Specifically, such as, schedule can be wrapped Including the data items such as such as time, place, item, participant, project is reported and can be included such as The data items such as project name, director, progress, report time, reimbursement examination & approval can include all Such as data items such as item, time, expense, applicant, approvers.
Here, goal task arrangement can be predetermined, it is also possible to be to detect that data extraction is touched Determine in real time after sending out operation.
Such as, the scene arranged for predeterminated target task, conversation group is based on goal task arrangement Setting up, now current sessions group is only used for discussing the content relevant to goal task arrangement, or Person is only capable of from current sessions group extracting the content relevant to goal task arrangement.Further, when The title of front conversation group or attribute i.e. may be configured as corresponding goal task arrangement, such as submit an expense account group, XX project cluster.
Such as, for determining the scene that goal task arranges in real time, goal task arrangement can be with number It is associated according to extracting trigger action, thus different data extraction trigger actions can be corresponding different Goal task arranges.Specifically, such as, on screen, upper pulling process from the bottom to top can associate Project is reported, and slide from left to right can associate reimbursement examination & approval.Preferably, aforementioned difference Data extract trigger action can corresponding different goal task arrangement, it is possible to be used for same Conversation group.
Here, the determination that goal task arranges can be come by a special device (Fig. 2 is not shown) Perform.Such as, data extraction system may also include a task arrangement and determines device, and this device is used for Determine goal task arrangement.Preferably, this device can also collect with data items extraction element 22 Become together.
Here, the signature storehouse of each data items can pre-build.
The signature storehouse of data items can be an independent storehouse, such as, include that any task is pacified The labelling of the data items of row.Or, the signature storehouse of data items can also be each task Arrange institute specific, that is, the spy that task arrangement is to there being a data items included by it Levy signature library.
Wherein, features described above signature library can be set up by the network equipment, it is also possible to is that user sets Standby foundation.
Such as, the network equipment pre-builds the signature storehouse of a data items, including institute Have task arrange corresponding to the signature of data items, the most such as: data items " name " Signature be@, the signature of data items " theme " be #, the spy of data items " amount of money " Levy and be labeled as $, the signature of data items " mail " is mailto etc..
Such as, subscriber equipment is at locally created each number about task arrangement " project report " According to the signature storehouse of project, including such as: the signature of data items " project name " It is that " head ", data items " enter for the signature of " task ", data items " director " Degree " signature be " progress ", the signature of data items " report time " be " time " Deng.
It should be noted that the above-mentioned citing to each signature is only the mesh illustrating the present invention , and it is understood not to any limitation of the invention.If other existing or in the future each The representation planting signature is equally used for expressing the data items defined in the present invention, then Also belong to the signature of data items indicated by the present invention, therefore should be comprised in the present invention's Within scope of patent protection, it is possible to way of reference is incorporated herein.
For the signature storehouse pre-build, subscriber equipment can be inquired about this feature signature library and come really Each data items distinguished characteristic of correspondence labelling that the task that sets the goal arranges, and then extract from data Each detailed programs content that extracted region is corresponding.
Such as, goal task is arranged to " project report ", and subscriber equipment is according to its data items " item Mesh title " signature " task ", the signature " head " of data items " director ", The signature " progress " of data items " progress " and data items " report time " Signature " time " from determined by data extract region and extract corresponding detailed programs respectively Content, extracting region such as data is the session content of today, then subscriber equipment is according to signature " task " extracts the contents of a project of therefrom data items " project name " is note3bug, according to It is an XX that signature " head " therefrom extracts the contents of a project of data items " director ", The contents of a project therefrom extracting data items " progress " according to signature " progress " are final Test, therefrom extracts according to signature " time " in the project of data items " report time " Hold for 2015/xx/xx.
Preferably, data extraction system can also include arranging device by a signature (Fig. 2 not show Go out), signature arranges device at least can arrange each data items with special by following 3 kinds Levy the corresponding relation of labelling:
1) session content of multiple user is added up, to determine that each data items is corresponding with signature Relation.
Specifically, signature arranges the session content of device statistics a large number of users, and passes through machine The mode of study the most therefrom determines the corresponding relation of data items and signature.Such as, in a large number User can input name after@, may determine that signature@is corresponding to data items " ginseng accordingly With people ".
2) each data items and the corresponding relation of signature that at least one user uploads are received.
Specifically, each user can arrange the corresponding relation of data items and signature voluntarily, Thus signature arranges device and can obtain each data items of each user setup accordingly with special Levy the corresponding relation of labelling, to be subsequently used for the extraction of the contents of a project to relative users.
Preferably, its each set data items and the feature mark that multiple users upload is being obtained Note corresponding relation after, signature arrange device these corresponding relations can be collected/ Screenings etc. process, thus obtain the one-to-one relationship of final data items and signature, and This one-to-one relationship is returned to each user, so that all users can use unified statement Mode conversates input.Corresponding relation based on this kind of unified data items Yu signature, It is favourable and efficient when the contents of a project carrying out data items are extracted, and also may be used between user Identical rule is used to exchange.
3) according to the language convention of participating user each in current sessions, each data items and feature are determined The corresponding relation of labelling.
Specifically, the angle from the language convention of each user accounts for determining that it is the most preferred The corresponding relation of data items and signature, the input of this more convenient user and new user's Experience.Such as, some user habit carrys out presentation data project " time " with " time ", Some user habit carrys out presentation data project " time " with " time ", then can use for concrete Family come according to its each language convention by the signature identification of data items " time " corresponding to " time " Or " time ".
It should be noted that those skilled in the art will be understood that above-mentioned 3) plant set-up mode also Non-mutually exclusive, but can combine to for determining the right of each data items and signature Should be related to.Further, above-mentioned 3) plant set-up mode to realize at subscriber equipment end, it is also possible to Network equipment end realizes.Certainly, when needing the setting to multiple users or language convention etc. to locate During reason, network equipment end carry out the determination of final data items and the corresponding relation of signature Advantageously and efficiently.
Preferably, in data extraction system extracts the project of each data items that goal task arranges After appearance, projects content can also be edited by user, to supplement/to revise the project extracted Content.Further, data extraction system can also be by the session content belonging to relevant item content The supplemental content that place's sectional drawing arranges as goal task, and present to user.
It should be noted that the present invention can be carried out in the assembly of hardware at software and/or software, Such as, can use special IC (ASIC), general purpose computer or any other be similar to Hardware device realizes.In one embodiment, the software program of the present invention can pass through processor Perform to realize steps described above or function.Similarly, the software program of the present invention (includes phase The data structure closed) can be stored in computer readable recording medium storing program for performing, such as, RAM deposits Reservoir, magnetically or optically driver or floppy disc and similar devices.It addition, some steps of the present invention or Function can employ hardware to realize, such as, as coordinate with processor thus perform each step or The circuit of function.
It addition, the part of the present invention can be applied to computer program, such as computer journey Sequence instructs, and when it is computer-executed, by the operation of this computer, can call or provide The method according to the invention and/or technical scheme.And call the programmed instruction of the method for the present invention, can Can be stored in fixing or movably in record medium, and/or by broadcasting or other signals hold Carry the data stream in media and be transmitted, and/or be stored in the meter run according to described programmed instruction Calculate in the working storage of machine equipment.Here, include a dress according to one embodiment of present invention Putting, this device includes the memorizer for storing computer program instructions and for performing programmed instruction Processor, wherein, when this computer program instructions is performed by this processor, trigger this device Run methods based on aforementioned multiple embodiments according to the present invention and/or technical scheme.
It is obvious to a person skilled in the art that the invention is not restricted to above-mentioned one exemplary embodiment Details, and without departing from the spirit or essential characteristics of the present invention, it is possible to it His concrete form realizes the present invention.Therefore, no matter from the point of view of which point, all should be by embodiment Regarding exemplary as, and be nonrestrictive, the scope of the present invention is by claims Rather than described above limit, it is intended that by fall claim equivalency implication and In the range of all changes be included in the present invention.Should be by any accompanying drawing mark in claim Note is considered as limiting involved claim.Furthermore, it is to be understood that " an including " word is not excluded for other lists Unit or step, odd number is not excluded for plural number.The multiple unit stated in system claims or device Can also be realized by software or hardware by a unit or device.The first, the second word such as grade Pragmatic represents title, and is not offered as any specific order.

Claims (18)

1. a data extraction method, wherein, the method includes:
A, when detecting that data extract trigger action, determines that from current sessions described data are extracted and touches Send out the data corresponding to operation and extract region;
B, according to goal task arrangement, is comprised each data items according to described goal task arrangement and is divided Other characteristic of correspondence labelling, in the project of each data items described in described data extract extracted region Hold.
Method the most according to claim 1, wherein, the method also includes:
-corresponding relation of each data items and signature is set.
Method the most according to claim 2, wherein, the method also includes:
-add up the session content of multiple user, to determine that each data items is corresponding with signature Relation.
The most according to the method in claim 2 or 3, wherein, the method also includes:
-receive each data items and the corresponding relation of signature that at least one user uploads.
5. according to the method according to any one of claim 2 to 4, wherein, the method also includes:
-according to the language convention of participating user each in described current sessions, determine each data items with The corresponding relation of signature.
Method the most according to any one of claim 1 to 5, wherein, the method also includes:
-determine described goal task arrangement.
Method the most according to claim 6, wherein, described goal task arrangement and described number It is associated according to extracting trigger action.
Method the most according to any one of claim 1 to 7, wherein, described data are extracted Trigger action is an operative combination, and described operative combination includes that at least two operates, to be respectively used to Determine that described data extract the beginning and end in region.
Method the most according to any one of claim 1 to 8, wherein, described goal task Arrangement includes following any one:
-schedule;
-project is reported;
-reimbursement examination & approval.
10. a data extraction system, wherein, this system includes:
Extract area determining device, for when detecting that data extract trigger action, from current sessions Middle determine that described data are extracted data corresponding to trigger action and extracted region;
Data items extraction element, for according to goal task arrangement, pacifies according to described goal task Row is comprised each data items characteristic of correspondence labelling respectively, extracts extracted region institute from described data State the contents of a project of each data items.
11. systems according to claim 10, wherein, this system also includes:
Signature arranges device, for arranging the corresponding relation of each data items and signature.
12. systems according to claim 11, wherein, described signature arranges device and enters One step is used for:
-add up the session content of multiple user, to determine that each data items is corresponding with signature Relation.
13. according to the system described in claim 11 or 12, and wherein, described signature is arranged Device is further used for:
-receive each data items and the corresponding relation of signature that at least one user uploads.
14. according to the system according to any one of claim 11 to 13, wherein, and described feature Tabbing apparatus is further used for:
-according to the language convention of participating user each in described current sessions, determine each data items with The corresponding relation of signature.
15. according to the system according to any one of claim 10 to 14, and wherein, this system is also Including:
Task arrangement determines device, is used for determining described goal task arrangement.
16. systems according to claim 15, wherein, described goal task arrangement is with described Data are extracted trigger action and are associated.
17. according to the system according to any one of claim 10 to 16, wherein, and described data Extracting trigger action is an operative combination, and described operative combination includes that at least two operates, with respectively For determining that described data extract the beginning and end in region.
18. according to the system according to any one of claim 10 to 17, wherein, and described target Task arrangement includes following any one:
-schedule;
-project is reported;
-reimbursement examination & approval.
CN201610287295.4A 2016-05-03 2016-05-03 Data extracting method and system Pending CN106033298A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201610287295.4A CN106033298A (en) 2016-05-03 2016-05-03 Data extracting method and system
PCT/CN2017/082844 WO2017190654A1 (en) 2016-05-03 2017-05-03 Method and system for data extraction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610287295.4A CN106033298A (en) 2016-05-03 2016-05-03 Data extracting method and system

Publications (1)

Publication Number Publication Date
CN106033298A true CN106033298A (en) 2016-10-19

Family

ID=57149320

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610287295.4A Pending CN106033298A (en) 2016-05-03 2016-05-03 Data extracting method and system

Country Status (2)

Country Link
CN (1) CN106033298A (en)
WO (1) WO2017190654A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017190654A1 (en) * 2016-05-03 2017-11-09 海致网络技术(北京)有限公司 Method and system for data extraction
WO2022152040A1 (en) * 2021-01-18 2022-07-21 北京字跳网络技术有限公司 Information processing method and apparatus, and electronic device and storage medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112650430A (en) * 2020-12-28 2021-04-13 北京达佳互联信息技术有限公司 Task processing method and device and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104378441A (en) * 2014-11-25 2015-02-25 小米科技有限责任公司 Schedule creating method and device
WO2015141101A1 (en) * 2014-03-20 2015-09-24 日本電気株式会社 Information-processing device, information processing method, and information-processing program
CN105302885A (en) * 2015-10-15 2016-02-03 北京锐安科技有限公司 Full-text data extraction method and device
CN105389304A (en) * 2015-10-27 2016-03-09 小米科技有限责任公司 Event extraction method and apparatus

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5864848A (en) * 1997-01-31 1999-01-26 Microsoft Corporation Goal-driven information interpretation and extraction system
US7487456B2 (en) * 2005-04-06 2009-02-03 Microsoft Corporation System and method for automatically populating appointment fields
CN103325031A (en) * 2012-03-19 2013-09-25 联想(北京)有限公司 Schedule reminding method and terminal
CN104796327B (en) * 2015-04-30 2018-09-28 上海众源网络有限公司 Message receival method and device, method for message transmission and system
CN106033298A (en) * 2016-05-03 2016-10-19 海致网络技术(北京)有限公司 Data extracting method and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015141101A1 (en) * 2014-03-20 2015-09-24 日本電気株式会社 Information-processing device, information processing method, and information-processing program
CN104378441A (en) * 2014-11-25 2015-02-25 小米科技有限责任公司 Schedule creating method and device
CN105302885A (en) * 2015-10-15 2016-02-03 北京锐安科技有限公司 Full-text data extraction method and device
CN105389304A (en) * 2015-10-27 2016-03-09 小米科技有限责任公司 Event extraction method and apparatus

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017190654A1 (en) * 2016-05-03 2017-11-09 海致网络技术(北京)有限公司 Method and system for data extraction
WO2022152040A1 (en) * 2021-01-18 2022-07-21 北京字跳网络技术有限公司 Information processing method and apparatus, and electronic device and storage medium
CN114827066A (en) * 2021-01-18 2022-07-29 北京字跳网络技术有限公司 Information processing method, device, electronic equipment and storage medium
CN114827066B (en) * 2021-01-18 2024-02-20 北京字跳网络技术有限公司 Information processing method, apparatus, electronic device and storage medium

Also Published As

Publication number Publication date
WO2017190654A1 (en) 2017-11-09

Similar Documents

Publication Publication Date Title
CN109952610B (en) Selective identification and ordering of image modifiers
CN106303730B (en) A kind of method and apparatus for being used to provide combination barrage information
CN106874338A (en) Equipment, method and graphic user interface for manipulating user interface object using vision and/or touch feedback
CN102439973B (en) Video resouce management method and device in video conference
CN110377193A (en) Using confirmation option in graphical messages transmission user interface
CN101073048A (en) A content-management interface
CN110189089A (en) Method, system and mobile device for communication
CN105898520A (en) Video frame interception method and device
CN103310099A (en) Method and system for realizing augmented reality by adopting image capture and recognition technology
CN102289337A (en) Brand new display method of mobile terminal interface
CN103064584A (en) Method and device for pasting
CN110582018A (en) Video file processing method, related device and equipment
CN105900053A (en) Interface device for link designation, interface device for viewer, and computer program
CN105933730A (en) Video association information recommendation method and device
CN104835059A (en) Somatosensory interaction technology-based intelligent advertisement delivery system
CN106033298A (en) Data extracting method and system
CN105916037A (en) Video comment information processing method and apparatus
CN105612511A (en) Identifying and structuring related data
CN105869009A (en) Advertisement playing method and apparatus in video
CN106201224A (en) The method and device that a kind of batch data processes
CN113918522A (en) File generation method and device and electronic equipment
CN102411467B (en) Electronic equipment and its contents management method
CN103019546B (en) A kind of slideshow method, system and apparatus for demonstrating
CN104394478A (en) Method and player for playing video
CN114040248A (en) Video processing method and device and electronic equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20161019