CN112862617A - Data processing method, system, storage medium and electronic device - Google Patents

Data processing method, system, storage medium and electronic device Download PDF

Info

Publication number
CN112862617A
CN112862617A CN201911189102.1A CN201911189102A CN112862617A CN 112862617 A CN112862617 A CN 112862617A CN 201911189102 A CN201911189102 A CN 201911189102A CN 112862617 A CN112862617 A CN 112862617A
Authority
CN
China
Prior art keywords
theme
stock
index
information
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911189102.1A
Other languages
Chinese (zh)
Inventor
贾鸿飞
杨光
崔勇
杨雪松
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Taikang Asset Management Co ltd
Taikang Insurance Group Co Ltd
Original Assignee
Taikang Asset Management Co ltd
Taikang Insurance Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Taikang Asset Management Co ltd, Taikang Insurance Group Co Ltd filed Critical Taikang Asset Management Co ltd
Priority to CN201911189102.1A priority Critical patent/CN112862617A/en
Publication of CN112862617A publication Critical patent/CN112862617A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/06Asset management; Financial planning or analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Operations Research (AREA)
  • Game Theory and Decision Science (AREA)
  • Human Resources & Organizations (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention provides a data processing method, a system, a medium and electronic equipment, wherein a data acquisition device acquires stock related information in real time and sends the stock related information to a data analysis device; the data analysis device selects stocks related to the theme of the theme index to be constructed according to the stock related information collected in real time, calculates the relevance of each selected stock and the theme, and selects the first stocks with larger relevance to the theme to construct the theme index of the theme; and displaying the selected stock and the constructed theme index by the data presentation device. The technical scheme of the embodiment of the invention does not depend on artificial subjective selection, can automatically select high-quality stocks which objectively reflect the development trend of the theme and have enough performance support to dynamically construct the theme index according to the information collected in real time, and provides relatively objective and accurate guidance for theme investment.

Description

Data processing method, system, storage medium and electronic device
Technical Field
The present invention relates to the field of data mining, and in particular, to a method, system, storage medium, and electronic device for constructing a topic index based on stock information.
Background
The theme investment is judged and grasped by taking a certain event development trend as core logic, and investment opportunities of stocks with common attribute characteristics are mined by searching for over-expectations or manufacturing expectations. The theme investment is characterized in that stocks are not selected according to a general industry division method, but a certain factor driving the long-term development trend of an economic body is taken as a theme to select regions, industries, plates or stocks. The theme investment index (also called theme index for short) is an index which is constructed according to a certain rule by selecting component shares according to a determined theme and can reflect the market performance of the invested theme. Meanwhile, the theme index can also be used as an important investment carrier, and by tracking and copying the theme index, an investor can realize corresponding theme investment under the condition of transparency and low cost. It can be said that the topic index is an extension of the topic investment philosophy, which makes the topic investment more practical and less costly.
The existing theme investment is generally manually specified when selecting constituent stocks for the theme index, has high subjectivity, is difficult to reflect the change of the theme development trend in time after being specified for a long time, and cannot effectively measure the value of the theme investment. In addition, new stocks are continuously released, and a new valuable theme stock cannot be introduced in time by manually specifying constituent stocks, so that the theme index cannot accurately reflect market dynamics.
Disclosure of Invention
The embodiment of the invention aims to provide a method, a system, a medium and electronic equipment for dynamically constructing a theme index, which can automatically select high-quality stocks with sufficient performance support for objectively reflecting the theme development trend to dynamically construct the theme index based on real-time collected stock related information.
The purpose of the invention is realized by the following technical scheme:
according to a first aspect of the embodiments of the present invention, there is provided a data processing method, including: the data acquisition device acquires the related information of the stock in real time and sends the related information to the data analysis device; selecting stocks related to the theme of the theme index to be constructed by the data analysis device according to the stock related information acquired in real time by the data acquisition device, calculating the relevance of each selected stock to the theme, and selecting a plurality of previous stocks with larger relevance to the theme to construct the theme index of the theme; and displaying the selected stock and the constructed theme index by the data presentation means.
In some embodiments of the invention, the method may further comprise periodically collecting, by the data collection device, media news information, public behavior information, industry information, and stock related information; and discovering a new theme from the information collected by the data collection device by the data analysis device, and selecting the new theme as the theme of the theme index to be constructed.
In some embodiments of the invention, the method may further comprise receiving, by the data analysis device, a keyword as a topic of the topic index to be constructed in response to the keyword being received via a network or a user interface.
In some embodiments of the invention, the method may further comprise: and periodically or in response to a user request, recalculating the relevance of each stock to the theme according to the stock related information collected by the data collection device in a period of time, and reselecting the previous stocks which are more relevant to the theme to construct the theme index of the theme.
In some embodiments of the invention, the method may further comprise: the data acquisition device periodically acquires media news information, public behavior information, industry information and related stock information; and determining the heat degree of each theme by the data analysis device based on the information acquired by the data acquisition device, and selecting the theme of which the heat degree change exceeds a set threshold value as the theme of the theme index to be constructed.
In some embodiments of the invention, the relevance of each selected stock to the topic may be calculated by:
respectively extracting a word vector corresponding to the subject word and a word vector corresponding to each stock by using a pre-trained word vector model; and calculating the similarity between the word vector of the subject word and the word vector corresponding to each stock to obtain the correlation between the subject word and each stock.
In some embodiments of the invention, the stock-related information may include one or more of the following: a description of the stock, a public announcement, a periodic announcement, market data.
In some embodiments of the invention, the word vector model may be used by the data analysis device to calculate the relevance of each stock selected to the topic.
According to a second aspect of the embodiments of the present invention, there is provided a data processing system, including a data acquisition device, a data analysis device, and a data presentation device. The data acquisition device is used for acquiring the related stock information in real time and sending the related stock information to the data analysis device. The data analysis device is used for selecting stocks related to the theme of the theme index to be constructed according to the stock information acquired by the data acquisition device in real time, calculating the relevance of each selected stock to the theme, and selecting the first stocks with larger relevance to the theme to construct the theme index of the theme. The data presentation device is used for displaying the stocks selected by the data analysis device and the constructed theme index.
In some embodiments of the present invention, the data collection device may also be used to periodically collect media news information, public behavior information, industry information, and stock related information; and the data analysis device can also find a new theme from the information collected by the data collection device and select the new theme as the theme of the theme index to be constructed.
According to a third aspect of embodiments of the present invention, there is provided a computer readable medium, on which a computer program is stored, which program, when executed by a processor, performs the method as described in the first aspect of the embodiments above.
According to a fourth aspect of embodiments of the present invention, there is provided an electronic device comprising a processor and a memory, wherein the memory is configured to store executable instructions; the processor is configured to perform the method of the first aspect of the embodiments described above via execution of the executable instructions.
The technical scheme provided by the embodiment of the invention has the following beneficial effects: the method does not depend on artificial subjective selection, can automatically select high-quality stocks which objectively reflect the development trend of the theme and have enough performance support according to the information collected in real time to dynamically construct the theme index, and provides relatively objective and accurate guidance for theme investment.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the invention and together with the description, serve to explain the principles of the invention. It is obvious that the drawings in the following description are only some embodiments of the invention, and that for a person skilled in the art, other drawings can be derived from them without inventive effort. In the drawings:
FIG. 1 schematically shows a flow diagram of a data processing method for stock analysis according to one embodiment of the invention;
FIG. 2 schematically illustrates a schematic block diagram of a data processing system for stock analysis in accordance with one embodiment of the present invention;
FIG. 3 illustrates a schematic structural diagram of a computer system suitable for use with the electronic device to implement one embodiment of the present invention.
Detailed Description
Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many different forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art.
Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided to provide a thorough understanding of embodiments of the invention. One skilled in the relevant art will recognize, however, that the invention may be practiced without one or more of the specific details, or with other methods, components, devices, steps, and so forth. In other instances, well-known methods, devices, implementations or operations have not been shown or described in detail to avoid obscuring aspects of the invention.
The block diagrams shown in the figures are functional entities only and do not necessarily correspond to physically separate entities. I.e. these functional entities may be implemented in the form of software, or in one or more hardware modules or integrated circuits, or in different networks and/or processor means and/or microcontroller means.
The flow charts shown in the drawings are merely illustrative and do not necessarily include all of the contents and operations/steps, nor do they necessarily have to be performed in the order described. For example, some operations/steps may be decomposed, and some operations/steps may be combined or partially combined, so that the actual execution sequence may be changed according to the actual situation.
FIG. 1 schematically shows a flow diagram of data processing for stock analysis according to one embodiment of the invention. As shown in fig. 1, the method includes step S101, collecting, by a data collection device, stock related information in real time, and sending the stock related information to a data analysis device; step S102, selecting stocks related to the theme of the theme index to be constructed by a data analysis device according to stock related information collected by a data collection device in real time; step S103, calculating the relevance of each selected stock and the theme by the data analysis device; step S104, selecting a plurality of previous stocks with larger relevance with the theme by the data analysis device to construct a theme index of the theme; and a step S105 of displaying, by the data presentation means, the stocks selected by the data analysis means and the constructed topic indexes. It should be understood that although the execution bodies of the steps, such as the data acquisition device, the data analysis device and the data presentation device, are described above as separate independent devices, these devices may be executed in the same server, or may be served by different servers, or may be any one or more computing devices and combinations thereof for data processing related to stock analysis.
More specifically, in step S101, stock related information is collected in real time by the data collection device and transmitted to the data analysis device. Wherein the collected stock-related information may include one or more of the following: a description of the stock, a public announcement, a periodic announcement, market data. Such information may be obtained from, for example, a flat media dedicated to marketing company disclosure information, a stock exchange website, and some professional stock exchange information websites, as specified by the certificate authority.
In step S102, the data analysis device selects stocks related to the theme of the theme index to be constructed according to the stock related information collected by the data collection device in real time. For the stock information collected by the data collection device, a plurality of stocks relevant to the theme of the theme index to be constructed can be found through query methods such as keyword matching, similar meaning word matching and the like.
In one embodiment, the topic of the topic index to be constructed may be specified by the user, for example, a request of the user is received through the network, and the specified topic contained in the request is extracted; or receiving a keyword input by a user through a user interface, for example, and using the keyword as a subject of the subject index to be constructed.
In yet another embodiment, the topics for which the topic index is to be constructed may be automatically discovered hot topics. For example, media news information, public behavior information, industry information and stock related information may be periodically collected, and then a new theme or a hot theme is found from the collected information, and a theme index is to be constructed for the newly found theme. This is in view of the fact that subject matter investments often employ event-driven strategies, often relying on certain events or certain expectations, leading to investment hotspots. It is often difficult for a particular user to timely and efficiently discover investment hotspots. Therefore, in the embodiment, by monitoring and collecting news and topics related to each media in real time, collecting information related to public behaviors, collecting latest information related to industry development and the trend of stock release and stock investment, etc., hot topics or new topics are found in the collected information in time to recommend relevant topic investment to the user, and a topic index can be quickly constructed to provide timely and accurate information for stock investment.
The collected information includes the above-mentioned stock related information, and may include, but is not limited to, the following information:
media information: the method mainly collects category data such as social contact, news, research reports and the like. Social data such as mainstream finance community forums such as new wave stock bars, eastern wealth stock bars, snowballs, microblogs, WeChat-spread articles, discussion groups, post-public articles, original posts and reply data. News-like data includes, for example, articles from mainstream financial websites and industry hotspot websites, as well as data sources, forwarding times, and commentary demographics of the original news data. The research report data includes, for example, research reports (also referred to as "research reports") of mainstream financial websites and various types of research reports purchased by companies, and various types of announcements and reports of listed companies. And can selectively extract public company bulletin contents, research report abstracts, organization information, strategy categories, researcher information, associated stocks and the like.
Behavior information: the method mainly collects public behavior information which can be obtained in social contact, news and research and report media. The method can comprise the following steps: click volume, e.g., the number of times the public clicks on a media article (i.e., reading); number of replies, such as number of times the public replies to the intermediary article; a number of hops, e.g., the number of times the public has forwarded the social media article (e.g., the number of hops for a Sina microblog); the number of concerns, such as the number of times the public has focused on a certain topic (e.g., the number of people focused on a certain stock on the snowball), etc.
Industry information: often referring to domain specific professional data. Such as real estate deal and listing data, can be used to study information such as the popularity of the real estate market. For example, the bid data may be used to research information such as bid and winning bid of a company. And for example, the patent data from Chinese patent publication bulletin network is used to research the information of patent application of each company.
The data collected in real time cover multiple aspects of stock, news, public sentiment, industry development, technical development and the like, and the heat change of the existing subject words can be tracked by setting multiple heat indexes related to the subjects, so that the hot subjects can be determined more accurately in time according to market information. The heat indicator may include, but is not limited to: the number of times a keyword associated with a topic appears and the frequency of the occurrence in a unit time, the amount of reading, forwarding and review of news or information associated with the topic by a user, the amount of new stock release associated with the topic over a period of time, the average magnitude of increase or decrease in stock prices associated with the topic over a period of time, the amount of patent applications associated with the topic over a period of time, etc. After the heat degree of each theme is determined by using the acquired information, the theme of which the heat degree change amplitude exceeds a preset threshold value can be selected as the theme of the theme index to be constructed, so that the constructed theme index can reflect the heat degree change of the theme in real time. In yet another embodiment, new topics may also be discovered from this collected information using information entropy or mutual information based new word discovery algorithms or cluster based new word discovery algorithms.
With continued reference to FIG. 1, at step S103, the relevance of each selected stock to the topic is calculated by the data analysis device. The subject investment is generally cross-industry. For example, assuming that the theme is "new energy", then relevant to the theme may be a company that generates new energy products, a company that uses new energy, a company that provides technology development for new energy, a travel company that benefits from the use of new energy, and the like. Thus, the stocks related to the topic are found to be numerous by keyword matching in step S101, and there is a great difference between the investment index and contribution strength of each stock to the topic. To further screen representative stocks, a more detailed quantification of the relevance of each stock to the topic is required in step S102. For example, a text relevancy algorithm such as the TF-TDF model, the LDA topic model, etc. may be employed to calculate the relevancy between each stock and the topic. In one embodiment, a Word vector model such as the Word2Vec algorithm may be employed to calculate the relevance of each stock selected to the topic corresponding to the topic index to be built. For example, Word vector Word2Vec algorithm is used to train Word vector models on corpus text of financial related news, marketing company reports, specific industry fields, etc. Based on the trained word vector model, the subject word is used as a query vector to extract a corresponding word vector from the established word vector library, the word vector of each stock is obtained by using the word vector model, then the similarity calculation is carried out on the word vector of the subject word and the word vector corresponding to each stock, and the strength of the correlation between the subject word and each stock is reflected by the result of the similarity calculation.
In step S104, the first stocks having a greater relevance to the topic are selected by the data analysis device for constructing the topic index of the topic. After the stock is selected, the method for constructing and calculating the theme index belongs to the known content, and is not described in detail herein. Then, in step S105, the selected stocks and the constructed theme index are displayed by the data presentation apparatus. In one embodiment, the data presentation device may be a stand-alone server that receives information from the data analysis device and displays it on a corresponding display screen. In yet another embodiment, the data presentation means may be a display device, such as a display, for displaying information to a user.
In the technical scheme of the embodiment, the method does not depend on artificial subjective selection of a plurality of stocks for constructing the theme index, but automatically selects high-quality stocks which objectively reflect the theme development trend and have enough performance support to dynamically construct the theme index according to the information collected in real time, so that relatively objective and accurate guidance can be given to theme investment.
In another embodiment of the invention, the data analysis device may also periodically or in response to a user request, according to the stock related information in a period of time newly collected by the data collection device, recalculate the relevance of each selected stock to the topic, and reselect the first several stocks with greater relevance to the topic to construct the topic index of the topic, so as to better ensure the timeliness of the topic index and more effectively measure the topic investment value.
FIG. 2 is a block diagram of a data processing system for stock analysis, according to one embodiment of the invention. The system 200 includes a data acquisition device 201, a data analysis device 202, and a data presentation device 203. Although the block diagrams depict components in a functionally separate manner, such depiction is for illustrative purposes only. The functional components shown in the figures may be arbitrarily combined or separated into separate software, firmware, and/or hardware components. Moreover, regardless of how such components are combined or divided, they may execute on the same host or multiple hosts, where multiple hosts may be connected by one or more networks.
The data acquisition device 201 is used for acquiring stock related information in real time and sending the stock related information to the data analysis device 202. The data analysis device 202 is used to select stocks related to the topic of the topic index to be constructed according to the stock information collected in real time in the manner described above in connection with the method of fig. 1, calculate the relevance of each selected stock to the topic, and select the first several stocks with larger relevance to the topic to construct the topic index of the topic. The data presentation device 203 is used for displaying the stocks selected by the data analysis device 202 and the constructed theme index. In yet another embodiment of the present invention, the data collection device 201 is further used to periodically collect media news information, public behavior information, industry information, and stock related information as mentioned above; and the data analysis means 202 is further configured to find a new topic from the collected information, and select the new topic as a topic of the topic index to be constructed. In yet another embodiment of the present invention, the topic for which the topic index is to be constructed may be specified; the keyword is made the subject of the topic index to be constructed, for example, in response to a keyword specified in a user request received through the network or a keyword input through the user interface.
Referring now to FIG. 3, a block diagram of a computer system 300 suitable for use with the electronic device implementing an embodiment of the invention is shown. The computer system of the electronic device shown in fig. 3 is only an example, and should not bring any limitation to the function and the scope of use of the embodiments of the present invention.
As shown in fig. 3, the computer system 300 includes a Central Processing Unit (CPU)301 that can perform various appropriate actions and processes in accordance with a program stored in a Read Only Memory (ROM)302 or a program loaded from a storage section 308 into a Random Access Memory (RAM) 303. In the RAM 303, various programs and data necessary for system operation are also stored. The CPU 301, ROM 302, and RAM 303 are connected to each other via a bus 304. An input/output (I/O) interface 305 is also connected to bus 304.
The following components are connected to the I/O interface 305: an input portion 306 including a keyboard, a mouse, and the like; an output section 307 including a display such as a Cathode Ray Tube (CRT), a Liquid Crystal Display (LCD), and the like, and a speaker; a storage section 308 including a hard disk and the like; and a communication section 309 including a network interface card such as a LAN card, a modem, or the like. The communication section 309 performs communication processing via a network such as the internet. A drive 310 is also connected to the I/O interface 305 as needed. A removable medium 311 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 310 as necessary, so that a computer program read out therefrom is mounted into the storage section 308 as necessary.
In particular, according to an embodiment of the present invention, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the invention include a computer program product comprising a computer program embodied on a computer-readable medium, the computer program comprising program code for performing the method illustrated in the flow chart. In such an embodiment, the computer program may be downloaded and installed from a network through the communication section 309, and/or installed from the removable medium 311. The above-described functions defined in the system of the present application are executed when the computer program is executed by the Central Processing Unit (CPU) 301.
It should be noted that the computer readable medium shown in the present invention can be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present invention, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In the present invention, however, a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams or flowchart illustration, and combinations of blocks in the block diagrams or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The units described in the embodiments of the present invention may be implemented by software, or may be implemented by hardware, and the described units may also be disposed in a processor. Wherein the names of the elements do not in some way constitute a limitation on the elements themselves.
As another aspect, the present application also provides a computer-readable medium, which may be contained in the electronic device described in the above embodiments; or may exist separately without being assembled into the electronic device. The computer program or executable instructions stored thereon, when being executed, implement the technical solutions as described in the foregoing embodiments, and the implementation principles thereof are similar, and are not described herein again. In embodiments of the present invention, the computer readable storage medium may be any tangible medium that can store data and that can be read by a computing device. Examples of computer readable storage media include hard disk drives, Network Attached Storage (NAS), read-only memory, random-access memory, CD-ROMs, CD-R, CD-RWs, magnetic tapes, and other optical or non-optical data storage devices. The computer readable storage medium may also include a computer readable tangible medium distributed over a network coupled computer system so that the computer program or instructions may be stored and executed in a distributed fashion.
In another embodiment of the present invention, an electronic device is further provided, which includes a processor and a memory, where the memory is used for storing executable instructions that can be executed by the processor, and the processor is configured to execute the executable instructions stored in the memory, and when the executable instructions are executed, the technical solution described in any one of the foregoing embodiments is implemented, and the implementation principles thereof are similar, and are not described herein again.
It should be noted that although in the above detailed description several modules or units of the device for action execution are mentioned, such a division is not mandatory. Indeed, the features and functionality of two or more modules or units described above may be embodied in one module or unit, according to embodiments of the invention. Conversely, the features and functions of one module or unit described above may be further divided into embodiments by a plurality of modules or units.
Through the above description of the embodiments, those skilled in the art will readily understand that the exemplary embodiments described herein may be implemented by software, or by software in combination with necessary hardware. Therefore, the technical solution according to the embodiment of the present invention can be embodied in the form of a software product, which can be stored in a non-volatile storage medium (which can be a CD-ROM, a usb disk, a removable hard disk, etc.) or on a network, and includes several instructions to enable a computing device (which can be a personal computer, a server, a touch terminal, or a network device, etc.) to execute the method according to the embodiment of the present invention.
Other embodiments of the invention will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the invention being indicated by the following claims.
It will be understood that the invention is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the invention is limited only by the appended claims.

Claims (10)

1. A method of data processing, comprising:
the data acquisition device acquires the related information of the stock in real time and sends the related information to the data analysis device;
selecting stocks related to the theme of the theme index to be constructed by the data analysis device according to the stock related information acquired in real time by the data acquisition device, calculating the relevance of each selected stock to the theme, and selecting a plurality of previous stocks with larger relevance to the theme to construct the theme index of the theme;
the selected stocks and the constructed topic index are displayed by the data presentation device.
2. The method of claim 1, further comprising:
the data acquisition device periodically acquires media news information, public behavior information, industry information and related stock information;
and discovering a new theme from the information collected by the data collection device by the data analysis device, and selecting the new theme as the theme of the theme index to be constructed.
3. The method of claim 1, further comprising:
and responding to the keywords received through a network or a user interface by the data analysis device, and taking the keywords as the subjects of the subject index to be constructed.
4. The method of claim 1, further comprising:
and periodically or in response to a user request, recalculating the relevance of each stock to the theme according to the stock related information collected by the data collection device in a period of time, and reselecting the previous stocks which are more relevant to the theme to construct the theme index of the theme.
5. The method of any one of claims 1-4, wherein the relevance of each selected stock to the topic is calculated by the data analysis device using a word vector model.
6. The method of claim 1, further comprising:
the data acquisition device periodically acquires media news information, public behavior information, industry information and related stock information;
and determining the heat degree of each theme by the data analysis device based on the information acquired by the data acquisition device, and selecting the theme of which the heat degree change exceeds a set threshold value as the theme of the theme index to be constructed.
7. The method of claim 1, wherein calculating the relevance of each of the selected stocks to the topic comprises:
respectively extracting a word vector corresponding to the subject word and a word vector corresponding to each stock by using a pre-trained word vector model;
and calculating the similarity between the word vector of the subject word and the word vector corresponding to each stock to obtain the correlation between the subject word and each stock.
8. A data processing system comprising:
data acquisition device for real-time acquisition of stock related information
The data analysis device is used for selecting stocks related to the theme of the theme index to be constructed according to the stock information collected in real time, calculating the relevance of each selected stock to the theme, and selecting the first stocks with larger relevance to the theme to construct the theme index of the theme;
and the data display device is used for displaying the selected stocks and the constructed theme index.
9. A computer-readable storage medium, on which a computer program is stored which, when executed, implements the method of claims 1-7.
10. An electronic device comprising a processor and a memory, wherein the memory is configured to store executable instructions; the processor is configured to perform the method of claims 1-7 via execution of the executable instructions.
CN201911189102.1A 2019-11-27 2019-11-27 Data processing method, system, storage medium and electronic device Pending CN112862617A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911189102.1A CN112862617A (en) 2019-11-27 2019-11-27 Data processing method, system, storage medium and electronic device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911189102.1A CN112862617A (en) 2019-11-27 2019-11-27 Data processing method, system, storage medium and electronic device

Publications (1)

Publication Number Publication Date
CN112862617A true CN112862617A (en) 2021-05-28

Family

ID=75985243

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911189102.1A Pending CN112862617A (en) 2019-11-27 2019-11-27 Data processing method, system, storage medium and electronic device

Country Status (1)

Country Link
CN (1) CN112862617A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103226554A (en) * 2012-12-14 2013-07-31 西藏同信证券有限责任公司 Automatic stock matching and classifying method and system based on news data
US20150269680A1 (en) * 2014-03-21 2015-09-24 ASPN Solutions, LLC System and method for determining an investment theme for a potential investment fund
CN107025264A (en) * 2017-02-13 2017-08-08 闽南师范大学 A kind of automatic share-selecting method based on news big data
CN107527289A (en) * 2017-08-25 2017-12-29 百度在线网络技术(北京)有限公司 A kind of investment combination industry distribution method, apparatus, server and storage medium
CN107704512A (en) * 2017-08-31 2018-02-16 平安科技(深圳)有限公司 Financial product based on social data recommends method, electronic installation and medium
CN107767273A (en) * 2017-09-05 2018-03-06 平安科技(深圳)有限公司 Asset Allocation method, electronic installation and medium based on social data
CN108629693A (en) * 2018-05-08 2018-10-09 平安科技(深圳)有限公司 Automatically generate method, apparatus, computer equipment and the storage medium of suggestion for investment
KR20190113191A (en) * 2018-03-28 2019-10-08 주식회사 에이제이케이 Method for determining stock item index using media and System for carrying out the same

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103226554A (en) * 2012-12-14 2013-07-31 西藏同信证券有限责任公司 Automatic stock matching and classifying method and system based on news data
US20150269680A1 (en) * 2014-03-21 2015-09-24 ASPN Solutions, LLC System and method for determining an investment theme for a potential investment fund
CN107025264A (en) * 2017-02-13 2017-08-08 闽南师范大学 A kind of automatic share-selecting method based on news big data
CN107527289A (en) * 2017-08-25 2017-12-29 百度在线网络技术(北京)有限公司 A kind of investment combination industry distribution method, apparatus, server and storage medium
CN107704512A (en) * 2017-08-31 2018-02-16 平安科技(深圳)有限公司 Financial product based on social data recommends method, electronic installation and medium
CN107767273A (en) * 2017-09-05 2018-03-06 平安科技(深圳)有限公司 Asset Allocation method, electronic installation and medium based on social data
KR20190113191A (en) * 2018-03-28 2019-10-08 주식회사 에이제이케이 Method for determining stock item index using media and System for carrying out the same
CN108629693A (en) * 2018-05-08 2018-10-09 平安科技(深圳)有限公司 Automatically generate method, apparatus, computer equipment and the storage medium of suggestion for investment

Similar Documents

Publication Publication Date Title
Bu et al. A multidimensional framework for characterizing the citation impact of scientific publications
TWI539305B (en) Personalized information push method and device
Tran et al. Hashtag recommendation approach based on content and user characteristics
EP3564828A1 (en) Method of data query based on evaluation and device
EP3168795A1 (en) Method and apparatus for evaluating relevance of keyword to asset price
CN105095256A (en) Information push method and apparatus based on similarity degree between users
Palmer et al. A study of trust over time within a social network mediated environment
Suiter et al. Supporting scholars: An analysis of academic library websites' documentation on metrics and impact
Langley et al. Patterns of herding and their occurrence in an online setting
CA2917140A1 (en) Social network for employment search
CN110727761B (en) Object information acquisition method and device and electronic equipment
CN111159341B (en) Information recommendation method and device based on user investment and financial management preference
Jiang et al. How to find your appropriate doctor: An integrated recommendation framework in big data context
Chen et al. Towards correlating search on google and asking on stack overflow
CN107977678A (en) Method and apparatus for output information
Loughlin et al. The viability of StockTwits and Google Trends to predict the stock market
Huang et al. A comprehensive analysis of the journal evaluation system in China
US20150302088A1 (en) Method and System for Providing Personalized Content
Zheng et al. Algorithm for recommending answer providers in community-based question answering
Deng et al. A knowledge graph approach for recommending patents to companies
US20160342699A1 (en) Systems, methods, and devices for profiling audience populations of websites
CA3013346C (en) Method and system for personalizing software based on real time tracking of voice-of-customer feedback
TW201234204A (en) Opportunity identification for search engine optimization
Wolfram Modelling the stock market using Twitter
CN114285896B (en) Information pushing method, device, equipment, storage medium and program product

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination