WO2019023867A1 - 一种基于智能终端的水印添加方法及水印添加系统 - Google Patents

一种基于智能终端的水印添加方法及水印添加系统 Download PDF

Info

Publication number
WO2019023867A1
WO2019023867A1 PCT/CN2017/095219 CN2017095219W WO2019023867A1 WO 2019023867 A1 WO2019023867 A1 WO 2019023867A1 CN 2017095219 W CN2017095219 W CN 2017095219W WO 2019023867 A1 WO2019023867 A1 WO 2019023867A1
Authority
WO
WIPO (PCT)
Prior art keywords
watermark
voice
module
information
watermark adding
Prior art date
Application number
PCT/CN2017/095219
Other languages
English (en)
French (fr)
Inventor
张亚林
Original Assignee
深圳传音通讯有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳传音通讯有限公司 filed Critical 深圳传音通讯有限公司
Priority to PCT/CN2017/095219 priority Critical patent/WO2019023867A1/zh
Publication of WO2019023867A1 publication Critical patent/WO2019023867A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/685Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/10Protecting distributed programs or content, e.g. vending or licensing of copyrighted material ; Digital rights management [DRM]
    • G06F21/106Enforcing content protection by specific content processing
    • G06F21/1063Personalisation

Definitions

  • the present invention relates to the field of intelligent terminals, and in particular, to a watermark adding method based on an intelligent terminal and a watermark adding system.
  • smart terminals With the popularization of smart terminals in people's daily lives, smart terminals have become an increasingly inseparable part of people's lives. With the development of science and technology, smart terminals are constantly innovating, and their functions are becoming more and more diverse. For example, paying, listening to songs, surfing the Internet, ordering food, reading books, etc., bring various things to people’s work and life. The kind of information and information greatly enriched the scope of people's information acquisition.
  • Watermarking technology embeds some identification information (ie, digital watermark) directly into a digital carrier (including multimedia, documents, software, etc.) or indirectly (modifies the structure of a specific area) without affecting the use value of the original carrier, and is not easy. Be explored and modified again. But it can be identified and identified by the producer. Through the information hidden in the carrier, it is possible to confirm the content creator, the purchaser, transmit the secret information, or determine whether the carrier has been tampered with. Digital watermarking is an effective way to protect information security, realize anti-counterfeiting and traceability, and copyright protection. It is an important branch and research direction of information hiding technology research.
  • the information of digital watermark should be safe, difficult to tamper with or forge. At the same time, there should be a low false detection rate. When the original content changes, the digital watermark should change, so that the original data can be detected. Of course, the digital watermark is also very resistant to repeated additions;
  • the digital watermark should be unperceptible and should not affect the normal use of the protected data; it will not degrade;
  • Sensitivity This feature applies to fragile watermarks. After being distributed, transmitted, and used, the digital watermark can accurately determine whether the data has been tampered with. Further, it is possible to judge the location, extent, and even the original information of the data tampering.
  • the current watermarking technology is mainly limited to the input of text to add a digital watermark, which requires the user to manually input text in the smart terminal, which is inconvenient for the user to use, especially when the number of watermark characters is long.
  • the prior art is further improved, and a new watermark adding method and a watermark adding system based on the smart terminal need to be proposed, which are not limited to the used application, and can input the input.
  • the voice is directly converted into a text-type watermark, and the conversion accuracy is high, which is convenient for the user to use, and provides a better user experience.
  • an object of the present invention is to provide a watermark adding method and a watermark adding system based on an intelligent terminal, which are not limited to the application program used, by which the user can directly convert the input voice into text. Watermarking, high conversion accuracy, easy for users to use, to provide users with a better experience.
  • the present invention provides a watermark adding method based on an intelligent terminal, where the watermark adding method includes the following steps, the smart terminal collecting a voice;
  • the watermark is embedded in an image to generate a watermark image with the watermark.
  • the voice comprises one or more of male voice, female voice and child voice.
  • the watermark adding method further comprises removing noise in the voice information.
  • the step of embedding the watermark in an image further comprises selecting the watermark to be embedded.
  • the invention further provides a watermark adding system based on an intelligent terminal, the watermark adding system comprising an acquiring module, an identifying module, a converting module, a synthesizing module and an embedding module;
  • the collecting module collects a voice and sends the voice to the identification module
  • the identification module identifies voice information included in the voice, acquires voice data corresponding to the voice information, and sends the voice data to the conversion module;
  • the conversion module converts the voice data into a text data corresponding to the voice data, and sends the text data to the synthesis module;
  • the synthesizing module synthesizes the text data, generates a watermark matching the speech, and sends the watermark to the embedding module;
  • the embedding module is communicatively coupled to the synthesizing module to embed the watermark in an image to generate a watermark image with the watermark.
  • the voice comprises one or more of male voice, female voice and child voice.
  • said identifying module further comprises removing noise in said voice information.
  • the watermark adding system further comprises a storage module communicatively coupled to the synthesizing module and the embedding module for storing the watermark.
  • the embedding module further comprises selecting the watermark to be embedded.
  • FIG. 1 is a schematic flowchart of a method for adding a watermark based on an intelligent terminal according to an embodiment of the present invention
  • FIG. 2 is a schematic structural diagram of a watermark adding system based on an intelligent terminal according to an embodiment of the present invention.
  • the invention provides a watermark adding method based on intelligent terminal and a watermark adding system.
  • Watermarking technology is to embed some identification information (ie digital watermark) directly into the digital carrier (including multimedia, documents, software, etc.) or Indirect representation (modification of the structure of a specific area), and does not affect the use value of the original carrier, and is not easy to be detected and modified again. But it can be identified and identified by the producer. Through the information hidden in the carrier, it is possible to confirm the content creator, the purchaser, transmit the secret information, or determine whether the carrier has been tampered with. Digital watermarking is an effective way to protect information security, realize anti-counterfeiting and traceability, and copyright protection. It is an important branch and research direction of information hiding technology research.
  • the watermark adding method and the watermark adding system provided by the present invention are not limited to the application program used, and the user can directly convert the input voice into a text type watermark output by the above method and system.
  • the watermark adding method and the watermark adding system provided by the present invention can recognize the speech not only to the standard male or female voice, but also can accurately recognize the voices of other sound colors such as children's voice, and the conversion precision is high, and is convenient for the user to use. Provide users with a better experience.
  • FIG. 1 is a schematic flowchart diagram of a smart terminal-based watermark adding method according to an embodiment of the present invention.
  • the embodiment of the invention provides a method for adding a watermark based on a smart device, such as a mobile phone or a tablet computer. Specifically, the method includes the following steps:
  • the smart terminal collects a voice
  • start a smart terminal such as a mobile phone, a tablet computer, etc.
  • find an icon of a watermark adding application on the smart terminal interface click a watermark to add an application icon, and open a watermark adding application.
  • select to turn on the microphone of the smart terminal select to turn on the microphone of the smart terminal.
  • the microphone is turned on, a voice is acquired, and voice collection is completed.
  • the voice comprises one or more of male voice, female voice and child voice.
  • the smart terminal collects the voice recorded by the user through the microphone.
  • the voice involved in the watermark adding method based on the smart terminal provided in the embodiment of the present invention includes various sound colors, and the recognition of the voice is not limited to the standard male voice.
  • the watermark adding method based on the smart terminal further identifies the voice information included in the voice, and generates a Corresponding voice data for subsequent processing. For example, when the smart terminal recognizes that the collected voice includes the voice information of the “smart terminal”, it generates corresponding text data according to the voice information of the “smart terminal”, that is, expresses “smart terminal” by means of data.
  • the watermark adding method further comprises removing noise in the voice information.
  • the watermark adding method in the embodiment of the present invention further comprises performing denoising processing on the obtained voice information to ensure that the converted voice data does not include the ambient sound and the background. Noise and so on.
  • the watermark adding method in the embodiment of the present invention includes converting the obtained voice data into text data corresponding to the voice data.
  • the first step in converting speech to text is by changing the format of the data.
  • Synthesizing the text data to generate a text information that matches the voice information specifically, after obtaining the text data converted by the voice data by changing the format of the data, synthesizing the converted text data, and obtaining a corresponding Text information corresponding to the previously collected voice information.
  • the smart terminal when the smart terminal acquires the corresponding text information by synthesizing the text data converted from the voice data, the smart terminal automatically generates a watermark according to the text information.
  • the text displayed by the watermark matches the previously acquired speech. That is, the previously acquired speech is converted into a text-type watermark.
  • the voice collected previously is the voice of the “smart terminal”
  • the watermark is converted by the intelligent terminal to generate a “smart terminal”.
  • the step of generating a watermark matching the speech and embedding the watermark in an image between the methods further includes storing the watermark.
  • the intelligent terminal-based watermark adding method provided in the embodiment of the present invention further includes storing the generated watermark between the step of generating a watermark matching the voice and the step of embedding the watermark in an image. step.
  • the smart terminal may choose to store the generated watermark in the smart terminal for use as a backup. It is convenient for users to call at any time.
  • the obtained watermark may be further embedded in the image.
  • the image embeds a text-type watermark converted from the input speech, and generates a watermark image with the typeface matching the input speech.
  • the watermark text in the watermark image does not need to be manually input by the user, and only needs to input the voice to be converted and generated.
  • the step of embedding the watermark in an image further comprises selecting the watermark to be embedded.
  • the step of embedding the watermark into the image further includes selecting a watermark to be embedded in the watermark stored in the smart terminal.
  • FIG. 2 is a schematic structural diagram of a smart terminal-based watermark adding system according to an embodiment of the present invention.
  • the embodiment of the present invention further provides a watermark adding system based on an intelligent terminal, such as a smart device such as a mobile phone or a tablet computer, and the watermark adding system can be operated in an Android operating environment.
  • the smart terminal-based watermark adding system provided by the present invention comprises an acquiring module, an identifying module, a converting module, a synthesizing module and an embedding module;
  • the collecting module collects a voice and sends the voice to the identification module; specifically, starts a smart terminal such as a mobile phone or a tablet computer, finds an icon of a watermark adding application on the interface of the smart terminal, and clicks the watermark Add an app icon and open the watermark add app.
  • the collecting module of the watermark adding system After entering the watermark adding application, the collecting module of the watermark adding system is turned on, and the collecting module establishes a communication connection with the identifying module in the watermark adding system.
  • the acquisition module is enabled, the voice is obtained, and after the voice is obtained, the voice is sent to the identification module through the communication connection.
  • the voice comprises one or more of male voice, female voice and child voice.
  • the watermark adding system of the smart terminal collects the voice recorded by the user through the collecting module.
  • the voice involved in the watermark adding system based on the smart terminal provided in the embodiment of the present invention includes various sound colors, and the recognition of the voice is not only Limited to the standard male and female voices, as well as other sounds such as children's voices.
  • the identification module identifies voice information included in the voice, acquires voice data corresponding to the voice information, and sends the voice data to the conversion module; specifically, the module to be acquired acquires a voice and completes After the voice is collected and sent to the identification module, the identification module obtains the voice information transmitted by the collection module through the communication connection, identifies the voice information contained in the voice, and generates a corresponding voice information according to the voice information.
  • the voice data is obtained through the communication connection after the voice data is obtained, and the obtained voice data is sent to the conversion module for use.
  • the identification module recognizes that the collected voice includes the voice information of the “smart terminal”
  • the voice data corresponding to the “smart terminal” is generated according to the voice information of the “smart terminal”, that is, the “smart terminal” that describes the voice input by means of data ".
  • said identifying module further comprises removing noise in said voice information.
  • the identification module in the watermark adding system in the embodiment of the present invention further includes denoising processing on the obtained voice information to ensure converted voice data. Does not include ambient sounds, background noise, etc.
  • the conversion module converts the voice data into a text data corresponding to the voice data, and sends the text data to the synthesis module;
  • the conversion module is in communication with the identification module and receives voice data from the identification module. After the to-be-transformed module obtains the voice data included in the voice through the communication connection, the conversion module included in the watermark adding system in the embodiment of the present invention converts the obtained voice data into text data corresponding to the voice data.
  • the first step in converting speech to text is by changing the format of the data. Once the conversion is complete, the conversion module will send the converted text data to the composition module via a communication link.
  • the synthesizing module synthesizes the text data, generates a watermark matching the speech, and sends the watermark to the embedding module;
  • the conversion module sends the converted text data to the synthesis module through the communication connection
  • the synthesis module generates a watermark according to the received text data synthesis, and the generated watermark matches the previously acquired voice information.
  • the synthesizing module synthesizes a watermark
  • the watermark is sent to the embedding module of the watermark adding system through a communication connection.
  • the system further comprises a storage module communicatively coupled to the synthesis module and the embedded module for storing the watermark.
  • the smart terminal based watermark adding system provided by the present invention further comprises a storage module.
  • the storage module establishes a communication connection with the synthesis module and the embedded module. If it is not necessary to embed the generated watermark temporarily, before the synthesizing module sends the watermark to the embedding module in the watermark adding system through the communication connection, the synthesized watermark can be further sent to the storage module for storage through the communication connection.
  • the storage module sends the watermark to the embedded module through a communication connection. That is, before the synthesizing module sends the synthesized watermark to the embedding module to form the watermark image, the synthesizing module further includes sending the synthesized watermark to the storage module for storage for use.
  • the embedding module is communicatively coupled to the synthesizing module to embed the watermark in an image to generate a watermark image with the watermark.
  • the embedding module may further embed the obtained watermark into the image.
  • the embedded module is in communication with the synthesis module.
  • the synthesized watermark is sent to the embedded module via the communication connection synthesis module.
  • the embedding module embeds a watermark converted from speech into the image, thereby generating a watermark image with the typeface matching the input speech.
  • the watermark text in the watermark image does not need to be manually input by the user, and only needs to be generated by the acquisition module to collect the input voice of the user.
  • the embedding module further comprises selecting the watermark to be embedded.
  • the synthesizing module sends the synthesized plurality of watermarks to the storage module for storage, if the storage module needs to send the watermark to the embedding module, and the embedding module is ready to embed the watermark, the embedding module further comprises selecting the embedded watermark. That is, the embedded module in the smart terminal based watermark adding system provided by the present invention further includes selection and setting of adding watermarks.
  • the user can directly convert the acquired voice into a text-type watermark matched with the voice through the smart terminal.
  • the watermark adding method and the watermark adding system are simple and convenient to operate, and do not need to be manually input by the user, and the running process does not depend on the running of an application, and is not restricted by the running application.
  • the voice input by the user can be accurately recognized, and is not limited to the sound color of the input voice, for example, male voice, female voice, and child voice.
  • the watermark adding system Based on the watermark adding system, not only the standard male and female voices can be identified and analyzed, but also other voices such as children's voices can be identified. Therefore, after the smart terminal based watermark adding system provided by the present invention, the voice to text conversion has With higher precision, the watermark is synthesized more accurately. Furthermore, according to the watermark adding method provided by the present invention, the user can further select the embedded watermark. Therefore, the watermark adding method and the watermark adding system provided by the present invention can provide a more convenient use experience for the user.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Technology Law (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Editing Of Facsimile Originals (AREA)

Abstract

一种基于智能终端的水印添加方法及水印添加系统。所述水印添加方法包括:所述智能终端采集一语音;识别所述语音中的语音信息,以生成一与所述语音信息对应的语音数据;将所述语音数据转化为一与所述语音数据对应的文字数据;合成所述文字数据,以生成一与所述语音信息匹配的文字信息;根据所述文字信息,生成一与所述语音匹配的水印;将所述水印嵌入一图像中,以生成一带有所述水印的水印图像。所述水印添加系统包括采集模块、识别模块、转化模块、合成模块及嵌入模块。采用上述水印添加方法及系统后,能够将输入的语音直接转化为文字型水印,转化精确度高,便于用户的使用,为用户提供一种更好的使用体验。

Description

一种基于智能终端的水印添加方法及水印添加系统 技术领域
本发明涉及智能终端领域,尤其涉及一种基于智能终端的水印添加方法及水印添加系统。
背景技术
随着智能终端在人们的日常生活中的普及化,智能终端成为了人们生活中越来越不可分割的一部分。随着科学技术的日益发展,智能终端不断推陈出新,具备的功能也越来越多样化,例如,支付、听歌、上网、点餐、看书等,为人们工作生活等各方面带来各种各样的资讯信息,极大丰富了人们信息获取的范围。
如今,为了给用户提供更为便捷的使用体验,大多数智能终端都具有将语音转换为文字输出的功能,但是目前这类功能仅仅局限于在特定应用程序中使用,并且通常智能在社交应用程序中实现。
此外,基于数字化网络平台的出现,对网络资料(如:多媒体、文档、软件等)保护的需求也越来越高。目前比较常用的保护方法为嵌入水印。水印技术是将一些标识信息(即数字水印)直接嵌入数字载体当中(包括多媒体、文档、软件等)或是间接表示(修改特定区域的结构),且不影响原载体的使用价值,也不容易被探知和再次修改。但可以被生产方识别和辨认。通过这些隐藏在载体中的信息,可以达到确认内容创建者、购买者、传送隐秘信息或者判断载体是否被篡改等目的。数字水印是保护信息安全、实现防伪溯源、版权保护的有效办法,是信息隐藏技术研究领域的重要分支和研究方向。
水印技术基本上具有下面几个方面的特点:
1.安全性:数字水印的信息应是安全的,难以篡改或伪造,同时,应当有较低的误检测率,当原内容发生变化时,数字水印应当发生变化,从而可以检测原始数据的变更;当然数字水印同样对重复添加有很强的抵抗性;
2.隐蔽性:数字水印应是不可知觉的,而且应不影响被保护数据的正常使用;不会降质;
3.鲁棒性:该特性适用于鲁棒水印。是指在经历多种无意或有意的信号处理过程后,数字水印仍能保持部分完整性并能被准确鉴别。可能的信号处理过程包括信道噪声、滤波、数/模与模/数转换、重采样、剪切、位移、尺度变化以及有损压缩编码等;
4.敏感性:该特性适用于脆弱水印。是经过分发、传输、使用过程后,数字水印能够准确的判断数据是否遭受篡改。进一步的,可判断数据篡改位置、程度甚至恢复原始信息。
然而,目前的水印技术主要仍局限于输入文字添加数字水印,需要用户在智能终端中手动输入文字,不便于用户的实际使用,尤其不便于水印字符数较长的情况。
因此,为了克服现有技术的缺陷,对现有技术进行进一步地改进,需要提出一种基于智能终端的新型的水印添加方法及水印添加系统,不局限于所使用的应用程序,能够将输入的语音直接转化为文字型水印,转化精确度高,便于用户的使用,为用户提供一种更好的使用体验。
发明内容
为了克服上述技术缺陷,本发明的目的在于提供一种基于智能终端的水印添加方法及水印添加系统,不局限于所使用的应用程序,通过此方法及系统用户能够将输入的语音直接转化为文字型水印,转化精确度高,便于用户的使用,为用户提供一种更好的使用体验。
本发明提供了一种基于智能终端的水印添加方法,所述水印添加方法包含如下步骤,所述智能终端采集一语音;
识别所述语音中的语音信息,以生成一与所述语音信息对应的语音数据;
将所述语音数据转化为一与所述语音数据对应的文字数据;
合成所述文字数据,以生成一与所述语音信息匹配的文字信息;
根据所述文字信息,生成一与所述语音匹配的水印;
将所述水印嵌入一图像中,以生成一带有所述水印的水印图像。
优选地,所述语音包含男声、女声、童声中的一种或多种。
优选地,于所述识别所述语音中的语音信息,以生成一与所述语音信息对应的语音数据的步骤中,所述水印添加方法进一步包含去除所述语音信息中的噪点。
优选地,于所述生成一与所述语音匹配的水印之与将所述水印嵌入一图像中的步骤之间,所述水印添加方法进一步包含储存所述水印。
优选地,当储存多个所述水印后,将所述水印嵌入一图像中的步骤中进一步包含选择需要嵌入的所述水印。
本发明进一步提供了一种基于智能终端的水印添加系统,所述水印添加系统包含采集模块、识别模块、转化模块、合成模块及嵌入模块;
所述采集模块,采集一语音,并将所述语音发送至所述识别模块;
所述识别模块,识别所述语音中包含的语音信息,获取与所述语音信息相应的语音数据,并将所述语音数据发送至所述转化模块;
所述转化模块,将所述语音数据转化为一与所述语音数据对应的文字数据,并将所述文字数据发送到所述合成模块;
所述合成模块,合成所述文字数据,生成一与所述语音匹配的水印,并将所述水印发送至所述嵌入模块;
所述嵌入模块与所述合成模块通讯连接,将所述水印嵌入一图像中,以生成一带有所述水印的水印图像。
优选地,所述语音包含男声、女声、童声中的一种或多种。
优选地,所述识别模块进一步包含去除所述语音信息中的噪点。
优选地,所述水印添加系统进一步包含一储存模块,与所述合成模块和所述嵌入模块通讯连接,用于储存所述水印。
优选地,当所述储存模块储存多个所述水印后,所述嵌入模块进一步包含选择需要嵌入的所述水印。
采用了上述技术方案后,与现有技术相比,具有以下有益效果:
1.操作简便;
2.不受应用程序的限制;
3.直接将语音直接转化为文字型水印,方便快捷;
4.对输入的语音的解析度高,文字转化更为精准;
5.免去用户手动输入。
附图说明
图1为符合本发明实施例的一种基于智能终端的水印添加方法的流程示意图;
图2为符合本发明实施例的一种基于智能终端的水印添加系统的结构示意图。
具体实施方式
以下结合附图与具体实施例进一步阐述本发明的优点。
本发明提供了一种基于智能终端的水印添加方法及水印添加系统。水印技术是将一些标识信息(即数字水印)直接嵌入数字载体当中(包括多媒体、文档、软件等)或是 间接表示(修改特定区域的结构),且不影响原载体的使用价值,也不容易被探知和再次修改。但可以被生产方识别和辨认。通过这些隐藏在载体中的信息,可以达到确认内容创建者、购买者、传送隐秘信息或者判断载体是否被篡改等目的。数字水印是保护信息安全、实现防伪溯源、版权保护的有效办法,是信息隐藏技术研究领域的重要分支和研究方向。
本发明提供的水印添加方法及水印添加系统,不局限于所使用的应用程序,通过上述方法及系统用户能够将输入的语音直接转化为文字型水印输出。此外,本发明提供的水印添加方法及水印添加系统所能识别的语音不仅仅局限于标准的男声或女声,对童声等其他声色的语音也能精确识别,转化精确度高,便于用户的使用,为用户提供一种更好的使用体验。
如图1所示,为符合本发明实施例的一种基于智能终端的水印添加方法的流程示意图。本发明实施例提供了一种基于智能终端,如手机、平板电脑等智能设备的水印添加方法,具体地,包括如下步骤:
所述智能终端采集一语音;
具体地,启动智能终端如手机、平板电脑等设备,找到智能终端界面上的水印添加应用程序的图标,点击水印添加应用程序图标,打开水印添加应用程序。进入水印添加应用程序后,选择开启智能终端的麦克风。麦克风开启,获取一语音,完成语音采集。
优选地,所述语音包含男声、女声、童声中的一种或多种。
优选地,智能终端通过麦克风采集用户录入的语音,本发明实施例中提供的一种基于智能终端的水印添加方法中所涉及的语音包含各种声色,对语音的识别不仅仅局限于标准的男声、女声,也包含童声等其他声色的语音。
识别所述语音中的语音信息,以生成一与所述语音信息对应的语音数据;
具体地,待智能终端通过麦克风采集到一语音时,完成采集语音的步骤后,基于智能终端的水印添加方法进一步地对该语音中包含的语音信息进行识别,并根据该语音信息生成一与之相应的语音数据,用于后续处理。例如,当智能终端识别出采集到的语音包含“智能终端”的语音信息后,会根据“智能终端”的语音信息生成与之相应的文字数据,即通过数据的方式表达“智能终端”。
优选地,于所述识别所述语音中的语音信息,以生成一与所述语音信息对应的语音数据的步骤中,所述水印添加方法进一步包含去除所述语音信息中的噪点。
优选地,对采集所得的语音信息进行转化的步骤中,本发明实施例中的水印添加方法进一步包括对获取所得的语音信息的去噪处理,以确保转化成的语音数据不包含环境声音、背景杂音等。
将所述语音数据转化为一与所述语音数据对应的文字数据;
待从采集到的语音中获取到语音中所包含的语音数据后,本发明实施例中的水印添加方法包含将获取所得的语音数据转化为与该语音数据对应的文字数据。通过改变数据的格式,实现语音转化到文字的第一步。
合成所述文字数据,以生成一与所述语音信息匹配的文字信息;具体地,通过改变数据的格式获得经语音数据转化后的文字数据后,合成转化所得的文字数据,即可获得一相应的文字信息,该文字信息与先前采集所得的语音信息相对应。
根据所述文字信息,生成一与所述语音匹配的水印;
具体地,当智能终端通过合成由语音数据转化得到的文字数据获取相应的文字信息后,自动根据该文字信息生成一水印。该水印所显示的文字与先前采集所得的语音相匹配。即,先前采集的语音经转化成为一文字型水印。例如,先前采集所得的语音为“智能终端”的语音,最后经智能终端转化生成一“智能终端”字样的水印。
优选地,于所述生成一与所述语音匹配的水印之与将所述水印嵌入一图像中的步骤 之间,所述方法进一步包含储存所述水印。
优选地,本发明实施例中提供的一种基于智能终端的水印添加方法在生成一与语音匹配的水印的步骤与将所述水印嵌入一图像中的步骤之间进一步包括储存已生成的水印的步骤。例如,当智能终端根据用户输入的“智能终端”语音生成一“智能终端”字样的水印后,暂时不想直接将水印嵌入图像中时,可以选择将已生成的水印储存于智能终端中,作备用,便于用户随时调用。
将所述水印嵌入一图像中,以生成一带有所述水印的水印图像;
具体地,待智能终端根据采集所得的语音转化生成一水印后,可以进一步将所得水印嵌入图像中。以此,图像嵌入由输入的语音转化而得的文字型水印,生成一带有与输入语音匹配的字样的水印图像。水印图像中的水印文字无需用户手动输入,仅需输入语音即可转化生成。
优选地,当储存多个所述水印后,将所述水印嵌入一图像的步骤中进一步包含选择需要嵌入的所述水印。
具体地,根据采集所得的多个语音转化生成多个水印后,若用户暂时无需使用该水印时,为了避免重复制作,可以选择储存该水印,以备后续用户需要的时候使用。故,当智能终端中存有多个水印时,将水印嵌入图像的步骤中需进一步包含在储存于智能终端中的水印中选择需要嵌入的水印。
如图2所示,为符合本发明实施例的一种基于智能终端的水印添加系统的结构示意图。本发明实施例中进一步提供了一种基于智能终端,如手机、平板电脑等智能设备的水印添加系统,该水印添加系统可以于Android操作环境下运行。具体地,本发明提供的一种基于智能终端的水印添加系统包含采集模块、识别模块、转化模块、合成模块及嵌入模块;
所述采集模块,采集一语音,并将所述语音发送至所述识别模块;具体地,启动智能终端如手机、平板电脑等设备,找到智能终端界面上的水印添加应用程序的图标,点击水印添加应用程序图标,打开水印添加应用程序。进入水印添加应用程序后,开启水印添加系统的采集模块,采集模块与水印添加系统中的识别模块建立通讯连接。采集模块开启后,获取到语音,获取语音后,将该语音通过通讯连接发送到识别模块中。
优选地,所述语音包含男声、女声、童声中的一种或多种。
优选地,智能终端的水印添加系统通过采集模块采集用户录入的语音,本发明实施例中提供的一种基于智能终端的水印添加系统中所涉及的语音包含各种声色,对语音的识别不仅仅局限于标准的男声、女声,也包含童声等其他声色的语音。
所述识别模块,识别所述语音中包含的语音信息,获取与所述语音信息相应的语音数据,并将所述语音数据发送至所述转化模块;具体地,待采集模块获取一语音,完成采集语音的步骤并将采集语音发送到识别模块中后,识别模块通过通讯连接获取到由采集模块发送的语音后,对该语音包含的语音信息进行识别,并根据该语音信息生成一与之相应的语音数据,获取到语音数据后通过通讯连接,将所得的语音数据发送到转化模块中备用。例如,当识别模块识别出采集所得的语音包含“智能终端”的语音信息后,会根据“智能终端”的语音信息生成与之相应的语音数据,即通过数据的方式描述语音输入的“智能终端”。
优选地,所述识别模块进一步包含去除所述语音信息中的噪点。
优选地,对采集模块采集所得的语音信息进行转化的过程中,本发明实施例中的水印添加系统中的识别模块进一步包括对获取所得的语音信息的去噪处理,以确保转化成的语音数据不包含环境声音、背景杂音等。
所述转化模块,将所述语音数据转化为一与所述语音数据对应的文字数据,并将所述文字数据发送到所述合成模块;
转化模块与识别模块通讯连接,从识别模块中接收语音数据。待转化模块通过通讯连接获取到语音中所包含的语音数据后,本发明实施例中的水印添加系统包含的转化模块会将获取所得的语音数据转化为与该语音数据对应的文字数据。通过改变数据的格式,实现语音转化到文字的第一步。转化完成后,转化模块会通过通讯连接,将转化所得的文字数据发送到合成模块中。
所述合成模块,合成所述文字数据,生成一与所述语音匹配的水印,并将所述水印发送至所述嵌入模块;
具体地,当转化模块通过通讯连接将转化所得的文字数据发送到合成模块中后,合成模块根据接收所得的文字数据合成在处理后生成一水印,生成的水印与先前获取到的语音信息匹配。当合成模块合成了一水印后,通过通讯连接,将该水印发送到水印添加系统的嵌入模块中。
优选地,所述系统进一步包含一储存模块,与所述合成模块和所述嵌入模块通讯连接,用于储存所述水印。优选地,本发明提供的一种基于智能终端的水印添加系统进一步包含一储存模块。该储存模块与合成模块和嵌入模块建立通讯连接。若暂时不需要嵌入生成的水印时,在合成模块通过通讯连接将水印发送到水印添加系统中的嵌入模块中之前,可以进一步通过通讯连接将合成所得的水印发送至储存模块中储存。待需要嵌入该水印时,储存模块再通过通讯连接将上述水印发送至嵌入模块中嵌入。即在合成模块将合成出的水印发送到嵌入模块形成具有水印图像之前,合成模块进一步包括将合成的水印发送至储存模块储存,作备用。
所述嵌入模块与所述合成模块通讯连接,将所述水印嵌入一图像中,以生成一带有所述水印的水印图像。
具体地,待合成模块根据采集所得的语音转化生成一水印后,嵌入模块可以进一步将所得水印嵌入图像中。嵌入模块与合成模块通讯连接。通过通讯连接合成模块将合成的水印发送至嵌入模块中。嵌入模块将由语音转化而来的水印嵌入图像中,以此,生成一带有与输入语音匹配的字样的水印图像。水印图像中的水印文字无需用户手动输入,仅需采集模块采集到用户输入语音即可转化生成。
优选地,当所述储存模块储存多个所述水印后,所述嵌入模块进一步包含选择需要嵌入的所述水印。优选地,当合成模块将合成出的多个水印发送到储存模块中储存后,若储存模块需要将水印发送到嵌入模块,嵌入模块准备嵌入水印时,嵌入模块进一步包括对所嵌入水印的选择。即,本发明提供的一种基于智能终端的水印添加系统中的嵌入模块进一步包含对水印添加的选择和设置。
采用本发明提供的一种基于智能终端的水印添加方法及水印添加系统后,用户可以通过智能终端直接将其所获取语音转化为一与语音匹配的文字型水印。该水印添加方法及水印添加系统操作简便,无需通过用户手动输入实现,且运行过程不依赖于某一应用程序的运行,也不受运行的应用程序限制。根据本发明提供的水印添加方法及水印添加系统,能够对用户输入的语音准确识别,并且不局限于输入语音的声色,例如,男声、女声及童声等。基于该水印添加系统,不仅可以对标准的男女声进行识别分析,也可对童声等其他声音作识别,故采用本发明提供的一种基于智能终端的水印添加系统后,语音到文字的转化具有较高的精准度,水印的合成更为准确。此外,根据本发明提供的水印添加方法,用户还能够进一步选择嵌入的水印。因此,基于本发明提供的水印添加方法及水印添加系统能为用户提供一种更为便捷的使用体验。
应当注意的是,本发明的实施例有较佳的实施性,且并非对本发明作任何形式的限制,任何熟悉该领域的技术人员可能利用上述揭示的技术内容变更或修饰为等同的有效实施例,但凡未脱离本发明技术方案的内容,依据本发明的技术实质对以上实施例所作的任何修改或等同变化及修饰,均仍属于本发明技术方案的范围内。

Claims (10)

  1. 一种基于智能终端的水印添加方法,其特征在于,所述水印添加方法包含如下步骤,所述智能终端采集一语音;
    识别所述语音中的语音信息,以生成一与所述语音信息对应的语音数据;
    将所述语音数据转化为一与所述语音数据对应的文字数据;
    合成所述文字数据,以生成一与所述语音信息匹配的文字信息;
    根据所述文字信息,生成一与所述语音匹配的水印;
    将所述水印嵌入一图像中,以生成一带有所述水印的水印图像。
  2. 如权利要求1所述的水印添加方法,其特征在于,所述语音包含男声、女声、童声中的一种或多种。
  3. 如权利要求1所述的水印添加方法,其特征在于,于所述识别所述语音中的语音信息,以生成一与所述语音信息对应的语音数据的步骤中,所述水印添加方法进一步包含去除所述语音信息中的噪点。
  4. 如权利要求1所述的水印添加方法,其特征在于,于所述生成一与所述语音匹配的水印之与将所述水印嵌入一图像中的步骤之间,所述水印添加方法进一步包含储存所述水印。
  5. 如权利要求4所述的水印添加方法,其特征在于,当储存多个所述水印后,将所述水印嵌入一图像中的步骤中进一步包含选择需要嵌入的所述水印。
  6. 一种基于智能终端的水印添加系统,其特征在于,所述水印添加系统包含采集模块、识别模块、转化模块、合成模块及嵌入模块;
    所述采集模块,采集一语音,并将所述语音发送至所述识别模块;
    所述识别模块,识别所述语音中包含的语音信息,获取与所述语音信息相应的语音数据,并将所述语音数据发送至所述转化模块;
    所述转化模块,将所述语音数据转化为一与所述语音数据对应的文字数据,并将所述文字数据发送到所述合成模块;
    所述合成模块,合成所述文字数据,生成一与所述语音匹配的水印,并将所述水印发送至所述嵌入模块;
    所述嵌入模块与所述合成模块通讯连接,将所述水印嵌入一图像中,以生成一带有所述水印的水印图像。
  7. 如权利要求6所述的水印添加系统,其特征在于,所述语音包含男声、女声、童声中的一种或多种。
  8. 如权利要求6所述的水印添加系统,其特征在于,所述识别模块进一步包含去除所述语音信息中的噪点。
  9. 如权利要求6所述的水印添加系统,其特征在于,所述水印添加系统进一步包含一储存模块,与所述合成模块和所述嵌入模块通讯连接,用于储存所述水印。
  10. 如权利要求9所述的水印添加系统,其特征在于,当所述储存模块储存多个所述水印后,所述嵌入模块进一步包含选择需要嵌入的所述水印。
PCT/CN2017/095219 2017-07-31 2017-07-31 一种基于智能终端的水印添加方法及水印添加系统 WO2019023867A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/095219 WO2019023867A1 (zh) 2017-07-31 2017-07-31 一种基于智能终端的水印添加方法及水印添加系统

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/095219 WO2019023867A1 (zh) 2017-07-31 2017-07-31 一种基于智能终端的水印添加方法及水印添加系统

Publications (1)

Publication Number Publication Date
WO2019023867A1 true WO2019023867A1 (zh) 2019-02-07

Family

ID=65232182

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/095219 WO2019023867A1 (zh) 2017-07-31 2017-07-31 一种基于智能终端的水印添加方法及水印添加系统

Country Status (1)

Country Link
WO (1) WO2019023867A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117995165A (zh) * 2024-04-03 2024-05-07 中国科学院自动化研究所 基于隐变量空间添加水印的语音合成方法、装置及设备
CN117995165B (zh) * 2024-04-03 2024-05-31 中国科学院自动化研究所 基于隐变量空间添加水印的语音合成方法、装置及设备

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101345054A (zh) * 2008-08-25 2009-01-14 苏州大学 用于声频文件的数字水印制作及识别方法
US20110164784A1 (en) * 2008-03-14 2011-07-07 Bernhard Grill Embedder for embedding a watermark into an information representation, detector for detecting a watermark in an information representation, method and computer program and information signal
CN103377234A (zh) * 2012-04-26 2013-10-30 宇龙计算机通信科技(深圳)有限公司 一种多媒体数据中添加水印的方法及系统
CN105761722A (zh) * 2014-12-13 2016-07-13 哈尔滨功成科技创业投资有限公司 一种音频数字水印系统

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110164784A1 (en) * 2008-03-14 2011-07-07 Bernhard Grill Embedder for embedding a watermark into an information representation, detector for detecting a watermark in an information representation, method and computer program and information signal
CN101345054A (zh) * 2008-08-25 2009-01-14 苏州大学 用于声频文件的数字水印制作及识别方法
CN103377234A (zh) * 2012-04-26 2013-10-30 宇龙计算机通信科技(深圳)有限公司 一种多媒体数据中添加水印的方法及系统
CN105761722A (zh) * 2014-12-13 2016-07-13 哈尔滨功成科技创业投资有限公司 一种音频数字水印系统

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117995165A (zh) * 2024-04-03 2024-05-07 中国科学院自动化研究所 基于隐变量空间添加水印的语音合成方法、装置及设备
CN117995165B (zh) * 2024-04-03 2024-05-31 中国科学院自动化研究所 基于隐变量空间添加水印的语音合成方法、装置及设备

Similar Documents

Publication Publication Date Title
US10692480B2 (en) System and method of reading environment sound enhancement based on image processing and semantic analysis
AU2015394928B2 (en) Multimedia service pushing method and system based on two-dimensional code
US9288302B2 (en) Apparatus and method for reproducing handwritten message by using handwriting data
US20150127348A1 (en) Document distribution and interaction
CN110767209B (zh) 语音合成方法、装置、系统和存储介质
CN102609968B (zh) 实现有声图片的方法及系统
CN106098078B (zh) 一种可过滤扬声器噪音的语音识别方法及其系统
US7707241B2 (en) Determining type of signal encoder
WO2009075428A1 (en) Apparatus for and method of generating a multimedia email
KR100613859B1 (ko) 개인 휴대 단말기를 위한 멀티미디어 데이터 편집, 제공장치 및 방법
KR20190066537A (ko) 음성인식 기반의 사진 공유 방법, 장치 및 시스템
US8773696B2 (en) Method and system for generating document using speech data and image forming apparatus including the system
WO2019076120A1 (zh) 一种图像处理的方法、装置、存储介质及电子装置
CN113033245A (zh) 一种功能调节方法、装置、存储介质及电子设备
KR100684457B1 (ko) 이동통신단말의 외부 음원 인식을 이용하여 사용자에게고유정보를 제공하는 고유정보 제공 시스템, 고유정보 제공방법 및 그 이동통신단말
Tanwar et al. Audio steganography
CN113571048B (zh) 一种音频数据检测方法、装置、设备及可读存储介质
TW201405546A (zh) 可語音控制之點歌系統及其運作流程
CN112599130B (zh) 一种基于智慧屏的智能会议系统
CN113515594A (zh) 意图识别方法、意图识别模型训练方法、装置及设备
WO2019023867A1 (zh) 一种基于智能终端的水印添加方法及水印添加系统
US9077813B2 (en) Masking mobile message content
Koenig et al. Forensic authentication of digital audio and video files
CN109241331B (zh) 一种面向智能机器人的故事数据处理方法
KR102269123B1 (ko) 비대면 녹취록 자동 생성 시스템

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17919804

Country of ref document: EP

Kind code of ref document: A1