WO2022062195A1 - 机上信息辅助方法及装置 - Google Patents

机上信息辅助方法及装置 Download PDF

Info

Publication number
WO2022062195A1
WO2022062195A1 PCT/CN2020/135366 CN2020135366W WO2022062195A1 WO 2022062195 A1 WO2022062195 A1 WO 2022062195A1 CN 2020135366 W CN2020135366 W CN 2020135366W WO 2022062195 A1 WO2022062195 A1 WO 2022062195A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
voice data
voice
single piece
speech
Prior art date
Application number
PCT/CN2020/135366
Other languages
English (en)
French (fr)
Inventor
徐舒寒
张炯
李博
Original Assignee
中国商用飞机有限责任公司北京民用飞机技术研究中心
中国商用飞机有限责任公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国商用飞机有限责任公司北京民用飞机技术研究中心, 中国商用飞机有限责任公司 filed Critical 中国商用飞机有限责任公司北京民用飞机技术研究中心
Priority to EP21871657.9A priority Critical patent/EP4044179A4/en
Priority to PCT/CN2021/120852 priority patent/WO2022063288A1/zh
Publication of WO2022062195A1 publication Critical patent/WO2022062195A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G5/00Traffic control systems for aircraft, e.g. air-traffic control [ATC]
    • G08G5/0004Transmission of traffic-related information to or from an aircraft
    • G08G5/0013Transmission of traffic-related information to or from an aircraft with a ground station
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G5/00Traffic control systems for aircraft, e.g. air-traffic control [ATC]
    • G08G5/0017Arrangements for implementing traffic-related aircraft activities, e.g. arrangements for generating, displaying, acquiring or managing traffic information
    • G08G5/0021Arrangements for implementing traffic-related aircraft activities, e.g. arrangements for generating, displaying, acquiring or managing traffic information located in the aircraft
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection

Definitions

  • the invention relates to the field of aviation, and in particular, to an on-board information assistance method and device, which assist pilots in listening, replaying, and querying air traffic control information during flight missions, thereby improving the convenience and accuracy of obtaining control instructions and reducing Communicate costs and provide safety guarantees for aircraft navigation.
  • the controller communicates with the pilot through land-air voice, and the controller issues control instructions to the pilot, instructing the pilot to operate the aircraft as required.
  • the role of the controller is particularly important. Many important decisions in aircraft navigation are made by the controller based on the content of the dialogue between him and the pilot. Due to the professionalism, regional differences and personnel complexity of air traffic control, there are a large number of professional terms, unique regional names, mixed Chinese and English, and accent differences in the air traffic control voice, coupled with factors such as noisy environment and communication link interference. It is very likely that due to very small mistakes, such as misunderstanding of information, omission of information, etc., the pilot will misjudge the controller's voice, which will bring huge losses to the aviation field. Therefore, for the pilot on the plane, an auxiliary means is urgently needed in the process of communicating with the air traffic control.
  • speech recognition has been proposed as a technical approach to assist air traffic control information.
  • it is difficult to achieve accurate and efficient speech recognition in the air traffic control system which is mainly reflected in several aspects:
  • First, the environment in which the air traffic control system is located is special, which directly determines the performance of the air traffic control speech recognition in the voice information collection link. Particularity; secondly, there are special definitions for the pronunciation of numbers, letters, flight numbers, runways, etc.
  • the air traffic control speech recognition system has strict requirements on the accuracy of recognition.
  • speech recognition methods dedicated to air traffic control in the existing technology, but the accuracy rate is not high enough in general, so they are mostly used in ground scenarios, such as auxiliary control command quality assessment, post-event analysis, and workload assessment. and other ground-to-air call data analysis work. Therefore, there is an urgent need for a method to assist pilots on the plane, improve the convenience and accuracy of air traffic control communication, and provide safety guarantees for aircraft navigation.
  • the embodiments of the present invention provide an on-board information assistance method and device, so as to at least solve the technical problem of insufficient convenience and accuracy in obtaining management and control instructions in the prior art.
  • an on-board information assistance method including: acquiring target voice data; dividing the target voice data to obtain a single piece of voice data; performing voice recognition according to the single piece of voice data, generating text data; displaying the single piece of voice data and the text data.
  • the target voice data is land-air voice real-time call data captured through a voice communication link.
  • the dividing the target voice data to obtain a single piece of voice data includes: obtaining voice information entropy according to the target voice data, where the voice information entropy represents the complexity of the target voice data. ; According to the voice information entropy, the sentences in the target voice data are segmented and segmented, and the segmented voice data is output as the single piece of voice data.
  • the method further includes: storing the single piece of speech data and the text data.
  • the displaying the single piece of voice data and the text data includes: playing the single piece of voice data; and displaying the corresponding text data according to the played single piece of voice data.
  • the number of times of playing the single piece of voice data is at least once.
  • an on-board information assistance device including: an acquisition module for acquiring target voice data; a segmentation module for splitting the target voice data to obtain a single piece of voice data
  • the text module is used to perform speech recognition on the single piece of voice data to generate text data; the display module is used to display the single piece of voice data and the text data.
  • the target voice data is land-air voice real-time call data captured through a voice communication link.
  • the segmentation module includes: an acquisition unit, configured to acquire voice information entropy according to the target voice data, wherein the voice information entropy represents the complexity of the target voice data; a segmentation unit, used According to the speech information entropy, the sentence in the target speech data is segmented and segmented; the output unit is used for outputting the segmented speech data as the single piece of speech data.
  • the apparatus further includes: a storage unit configured to store a single piece of voice data and the text data.
  • the display module includes: a playing unit for playing the single piece of voice data; and a display unit for displaying text data corresponding to the single piece of voice data.
  • the number of times of playing the single piece of voice data is at least once.
  • a computer program product comprising instructions which, when executed on a computer, cause the computer to perform an on-board information assistance method.
  • a non-volatile storage medium includes a stored program, wherein the program controls the non-volatile storage medium when running The device on which it is located, executes an on-board information assistance method.
  • an electronic device including a processor and a memory; the memory stores computer-readable instructions, and the processor is configured to execute the computer-readable instructions, wherein, The computer readable instructions execute an on-board information assistance method when executed.
  • acquiring target voice data is adopted; dividing the target voice data to obtain a single piece of voice data; performing voice recognition according to the single piece of voice data to generate text data;
  • the method of displaying the text data described above solves the technical problem of insufficient convenience and accuracy in obtaining management and control instructions in the prior art by segmenting and recognizing the voice.
  • Fig. 1 is a flow chart of an on-board information assistance method according to an embodiment of the present invention
  • FIG. 2 is a structural block diagram of an on-board information assistance device according to an embodiment of the present invention.
  • FIG. 3 is a schematic diagram of the effect of a method for assisting air traffic control information on a seed aircraft according to an embodiment of the present invention.
  • a method embodiment of an on-board information assistance method is provided. It should be noted that the steps shown in the flowchart of the accompanying drawing can be executed in a computer system such as a set of computer-executable instructions, Also, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in an order different from that herein.
  • FIG. 1 is a flowchart of an on-board information assistance method according to an embodiment of the present invention. As shown in FIG. 1 , the method includes the following steps:
  • Step S102 acquiring target voice data.
  • a dedicated voice communication link needs to be established, and a voice capture program is set to record and record the voice data transmitted in the voice communication link. Intercept and transmit all the voice data to the processor for subsequent voice analysis processing.
  • the acquisition of the target voice data may be performed on a land control device or an airborne voice control device.
  • the specific control device selected for voice capture and transmission is not specifically limited here.
  • the target voice data is land-air voice real-time call data captured through a voice communication link.
  • the acquisition of the target voice data can be conducted through the voice communication link, real-time call data, because the real-time nature is often very important when the land and air pilots conduct voice communication, and real-time voice calls can be made between land and air.
  • the dynamic call that is constantly updated in the air is maintained to achieve the effect of safety. Therefore, when capturing voice, it is necessary to perform real-time voice capture according to the data in the communication link. The smaller the capture delay, the less the impact on the coordinated control of voice on land and in the air. smaller.
  • Step S104 segment the target voice data to obtain a single piece of voice data.
  • the dividing the target voice data to obtain a single piece of voice data includes: obtaining voice information entropy according to the target voice data, where the voice information entropy represents the complexity of the target voice data. ; According to the speech information entropy, the sentences in the target speech data are segmented and segmented, and the segmented speech data is output as the single piece of speech data.
  • the voice information entropy is obtained according to the target voice data, wherein the voice information entropy represents the complexity of the target voice data.
  • the voice data can be obtained according to the complexity of the voice data
  • the complexity and length of the data are automatically determined, and how to perform speech segmentation work is automatically determined, and the segmented speech data can be used for textual speech recognition operations more efficiently. Segment and segment the sentences in the target speech data according to the speech information entropy, and output the segmented speech data as the single piece of speech data.
  • information entropy is a rather abstract concept in mathematics.
  • information entropy may be understood as the probability of occurrence of certain information.
  • the information entropy and thermodynamic entropy are closely related. According to Charles H. Bennett's reinterpretation of Maxwell's Demon, the destruction of information is an irreversible process, so the destruction of information is in line with the second law of thermodynamics.
  • the generation of information is the process of introducing negative (thermodynamic) entropy into the system. So the sign of information entropy and thermodynamic entropy should be opposite.
  • information entropy can represent the value of information. In this way, we have a standard for measuring the value of information, and we can make more inferences about knowledge circulation.
  • x represents a random variable, which corresponds to the set of all possible outputs, which is defined as a symbol set, and the output of a random variable is represented by x.
  • P(x) represents the output probability function. The greater the uncertainty of the variable, the greater the entropy, and the greater the amount of information required to figure it out.
  • Step S106 Perform speech recognition according to the single piece of speech data to generate text data.
  • the method further includes: storing the single piece of speech data and the text data.
  • the playback content of a single piece of voice data is "there is thunderstorm in the air, land as soon as possible”
  • the text content after being recognized by the speech recognition algorithm is “there is thunderstorm in the air”, ",”, “land as soon as possible”. Therefore, finally, through the voice-text splicing operation, the above-identified text content is spliced to obtain the text data of "there is thunderstorm in the air, land as soon as possible”, and then display it to the pilot, prompting the pilot to land as soon as possible.
  • Step S108 displaying the single piece of voice data and the text data.
  • the displaying the single piece of voice data and the text data includes: playing the single piece of voice data; and displaying the corresponding text data according to the played single piece of voice data.
  • FIG. 3 is a schematic diagram of the effect of the method for assisting air traffic control information on a kind of aircraft according to an embodiment of the present invention. According to FIG. 3, it can be seen that when the user clicks the voice to play, the text display below the voice can be seen, The technical effect of increasing the accuracy of the voice information obtained by the user is achieved.
  • the number of times of playing the single piece of voice data is at least once.
  • each single piece of voice data can be played at least once, that is, played multiple times according to user needs to To achieve complete execution of the content or command in the voice.
  • FIG. 2 is a structural block diagram of an in-flight information assistance device according to an embodiment of the present invention. As shown in FIG. 2 , the device includes:
  • the acquiring module 20 is used for acquiring target voice data.
  • a dedicated voice communication link needs to be established, and a voice capture program is set to record and intercept the voice data transmitted in the voice communication link. , and transmit all the voice data to the processor for subsequent voice analysis processing.
  • the acquisition of the target voice data may be performed on a land control device or an airborne voice control device.
  • the specific control device selected for voice capture and transmission is not specifically limited here.
  • the target voice data is land-air voice real-time call data captured through a voice communication link.
  • the acquisition of the target voice data can be conducted through the voice communication link, real-time call data, because the real-time nature is often very important when the land and air pilots conduct voice communication, and real-time voice calls can be made between land and air.
  • the dynamic call that is constantly updated in the air is maintained to achieve the effect of safety. Therefore, when capturing voice, it is necessary to perform real-time voice capture according to the data in the communication link. The smaller the capture delay, the less the impact on the coordinated control of voice on land and in the air. smaller.
  • the segmentation module 22 is used for segmenting the target voice data to obtain a single piece of voice data.
  • the segmentation module includes: an acquisition unit, configured to acquire voice information entropy according to the target voice data, wherein the voice information entropy represents the complexity of the target voice data; a segmentation unit, used performing segmentation and segmentation on the sentences in the target speech data according to the speech information entropy; the output unit is used for outputting the segmented speech data as the single piece of speech data.
  • the voice information entropy is obtained according to the target voice data, wherein the voice information entropy represents the complexity of the target voice data.
  • the voice data can be obtained according to the complexity of the voice data According to the complexity and length, it can automatically determine how to perform speech segmentation, and the segmented speech data can be used for textual speech recognition operations more efficiently. Segment and segment the sentences in the target speech data according to the speech information entropy, and output the segmented speech data as the single piece of speech data.
  • information entropy is a rather abstract concept in mathematics.
  • information entropy may be understood as the probability of occurrence of certain information.
  • the information entropy and thermodynamic entropy are closely related. According to Charles H. Bennett's reinterpretation of Maxwell's Demon, the destruction of information is an irreversible process, so the destruction of information is in line with the second law of thermodynamics.
  • the generation of information is the process of introducing negative (thermodynamic) entropy into the system. So the sign of information entropy and thermodynamic entropy should be opposite.
  • information entropy can represent the value of information. In this way, we have a standard for measuring the value of information, and we can make more inferences about knowledge circulation.
  • x represents a random variable, which corresponds to the set of all possible outputs, which is defined as a symbol set, and the output of a random variable is represented by x.
  • P(x) represents the output probability function. The greater the uncertainty of the variable, the greater the entropy, and the greater the amount of information required to figure it out.
  • the text module 24 is configured to perform speech recognition according to the single piece of speech data to generate text data.
  • the apparatus further includes: a storage unit configured to store the single piece of voice data and the text data.
  • the playback content of a single piece of voice data is "there is thunderstorm in the air, land as soon as possible”
  • the text content after being recognized by the speech recognition algorithm is “there is thunderstorm in the air”, ",”, “land as soon as possible”. Therefore, finally, the text content identified above is spliced through the voice-text splicing operation to obtain the text data of "there is thunderstorm in the air, land as soon as possible”, and then display it to the pilot, prompting the pilot to land as soon as possible.
  • the presentation module 26 is configured to present the single piece of voice data and the text data.
  • the display module includes: a playing unit for playing the single piece of voice data; and a display unit for displaying the corresponding text data according to the played single piece of voice data.
  • FIG. 3 is a schematic diagram of the effect of the method for assisting air traffic control information on a kind of aircraft according to an embodiment of the present invention. According to FIG. 3, it can be seen that when the user clicks the voice to play, the text display below the voice can be seen, The technical effect of increasing the accuracy of the voice information obtained by the user is achieved.
  • the number of times of playing the single piece of voice data is at least once.
  • each single piece of voice data can be played at least once, that is, played multiple times according to user needs to To achieve complete execution of the content or command in the voice.
  • a computer program product comprising instructions which, when executed on a computer, cause the computer to execute an on-board information assistance method.
  • the above-mentioned on-board information assistance method includes: acquiring target voice data; dividing the target voice data to obtain a single piece of voice data; performing voice recognition according to the single piece of voice data to generate text data; The text data is displayed.
  • a non-volatile storage medium is further provided, and the non-volatile storage medium includes a stored program, wherein the program controls the location of the non-volatile storage medium when running.
  • the device implements an on-board information assistance method.
  • the above-mentioned on-board information assistance method includes: acquiring target voice data; dividing the target voice data to obtain a single piece of voice data; performing voice recognition according to the single piece of voice data to generate text data; The text data is displayed.
  • an electronic device including a processor and a memory; the memory stores computer-readable instructions, and the processor is configured to execute the computer-readable instructions, wherein, The computer readable instructions execute an on-board information assistance method when executed.
  • the above-mentioned on-board information assistance method includes: acquiring target voice data; dividing the target voice data to obtain a single piece of voice data; performing voice recognition according to the single piece of voice data to generate text data; The text data is displayed.
  • the disclosed technical content can be implemented in other ways.
  • the device embodiments described above are only illustrative, for example, the division of the units may be a logical function division, and there may be other division methods in actual implementation, for example, multiple units or components may be combined or Integration into another system, or some features can be ignored, or not implemented.
  • the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of units or modules, and may be in electrical or other forms.
  • the units described as separate components may or may not be physically separated, and components shown as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
  • the above-mentioned integrated units may be implemented in the form of hardware, or may be implemented in the form of software functional units.
  • the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium.
  • the technical solution of the present invention is essentially or the part that contributes to the prior art, or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present invention.
  • the aforementioned storage medium includes: U disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disk or optical disk and other media that can store program codes .

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Aviation & Aerospace Engineering (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Telephonic Communication Services (AREA)
  • Traffic Control Systems (AREA)
  • Machine Translation (AREA)

Abstract

一种机上信息辅助方法及装置。方法包括:获取目标语音数据(S102);将目标语音数据进行切分,得到单条语音数据(S104);根据单条语音数据进行语音识别,生成文本数据(S106);将单条语音数据和文本数据进行展示(S108)。解决了现有技术中获取管控指令时的便捷性和准确性不够高的技术问题。

Description

机上信息辅助方法及装置 技术领域
本发明涉及航空领域,具体而言,涉及一种机上信息辅助方法及装置,在飞行任务中辅助飞行员进行空管信息的收听、回放、查询等,提高获取管控指令的便捷性和准确性,降低沟通成本,为飞机航行提供安全性的保障。
背景技术
在空中交通管制中,管制员与飞行员通过陆空语音进行通话,由管制员向飞行员发出管控指令,指挥飞行员按要求操作飞机。管制员角色尤为重要,飞机航行中许多重要的决策都是由管制员根据其与飞行员之间的对话内容而决定的。由于空管的专业性、地域差异性和人员复杂性,空管语音中存在大量专业名词、独特的地区名称、中英文混杂以及口音差异,加之嘈杂环境、通信链路干扰等因素,在实际操作中,很有可能因为极小失误,例如听错信息、遗漏信息等,导致飞行员对管制员语音的误判,进而给航空领域带来巨大的损失。因此,对于机上的飞行员而言,在与空管交流过程中迫切需要一种辅助手段。
随着自然语言处理技术的发展,人们提出采用语音识别作为一种辅助空管信息的技术途径。然而,在空管系统中实现准确高效的语音识别难度较大,主要体现在几个方面:首先,空管系统所处的环境特殊,这直接决定了空管语音识别在语音信息采集环节上的特殊性;其次,空管系统中对数字、字母、航班号、跑道等的发音有特殊定义,而且空管系统中对话语句的结构、顺序需遵循指定的规则,导致普通的语音识别产品无法应用到空管对话识别中;此外,由于在空管系统中极小的语音识别失误也可能造成巨大的损失,因此空管语音识别系统对识别的准确率要求严格。现有技术中存在一些专用于空管的语音识别方法,但普便存在准确率不够高的情况,于是大多被应用在地面场景中,例如用于辅助管制指挥质量评估、事后分析、工作负荷评估等地空通话数据分析工作。因此迫切需要一种在机上辅助飞行员,提高空管沟通便捷性和准确性的方法,为飞机航行提供安全性的保障。
针对上述的问题,目前尚未提出有效的解决方案。
发明内容
本发明实施例提供了一种机上信息辅助方法及装置,以至少解决现有技术中获取管控指令时的便捷性和准确性不够高的技术问题。
根据本发明实施例的一个方面,提供了一种机上信息辅助方法,包括:获取目标语音数据;将所述目标语音数据进行切分,得到单条语音数据;根据所述单条语音数据进行语音识别,生成文本数据;将所述单条语音数据和所述文本数据进行展示。
可选的,所述目标语音数据是通过语音通信链路捕获的陆空语音实时通话数据。
可选的,所述将所述目标语音数据进行切分,得到单条语音数据包括:根据所述目标语音数据,获取语音信息熵,其中,所述语音信息熵表征所述目标语音数据的复杂度;根据所述语音信息熵,对所述目标语音数据中的语句进行断句和切分,将切分后的语音数据作为所述单条语音数据进行输出。
可选的,在所述根据所述单条语音数据进行语音识别,生成文本数据之后,所述方法还包括:将所述单条语音数据和所述文本数据进行存储。
可选的,所述将所述单条语音数据和所述文本数据进行展示包括:播放所述单条语音数据;根据播放的所述单条语音数据,将对应的所述文本数据进行显示。
可选的,所述单条语音数据的播放次数为至少一次。
根据本发明实施例的另一方面,还提供了一种机上信息辅助装置,包括:获取模块,用于获取目标语音数据;切分模块,用于切分所述目标语音数据,得到单条语音数据;文本模块,用于对所述单条语音数据进行语音识别,生成文本数据;展示模块,用于展示所述单条语音数据和所述文本数据。
可选的,所述目标语音数据是通过语音通信链路捕获的陆空语音实时通话数据。
可选的,所述切分模块包括:获取单元,用于根据所述目标语音数据,获取语音信息熵,其中,所述语音信息熵表征所述目标语音数据的复杂度;切分单元,用于根据所述语音信息熵,对所述目标语音数据中的语句进行断句和切分;输出单元,用于将切分后的语音数据作为所述单条语音数据进行输出。
可选的,所述装置还包括:存储单元,用于存储单条语音数据和所述文本数据。
可选的,所述展示模块包括:播放单元,用于播放所述单条语音数据;显示单元,用于显示所述单条语音数据所对应的文本数据。
可选的,所述单条语音数据的播放次数为至少一次。
根据本发明实施例的另一方面,还提供了一种包括指令的计算机程序产品,当所 述指令在计算机上运行时,使得所述计算机执行一种机上信息辅助方法。
根据本发明实施例的另一方面,还提供了一种非易失性存储介质,所述非易失性存储介质包括存储的程序,其中,所述程序在运行时控制非易失性存储介质所在的设备,执行一种机上信息辅助方法。
根据本发明实施例的另一方面,还提供了一种电子装置,包含处理器和存储器;所述存储器中存储有计算机可读指令,所述处理器用于运行所述计算机可读指令,其中,所述计算机可读指令运行时执行一种机上信息辅助方法。
在本发明实施例中,采用获取目标语音数据;将所述目标语音数据进行切分,得到单条语音数据;根据所述单条语音数据进行语音识别,生成文本数据;将所述单条语音数据和所述文本数据进行展示的方式,通过对语音进行切分和识别,进而解决了现有技术中获取管控指令时的便捷性和准确性不够高的技术问题。
附图说明
此处所说明的附图用来提供对本发明的进一步理解,构成本申请的一部分,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:
图1是根据本发明实施例的一种机上信息辅助方法的流程图;
图2是根据本发明实施例的一种机上信息辅助装置的结构框图;
图3是根据本发明实施例的种机上空管信息辅助方法的效果示意图。
具体实施方式
为了使本技术领域的技术人员更好地理解本发明方案,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分的实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都应当属于本发明保护的范围。
需要说明的是,本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本发明的实施例能够以除了在这里图示或描述的那些以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方 法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。
根据本发明实施例,提供了一种机上信息辅助方法的方法实施例,需要说明的是,在附图的流程图示出的步骤,可以在诸如一组计算机可执行指令的计算机系统中执行,并且,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤。
实施例一
图1是根据本发明实施例的一种机上信息辅助方法的流程图,如图1所示,该方法包括如下步骤:
步骤S102,获取目标语音数据。
具体的,本发明实施例为了实现实时捕获陆地和空中之间的音频语音传输数据,需要建立专用的语音通信链路,并设置语音捕获程序,针对语音通信链路中传输的语音数据进行录制和截取,并将所有的语音数据传输到处理器中,进行后续的语音分析处理。获取目标语音数据可以是在陆地控制设备上进行,也可以是在空中机上语音控制设备来进行,具体选用哪种控制设备进行语音的捕获的传输,在此处不进行具体的限定。
可选的,所述目标语音数据是通过语音通信链路捕获的陆空语音实时通话数据。
具体的,目标语音数据的获取可以是通过语音通信链路进行的、实时地通话数据,由于在陆地和空中飞行员进行语音交流的时候,往往实时性非常重要,实时性的语音通话可以在陆地和空中保持不断更新的动态呼叫,达到安全的效果,因此在捕获语音的时候,需要根据通信链路中的数据进行实时的语音捕获,其中捕获的延迟越小,对陆地空中语音协调控制的影响就越小。
步骤S104,将所述目标语音数据进行切分,得到单条语音数据。
可选的,所述将所述目标语音数据进行切分,得到单条语音数据包括:根据所述目标语音数据,获取语音信息熵,其中,所述语音信息熵表征所述目标语音数据的复杂度;根据所述语音信息熵对所述目标语音数据中的语句进行断句和切分,将切分后的语音数据作为所述单条语音数据进行输出。
具体的,根据所述目标语音数据,获取语音信息熵,其中,所述语音信息熵表征所述目标语音数据的复杂度,本发明实施例在获得了语音数据的复杂度之后才可以根据语音数据的复杂程度、长短程度,自动判定如何进行语音切分工作,切分后的语音 数据可以更效率地进行文字化的语音识别操作。根据所述语音信息熵对所述目标语音数据中的语句进行断句和切分,将切分后的语音数据作为所述单条语音数据进行输出。
需要说明的是,信息熵是一个数学上颇为抽象的概念,在这里不妨把信息熵理解成某种特定信息的出现概率。而信息熵和热力学熵是紧密相关的。根据Charles H.Bennett对Maxwell's Demon的重新解释,对信息的销毁是一个不可逆过程,所以销毁信息是符合热力学第二定律的。而产生信息,则是为系统引入负(热力学)熵的过程。所以信息熵的符号与热力学熵应该是相反的。一般而言,当一种信息出现概率更高的时候,表明它被传播得更广泛,或者说,被引用的程度更高。我们可以认为,从信息传播的角度来看,信息熵可以表示信息的价值。这样子我们就有一个衡量信息价值高低的标准,可以做出关于知识流通问题的更多推论。
信息熵计算公式:
H(x)=E[I(xi)]=E[log(2,1/P(xi))]=-∑P(xi)log(2,P(xi))(i=1,2,..n)
其中,x表示随机变量,与之相对应的是所有可能输出的集合,定义为符号集,随机变量的输出用x表示。P(x)表示输出概率函数。变量的不确定性越大,熵也就越大,把它搞清楚所需要的信息量也就越大。
步骤S106,根据所述单条语音数据进行语音识别,生成文本数据。
可选的,在所述根据所述单条语音数据进行语音识别,生成文本数据之后,所述方法还包括:将所述单条语音数据和所述文本数据进行存储。
具体的,为了将上述分割之后的单条语音数据进行识别,并将识别后的数据转化为与单条语音数据中内容相对应的文本数据,需要通过语音识别算法对单条语音数据中的每一帧语音数据进行识别,最后将识别出的汉字或字符进行拼接合并,得到完整的文本句子并输出。
例如,单条语音数据播放内容为“空中有雷暴天气,尽快降落”,那么通过语音识别算法识别之后的文字内容为“空中有雷暴天气”、“,”、“尽快降落”。因此最后通过语音文本拼接操作,对上述识别出来的文本内容见进行拼接得到“空中有雷暴天气,尽快降落”的文本数据,进而向飞行员进行显示,提示飞行员尽快进行降落。
步骤S108,将所述单条语音数据和所述文本数据进行展示。
可选的,所述将所述单条语音数据和所述文本数据进行展示包括:播放所述单条语音数据;根据播放的所述单条语音数据,将对应的所述文本数据进行显示。
具体的,在语音数据转化文本数据之后,根据飞行员及地面控制人员的需要,本发明实施例将单条语音数据和文本数据进行同时显示,即在播放语音数据的同时显示 相应的文本数据,以便使用者可以不受对方口音等因素影响,直观地了解语音的内容。如图3所示,图3是根据本发明实施例的种机上空管信息辅助方法的效果示意图,根据图3可以看出当使用者点击语音播放的同时,可以看到语音下方的文本显示,达到了增加使用者获取语音信息准确度的技术效果。
可选的,所述单条语音数据的播放次数为至少一次。
具体的,由于某些重要的语音数据需要不止一次进行播放才可以获取完整的语音信息或命令,所以每一个单条语音数据都可以进行至少一次的播放,即根据使用者需求进行多次播放,以达到完整地执行语音中的内容或命令。
通过上述步骤,可以实现增加获取管控指令时的便捷性和准确性的技术效果。
实施例二
图2是根据本发明实施例的一种机上信息辅助装置的结构框图,如图2所示,该装置包括:
获取模块20,用于获取目标语音数据。
具体的,本发明实施例为了实现实时捕获陆地和空中之间的音频语音传输数据,需要建立专用的语音通信链路,并设置语音捕获程序针对语音通信链路中传输的语音数据进行录制和截取,并将所有的语音数据传输到处理器中进行后续的语音分析处理。获取目标语音数据可以是在陆地控制设备上进行,也可以是在空中机上语音控制设备来进行,具体选用哪种控制设备进行语音的捕获的传输,在此处不进行具体的限定。
可选的,所述目标语音数据是通过语音通信链路捕获的陆空语音实时通话数据。
具体的,目标语音数据的获取可以是通过语音通信链路进行的、实时地通话数据,由于在陆地和空中飞行员进行语音交流的时候,往往实时性非常重要,实时性的语音通话可以在陆地和空中保持不断更新的动态呼叫,达到安全的效果,因此在捕获语音的时候,需要根据通信链路中的数据进行实时的语音捕获,其中捕获的延迟越小,对陆地空中语音协调控制的影响就越小。
切分模块22,用于将所述目标语音数据进行切分,得到单条语音数据。
可选的,所述切分模块包括:获取单元,用于根据所述目标语音数据,获取语音信息熵,其中,所述语音信息熵表征所述目标语音数据的复杂度;切分单元,用于根据所述语音信息熵对所述目标语音数据中的语句进行断句和切分;输出单元,用于将切分后的语音数据作为所述单条语音数据进行输出。
具体的,根据所述目标语音数据,获取语音信息熵,其中,所述语音信息熵表征所述目标语音数据的复杂度,本发明实施例在获得了语音数据的复杂度之后才可以根据语音数据的复杂程度、长短程度来自动判定如何进行语音切分工作,切分后的语音数据可以更效率地进行文字化的语音识别操作。根据所述语音信息熵对所述目标语音数据中的语句进行断句和切分,将切分后的语音数据作为所述单条语音数据进行输出。
需要说明的是,信息熵是一个数学上颇为抽象的概念,在这里不妨把信息熵理解成某种特定信息的出现概率。而信息熵和热力学熵是紧密相关的。根据Charles H.Bennett对Maxwell's Demon的重新解释,对信息的销毁是一个不可逆过程,所以销毁信息是符合热力学第二定律的。而产生信息,则是为系统引入负(热力学)熵的过程。所以信息熵的符号与热力学熵应该是相反的。一般而言,当一种信息出现概率更高的时候,表明它被传播得更广泛,或者说,被引用的程度更高。我们可以认为,从信息传播的角度来看,信息熵可以表示信息的价值。这样子我们就有一个衡量信息价值高低的标准,可以做出关于知识流通问题的更多推论。
信息熵计算公式:
H(x)=E[I(xi)]=E[log(2,1/P(xi))]=-∑P(xi)log(2,P(xi))(i=1,2,..n)
其中,x表示随机变量,与之相对应的是所有可能输出的集合,定义为符号集,随机变量的输出用x表示。P(x)表示输出概率函数。变量的不确定性越大,熵也就越大,把它搞清楚所需要的信息量也就越大。
文本模块24,用于根据所述单条语音数据进行语音识别,生成文本数据。
可选的,所述装置还包括:存储单元,用于将所述单条语音数据和所述文本数据进行存储。
具体的,为了将上述分割之后的单条语音数据进行识别,并将识别后的数据转化为与单条语音数据中内容相对应的文本数据,需要通过语音识别算法对单条语音数据中的每一帧语音数据进行识别,最后将识别出的汉字或字符进行拼接合并,得到完整的文本句子并输出。
例如,单条语音数据播放内容为“空中有雷暴天气,尽快降落”,那么通过语音识别算法识别之后的文字内容为“空中有雷暴天气”、“,”、“尽快降落”。因此最后通过语音文本拼接操作对上述识别出来的文本内容见进行拼接得到“空中有雷暴天气,尽快降落”的文本数据,进而向飞行员进行显示,提示飞行员尽快进行降落。
展示模块26,用于将所述单条语音数据和所述文本数据进行展示。
可选的,所述展示模块包括:播放单元,用于播放所述单条语音数据;显示单元, 用于根据播放的所述单条语音数据,将对应的所述文本数据进行显示。
具体的,在语音数据转化文本数据之后,根据飞行员及地面控制人员的需要,本发明实施例将单条语音数据和文本数据进行同时显示,即在播放语音数据的同时显示相应的文本数据,以便使用者可以不受对方口音等因素影响,直观地了解语音的内容。如图3所示,图3是根据本发明实施例的种机上空管信息辅助方法的效果示意图,根据图3可以看出当使用者点击语音播放的同时,可以看到语音下方的文本显示,达到了增加使用者获取语音信息准确度的技术效果。
可选的,所述单条语音数据的播放次数为至少一次。
具体的,由于某些重要的语音数据需要不止一次进行播放才可以获取完整的语音信息或命令,所以每一个单条语音数据都可以进行至少一次的播放,即根据使用者需求进行多次播放,以达到完整地执行语音中的内容或命令。
根据本发明实施例的另一方面,还提供了一种包括指令的计算机程序产品,当所述指令在计算机上运行时,使得所述计算机执行一种机上信息辅助方法。
上述一种机上信息辅助方法包括:获取目标语音数据;将所述目标语音数据进行切分,得到单条语音数据;根据所述单条语音数据进行语音识别,生成文本数据;将所述单条语音数据和所述文本数据进行展示。
根据本发明实施例的另一方面,还提供了一种非易失性存储介质,所述非易失性存储介质包括存储的程序,其中,所述程序运行时控制非易失性存储介质所在的设备执行一种机上信息辅助方法。
上述一种机上信息辅助方法包括:获取目标语音数据;将所述目标语音数据进行切分,得到单条语音数据;根据所述单条语音数据进行语音识别,生成文本数据;将所述单条语音数据和所述文本数据进行展示。
根据本发明实施例的另一方面,还提供了一种电子装置,包含处理器和存储器;所述存储器中存储有计算机可读指令,所述处理器用于运行所述计算机可读指令,其中,所述计算机可读指令运行时执行一种机上信息辅助方法。
上述一种机上信息辅助方法包括:获取目标语音数据;将所述目标语音数据进行切分,得到单条语音数据;根据所述单条语音数据进行语音识别,生成文本数据;将所述单条语音数据和所述文本数据进行展示。
通过上述步骤,可以实现增加获取管控指令时的便捷性和准确性的技术效果。
上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。
在本发明的上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。
在本申请所提供的几个实施例中,应该理解到,所揭露的技术内容,可通过其它的方式实现。其中,以上所描述的装置实施例仅仅是示意性的,例如所述单元的划分,可以为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,单元或模块的间接耦合或通信连接,可以是电性或其它的形式。
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。
所述集成的单元如果以软件功能单元的形式实现,并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可为个人计算机、服务器或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。
以上所述仅是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。

Claims (10)

  1. 一种机上信息辅助方法,其特征在于,包括:
    获取目标语音数据;
    将所述目标语音数据进行切分,得到单条语音数据;
    根据所述单条语音数据进行语音识别,生成文本数据;
    将所述单条语音数据和所述文本数据进行展示。
  2. 根据权利要求1所述的方法,其特征在于,所述目标语音数据是通过语音通信链路捕获的陆空语音实时通话数据。
  3. 根据权利要求1所述的方法,其特征在于,所述将所述目标语音数据进行切分,得到的单条语音数据包括:
    根据所述目标语音数据,获取语音信息熵,其中,所述语音信息熵表征所述目标语音数据的复杂度;
    根据所述语音信息熵,对所述目标语音数据中的语句进行断句和切分;
    将切分后的语音数据作为所述单条语音数据进行输出。
  4. 根据权利要求1所述的方法,其特征在于,在所述根据所述单条语音数据进行语音识别,生成文本数据之后,所述方法还包括:
    将所述单条语音数据和所述文本数据进行存储。
  5. 根据权利要求1所述的方法,其特征在于,所述将所述单条语音数据和所述文本数据进行展示包括:
    播放所述单条语音数据;
    根据播放的所述单条语音数据,将对应的所述文本数据进行显示。
  6. 根据权利要求5所述的方法,其特征在于,所述单条语音数据的播放次数为至少一次。
  7. 一种机上信息辅助装置,其特征在于,包括:
    获取模块,用于获取目标语音数据;
    切分模块,用于切分所述目标语音数据,得到单条语音数据;
    文本模块,用于对所述单条语音数据进行语音识别,生成文本数据;
    展示模块,用于展示所述单条语音数据和所述文本数据。
  8. 根据权利要求7所述的装置,其特征在于,所述目标语音数据,是通过语音通信链路捕获的陆空语音实时通话数据。
  9. 根据权利要求7所述的装置,其特征在于,所述切分模块包括:
    获取单元,用于根据所述目标语音数据,获取语音信息熵,其中,所述语音信息熵表征所述目标语音数据的复杂度;
    切分单元,用于根据所述语音信息熵,对所述目标语音数据中的语句进行断句和切分;
    输出单元,用于将切分后的语音数据作为所述单条语音数据进行输出。
  10. 根据权利要求7所述的装置,其特征在于,所述装置还包括:
    存储单元,用于存储所述单条语音数据和所述文本数据。
PCT/CN2020/135366 2020-09-27 2020-12-10 机上信息辅助方法及装置 WO2022062195A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP21871657.9A EP4044179A4 (en) 2020-09-27 2021-09-27 ON-BOARD INFORMATION ASSISTANCE SYSTEM AND METHOD
PCT/CN2021/120852 WO2022063288A1 (zh) 2020-09-27 2021-09-27 一种机上信息辅助系统和方法

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011033062.4A CN112185390B (zh) 2020-09-27 2020-09-27 机上信息辅助方法及装置
CN202011033062.4 2020-09-27

Publications (1)

Publication Number Publication Date
WO2022062195A1 true WO2022062195A1 (zh) 2022-03-31

Family

ID=73944256

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/135366 WO2022062195A1 (zh) 2020-09-27 2020-12-10 机上信息辅助方法及装置

Country Status (2)

Country Link
CN (1) CN112185390B (zh)
WO (1) WO2022062195A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115188225A (zh) * 2022-07-07 2022-10-14 中国商用飞机有限责任公司 一种空中交通管制的方法、系统及计算机可读介质

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022063288A1 (zh) * 2020-09-27 2022-03-31 中国商用飞机有限责任公司北京民用飞机技术研究中心 一种机上信息辅助系统和方法

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080162118A1 (en) * 2006-12-15 2008-07-03 International Business Machines Corporation Technique for Searching Out New Words That Should Be Registered in Dictionary For Speech Processing
CN101625857A (zh) * 2008-07-10 2010-01-13 新奥特(北京)视频技术有限公司 一种自适应的语音端点检测方法
CN106548778A (zh) * 2016-10-13 2017-03-29 北京云知声信息技术有限公司 一种字符转换规则的生成方法及装置
CN107480143A (zh) * 2017-09-12 2017-12-15 山东师范大学 基于上下文相关性的对话话题分割方法和系统
CN108416052A (zh) * 2018-03-20 2018-08-17 杭州声讯网络科技有限公司 一种针对语义分析行业数据分类方法
CN111210825A (zh) * 2019-12-16 2020-05-29 四川大学 一种增强地空通话管制员情景意识感知的方法与装置

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1204489C (zh) * 2002-04-03 2005-06-01 英华达(南京)科技有限公司 可同步播放相关联的语音及文字的方法
CN1168030C (zh) * 2002-05-27 2004-09-22 北京南山高科技有限公司 语音文本同步播放方法
JP4173056B2 (ja) * 2003-06-10 2008-10-29 株式会社ケンウッド 携帯通信端末及びプログラム
JP4259401B2 (ja) * 2004-06-02 2009-04-30 カシオ計算機株式会社 音声処理装置及び音声符号化方法
US7912592B2 (en) * 2006-06-09 2011-03-22 Garmin International, Inc. Automatic speech recognition system and method for aircraft
KR20100010222A (ko) * 2008-07-22 2010-02-01 주식회사 한스텝 음성 데이터 재생 장치 및 방법
CN101916565A (zh) * 2010-06-24 2010-12-15 北京华安天诚科技有限公司 空管系统中的语音识别方法及语音识别装置
JP5732976B2 (ja) * 2011-03-31 2015-06-10 沖電気工業株式会社 音声区間判定装置、音声区間判定方法、及びプログラム
FR2991805B1 (fr) * 2012-06-11 2016-12-09 Airbus Dispositif d'aide a la communication dans le domaine aeronautique.
US9135916B2 (en) * 2013-02-26 2015-09-15 Honeywell International Inc. System and method for correcting accent induced speech transmission problems
US20160275968A1 (en) * 2013-10-22 2016-09-22 Nec Corporation Speech detection device, speech detection method, and medium
US20160155435A1 (en) * 2013-11-14 2016-06-02 Honeywell International Inc. Aircraft systems and methods for reducing and detecting read-back and hear-back errors
US20150162001A1 (en) * 2013-12-10 2015-06-11 Honeywell International Inc. System and method for textually and graphically presenting air traffic control voice information
US20170294184A1 (en) * 2016-04-08 2017-10-12 Knuedge Incorporated Segmenting Utterances Within Speech
US10803755B2 (en) * 2016-06-20 2020-10-13 The Boeing Company Vehicle operation instruction confirmation
CN106356063A (zh) * 2016-08-28 2017-01-25 桂林市晶准测控技术有限公司 一种对管控语音进行文字识别的方法和系统
CN107527618A (zh) * 2017-07-13 2017-12-29 安徽声讯信息技术有限公司 一种音频文字同步播放系统
CN108847217A (zh) * 2018-05-31 2018-11-20 平安科技(深圳)有限公司 一种语音切分方法、装置、计算机设备及存储介质
CN109065031B (zh) * 2018-08-02 2020-05-12 阿里巴巴集团控股有限公司 语音标注方法、装置及设备
WO2020072759A1 (en) * 2018-10-03 2020-04-09 Visteon Global Technologies, Inc. A voice assistant system for a vehicle cockpit system
CN109326292A (zh) * 2018-12-04 2019-02-12 北京九狐时代智能科技有限公司 一种音频识别结果的生成方法及装置
CN110197135B (zh) * 2019-05-13 2021-01-08 北京邮电大学 一种基于多维分割的视频结构化方法
CN110322870B (zh) * 2019-06-19 2020-10-30 北京信息职业技术学院 一种汉语语音信号切分方法和装置
CN110335609A (zh) * 2019-06-26 2019-10-15 四川大学 一种基于语音识别的地空通话数据分析方法及系统
CN111524504A (zh) * 2020-05-11 2020-08-11 中国商用飞机有限责任公司北京民用飞机技术研究中心 机载语音控制方法和装置
CN111667831B (zh) * 2020-06-08 2022-04-26 中国民航大学 基于管制员指令语义识别的飞机地面引导系统及方法

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080162118A1 (en) * 2006-12-15 2008-07-03 International Business Machines Corporation Technique for Searching Out New Words That Should Be Registered in Dictionary For Speech Processing
CN101625857A (zh) * 2008-07-10 2010-01-13 新奥特(北京)视频技术有限公司 一种自适应的语音端点检测方法
CN106548778A (zh) * 2016-10-13 2017-03-29 北京云知声信息技术有限公司 一种字符转换规则的生成方法及装置
CN107480143A (zh) * 2017-09-12 2017-12-15 山东师范大学 基于上下文相关性的对话话题分割方法和系统
CN108416052A (zh) * 2018-03-20 2018-08-17 杭州声讯网络科技有限公司 一种针对语义分析行业数据分类方法
CN111210825A (zh) * 2019-12-16 2020-05-29 四川大学 一种增强地空通话管制员情景意识感知的方法与装置

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115188225A (zh) * 2022-07-07 2022-10-14 中国商用飞机有限责任公司 一种空中交通管制的方法、系统及计算机可读介质

Also Published As

Publication number Publication date
CN112185390B (zh) 2023-10-03
CN112185390A (zh) 2021-01-05

Similar Documents

Publication Publication Date Title
CN109348275B (zh) 视频处理方法和装置
CN110517689B (zh) 一种语音数据处理方法、装置及存储介质
WO2022062195A1 (zh) 机上信息辅助方法及装置
US20220375225A1 (en) Video Segmentation Method and Apparatus, Device, and Medium
CN114465737B (zh) 一种数据处理方法、装置、计算机设备及存储介质
CN110648405B (zh) 一种基于增强现实的飞行操作辅助方法和系统
US10062384B1 (en) Analysis of content written on a board
US11954912B2 (en) Method for cutting video based on text of the video and computing device applying method
KR20220064940A (ko) 음성 생성 방법, 장치, 전자기기 및 저장매체
CN112434139A (zh) 信息交互方法、装置、电子设备和存储介质
US11580994B2 (en) Speech recognition
CN111429924A (zh) 语音交互方法、装置、机器人及计算机可读存储介质
CN113301382B (zh) 视频处理方法、设备、介质及程序产品
WO2022063288A1 (zh) 一种机上信息辅助系统和方法
TWI782436B (zh) 顯示系統以及與顯示系統互動之方法
JP2022088586A (ja) 音声認識方法、音声認識装置、電子機器、記憶媒体コンピュータプログラム製品及びコンピュータプログラム
CN114490967A (zh) 对话模型的训练方法、对话机器人的对话方法、装置和电子设备
CN111128181B (zh) 背诵题评测方法、装置以及设备
CN114925206A (zh) 人工智能体、语音信息识别方法、存储介质和程序产品
WO2023272833A1 (zh) 一种数据检测方法、装置、设备及可读存储介质
Arthur III et al. Performance evaluation of speech recognition systems as a next-generation pilot-vehicle interface technology
CN111209376A (zh) 一种ai数字机器人运行方法
CN114360535B (zh) 语音对话的生成方法、装置、电子设备及存储介质
CN112542163A (zh) 智能语音交互方法、设备及存储介质
US20230267726A1 (en) Systems and methods for image processing using natural language

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20955037

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20955037

Country of ref document: EP

Kind code of ref document: A1