WO2021114479A1 - Three-dimensional display system and method for sound control building information model - Google Patents

Three-dimensional display system and method for sound control building information model Download PDF

Info

Publication number
WO2021114479A1
WO2021114479A1 PCT/CN2020/076321 CN2020076321W WO2021114479A1 WO 2021114479 A1 WO2021114479 A1 WO 2021114479A1 CN 2020076321 W CN2020076321 W CN 2020076321W WO 2021114479 A1 WO2021114479 A1 WO 2021114479A1
Authority
WO
WIPO (PCT)
Prior art keywords
command
dimensional display
voice
module
building information
Prior art date
Application number
PCT/CN2020/076321
Other languages
French (fr)
Chinese (zh)
Inventor
林佳瑞
Original Assignee
清华大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 清华大学 filed Critical 清华大学
Publication of WO2021114479A1 publication Critical patent/WO2021114479A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems

Definitions

  • the invention relates to a three-dimensional display system and method for a voice-controlled building information model, and belongs to the technical field of building information.
  • Building Information Model is an engineering data model based on three-dimensional digital technology that integrates various related information of construction projects, and is a digital expression of the physical and functional characteristics of engineering project facilities.
  • a complete building information model can connect data, processes, and engineering resources at different stages of the life of a building project, and a complete description of the engineering object, which can be commonly used by construction project owners, designers, constructors, and users.
  • BIM is a unified engineering data source for all stages of construction projects, which fully integrates various data such as text, pictures, and monitoring designed at each stage of engineering design, construction and operation, and supports all parties to query, utilize, update and store in the process of work Information to assist project management and decision-making.
  • construction projects have the characteristics of large volume, long period, and many specialties, a large amount of data is used and generated in each stage of design, construction and operation. At the same time, it also involves various professional fields such as schedule, cost, and quality. Each stage and field often Different software and data formats are adopted, and various stages and fields are interdependent. Therefore, construction project data has typical big data characteristics. In addition, each professional business of construction engineering relies on other professional data. The project management and operation stage involves the comprehensive analysis and utilization of multiple professional data. At the same time, it also requires relevant engineering personnel to master multiple professional knowledge, and put forward proposals for the generation, management and use of engineering data. High demands.
  • BIM includes complete design data for various disciplines such as architecture, structure, electromechanical pipeline, etc., including the three-dimensional shape of each component of the building, as well as the characteristics and parameters of the corresponding shape.
  • existing BIM software all rely on a 3D display engine for 3D model display and interaction.
  • model rendering, baking and other technologies to improve the effect.
  • hardware terminals such as desktop computers, virtual reality helmets, augmented reality and mixed reality glasses to perform the three-dimensional display and visualization of BIM. Therefore, how to efficiently and quickly control the 3D display and visualization effects of BIM is very important.
  • This method is mainly used for BIM software running on desktop computer equipment.
  • the 3D display engine of the BIM software often implements interactive operations such as rotation, zooming, and translation of the BIM 3D display through operations such as mouse clicks, drags, and scroll wheels.
  • the first-person perspective of the three-dimensional scene roaming can also be realized through keyboard commands.
  • the selection, display and hiding of professional components in BIM can be selected by mouse clicking, frame selection of corresponding components, and corresponding functions can be realized based on clicking menu commands or keyboard shortcut commands.
  • This method requires the user to have a good understanding of BIM software and know which interactive method to use to realize the 3D display control of the model.
  • This method is similar to method (1), and is mainly used for devices such as desktop computers, tablet computers, or mobile phones with touch screens.
  • This type of equipment can simulate mouse clicks, scroll wheels and other operations through interactive methods such as finger click, two-finger opening and closing, and dragging, so as to realize the process of selecting, zooming, and panning various professional components in BIM.
  • the display and hiding of the BIM model and the interaction of the three-dimensional scene roaming can also be realized through the touch screen clicking menu commands.
  • This method is mainly used for head-mounted display devices such as virtual reality helmets.
  • Users can interact with the BIM three-dimensional display scene by means of virtual reality handles and other methods.
  • the cursor can be moved in the virtual scene by moving the handle, and the cursor can be clicked by clicking the handle button. Therefore, the corresponding operation can be equivalent to the way (1) Mouse movement, click and other operations, so as to realize the selection of BIM model, the click of menu commands and other functions, and the corresponding interactive control of BIM model selection, display, and hiding can also be realized. .
  • This method is mainly used for devices such as augmented reality eyes, mixed reality eyes, and virtual reality helmets.
  • the user can simulate the mouse click operation in the similar way (1) by operating a virtual point in the air to realize BIM model selection or menu command click, so as to realize the interaction of BIM model display and hiding;
  • the user can wave the palm of the hand ,
  • the arm realizes operations such as translation or rotation of the model, and the zooming interaction of the model can also be realized by opening and closing the hands or pinching the palm;
  • the user can use the change of the body posture to realize the interaction of the BIM three-dimensional display, such as by turning, moving forward and backward The roaming function of the first person perspective in the virtual scene.
  • the model display and hide operation is cumbersome: If you need to show or hide some components in BIM, first, you need to select the components by mouse click/frame selection, finger click, etc., then you need to switch commands by clicking the menu, and finally click
  • the display or hide command realizes the function of model display and hide control. If there are more model components and more software menu commands, this process often requires multiple clicks and menu command switching to complete.
  • BIM often contains multiple professional data, and the typical user scenario usually only needs to view the model of each single professional or area, which means that it is often necessary to select many components, and control the display and hiding of each component.
  • the existing BIM software is mainly design and construction management software. Its interface and functions involve multiple majors, with multiple functions and complex interface layout.
  • the purpose of the present invention is to provide a voice-activated building information model three-dimensional display system and method, which realizes the automatic processing and conversion of user voice input, the generation and execution of three-dimensional display commands, and the automatic update of three-dimensional display effects, which effectively improves the design Teachers, managers, owners and other users interactively control the efficiency of the three-dimensional display of BIM data, which reduces learning costs and improves the user-friendliness of BIM software.
  • the invention discloses a three-dimensional display system for a voice-controlled building information model, which includes: a voice input module, a voice recognition and processing module, a command processing module, and a three-dimensional display module;
  • the voice input module is used to collect user voice and convert the user voice to standard voice
  • the data format is transmitted to the voice recognition and processing module;
  • the voice recognition and processing module converts the user's voice collected by the voice input module into commands that can be recognized by the building information model, and transmits the commands to the command processing module;
  • the command processing module performs commands on the received commands Analyze and execute, and transmit the execution result of the command to the 3D display module;
  • the 3D display module updates and outputs the 3D display effect of the building information model in real time according to the execution result of the received command.
  • the voice recognition and processing module includes a three-dimensional display command database, and the three-dimensional display command database stores three-dimensional display commands related to the building information model and the corresponding operation objects of the commands.
  • the speech recognition and processing module includes a building information model metadata database, a metadata model for storing building information model data in the building information model metadata database, and the correspondence between the metadata of the building information model and the three-dimensional display commands in the three-dimensional display command database.
  • Metadata The model includes the data entities and their types and attributes contained in the building information model.
  • the voice recognition and processing module is a general machine learning or deep learning voice recognition model, which realizes the conversion of user voice input to text, and combines the metadata of the building information model and the three-dimensional display command data to process the text generated by the voice input;
  • the latter text is grammatically analyzed, and the generated grammar tree is preliminarily checked to ensure that the user's voice input generates effective three-dimensional display control and interactive commands.
  • the command processing module includes a command parsing module and a command execution module.
  • the command parsing module is based on the output commands of the voice recognition and processing module, and generates building information model data or views and viewpoints based on the relationship between the three-dimensional display commands and their operating objects At the same time, a three-dimensional display command call sequence is generated, and the filter command sequence and the three-dimensional display command call sequence are constructed as the data processing and command call command stream of the building information model, and the command stream is transmitted to the command execution module.
  • command execution module executes the command stream output by the command analysis module, obtains and queries data related to the command stream, applies the command stream to the relevant data, and sends the command execution result to the three-dimensional display module.
  • the present invention also provides a voice-activated building information model three-dimensional display method, which is implemented by any of the above-mentioned voice-activated building information model three-dimensional display systems, including the following steps: S1 acquires voice input in real time, and converts the voice input into a standard voice data format for transmission Go to the next step; S2 converts the user's voice collected by the voice input module into commands that can be recognized by the building information model, and transmits the commands to the command processing module; S3 command processing module parses and executes the received commands to generate building information model data Or view, viewpoint filtering command sequence and three-dimensional display command call sequence, construct the data processing and command call command flow of building information model; execute data processing and command call command flow, automatically obtain and query data related to the command flow, and send commands The flow acts on the relevant data and sends the command execution result to the 3D display module; S4 automatically updates the display result of the 3D display module based on the command execution result, realizing dynamic interaction with the user and presentation of the display result.
  • S2 specifically includes the following steps: S2.1 uses algorithm modules such as machine learning or deep learning to realize the automatic conversion of user voice input to text, and generates text input corresponding to the voice input; S2.2 uses statistical models and machine learning models Or deep learning model to segment the text; S2.3 uses natural speech processing models and algorithms to tag the word segmentation results; S2.4 text disambiguation: clean up stop words and unify synonyms into a standard word; S2 .5 Based on the part-of-speech tagging in step S2.3 and the result of text disambiguation in step S2.4, a natural speech processing algorithm is used to construct a grammar tree of the user input text.
  • algorithm modules such as machine learning or deep learning to realize the automatic conversion of user voice input to text, and generates text input corresponding to the voice input
  • S2.2 uses statistical models and machine learning models Or deep learning model to segment the text
  • S2.3 uses natural speech processing models and algorithms to tag the word segmentation results
  • S2 also includes: S2.6 importing the data in the 3D display command database and the building information model metadata database, adjusting the word segmentation result in step S2.2 according to the data in the database, and combining the word segmentation result and the grammar tree construction result with the read database Compare and analyze the data in the middle, check and confirm that all parts of the grammar tree can be converted into 3D display control commands and their operation object information; if they cannot be converted into 3D display control commands, the voice control process will be terminated and the next user voice input will be awaited.
  • step S3 specifically includes the following steps: S3.1 receives the command generated in the voice input module, and imports the data in the three-dimensional display command database and the building information model metadata database; S3.2 identifies and determines which building information model data needs to be extracted Or view, viewpoint; S3.3 generates a data object filtering command sequence corresponding to the user's voice command according to the recognition result in step S3.2; S3.4 recognizes and determines the three-dimensional display command information that needs to be executed; S3.5 according to step S3.
  • the three-dimensional display command recognition result is generated, and the three-dimensional display command call sequence corresponding to the user's voice command is generated;
  • S3.6 sorts the generated data object filtering command sequence and the three-dimensional display command sequence to ensure related commands and execution order, forming data processing and The command invokes the command flow.
  • the present invention has the following advantages due to the above technical scheme:
  • the system and method in this article realize the automatic processing and conversion of user voice input, the generation and execution of three-dimensional display commands, and the automatic update of three-dimensional display effects, which effectively improves designers and management personnel.
  • Owners and other users interactively control the efficiency of the three-dimensional display of BIM data, which reduces learning costs and improves the user-friendliness of BIM software.
  • the system and method in this article can support a variety of BIM three-dimensional display interactive control based on voice recognition, and can use different voice input terminals, different voice recognition algorithm modules, different BIM metadata and three-dimensional display command storage methods, and support different BIM software , 3D display module and virtual reality, augmented reality, and mixed reality environments provide a method for owners, designers, managers, etc. to efficiently and interactively control the BIM 3D display effect.
  • FIG. 1 is a schematic structural diagram of a three-dimensional display system of a voice-controlled building information model in an embodiment of the present invention
  • FIG. 2 is a flowchart of a three-dimensional display method of a voice-controlled building information model in an embodiment of the present invention
  • FIG. 3 is a flowchart of a working method of a speech recognition and processing module in an embodiment of the present invention
  • FIG. 5 is a flowchart of the working method of the command execution module and the three-dimensional display module in an embodiment of the present invention.
  • This embodiment discloses a three-dimensional display system of a voice-controlled building information model, as shown in FIG. 1, which includes:
  • Voice input module is mainly a voice input device such as a microphone of a computer device such as a desktop computer, a tablet computer, and a head-mounted display, but it can also be a voice input device other than a microphone.
  • the voice input module can collect user voice input in real time in different environments, convert the collected voice input into a standard voice data format, and transmit the voice input to the voice recognition and processing module in the standard voice data format.
  • Speech recognition and processing module Based on general machine learning or deep learning speech recognition models, this module converts user voice input into text form, and combines BIM metadata and three-dimensional display command data to process the text generated by voice input . This module will perform grammatical analysis on the processed text and conduct a preliminary check on the generated grammar tree to ensure that the user's voice input generates effective three-dimensional display control and interactive commands. This module can further optimize the results of text processing and grammatical analysis by continuously learning the user's voice habits, so that users can achieve BIM operations without using professional terminology.
  • the BIM metadata and 3D display command data in this module parse and execute the commands received from the voice recognition and processing module, and generate building information model data or views, filtering command sequences of viewpoints, and 3D display Command call sequence, build the data processing and command call command flow of building information model; execute data processing and command call command flow, automatically obtain and query data related to the command flow, and apply the command flow to related data, and send the command execution result To the three-dimensional display module.
  • This module is mainly the display screen of different computer equipment, virtual reality helmet display and other projection and display equipment, etc. This module will be based on the command execution result of the command processing module and the display update or transformation operation on the model. The 3D display effect is updated and output in real time, so as to realize the dynamic interactive 3D display control with the user.
  • the voice recognition and processing module includes a three-dimensional display command database and a BIM metadata database: the three-dimensional display command database is stored in a graph database or an ontology database, and the three-dimensional display commands related to BIM software and the corresponding operation objects of the commands.
  • the operation objects include the views corresponding to the commands. Name, viewpoint name, or component category, etc.
  • the 3D display command database provides support for the voice recognition and processing module to ensure that the user’s voice input can be converted into the corresponding 3D display command or the operating object of the command.
  • it also analyzes and converts the subsequent commands to the 3D display commands of the BIM software. Sequence provides support.
  • the three-dimensional display command database can also adopt other database forms such as relational database, document database, column database and so on.
  • the BIM metadata database mainly adopts the form of graph database or ontology database, which stores the metadata model of the corresponding BIM data of BIM software, that is, the data entities contained in BIM and their types, attributes and other information, as well as the BIM metadata and three-dimensional display command database Correspondence of three-dimensional display commands.
  • the BIM metadata database provides support for speech recognition and processing modules to ensure that the user’s voice input correctly converts BIM data objects and their corresponding three-dimensional display commands.
  • it also lays the foundation for subsequent command analysis and conversion into the three-dimensional display command sequence of BIM software. basis.
  • the BIM metadata database can also adopt other database forms such as relational database, document database, column database and so on.
  • the command processing module includes a command parsing module and a command execution module.
  • the command parsing module is based on the output commands of the voice recognition and processing module, combined with the three-dimensional display command database and the BIM metadata database, according to the three-dimensional display commands and their operation objects
  • the relationship between the building information model data or the view and viewpoint filtering command sequence is generated, and the 3D display command calling sequence is generated at the same time, and the filtering command sequence and the 3D display command calling sequence are constructed as the data processing and command calling command flow of the building information model , And transmit the command stream to the command execution module.
  • the command execution module executes the command stream output by the command analysis module, obtains and queries data related to the command stream, applies the command stream to the relevant data, and sends the command execution result to the three-dimensional display module.
  • this embodiment discloses a three-dimensional display method of a voice-activated building information model, as shown in Fig. 2-5, including the following steps:
  • S1 uses the voice input module to obtain voice input in real time, and convert the voice input into a standard voice data format for transmission to the next step;
  • S2 converts the user's voice collected by the voice input module into commands that can be recognized by the building information model, and transmits the commands to the command processing module; uses the three-dimensional display command database and the three-dimensional display commands and BIM metadata stored in the BIM metadata database to input the voice into the module
  • the output standard speech format data is converted into text, and the text segmentation, stop word clarity, and synonym disambiguation are processed, and then the corresponding grammar tree is constructed, and the grammar tree is checked to determine whether a valid command sequence can be generated. If a valid user command sequence cannot be generated, output a prompt to the user and wait for the next user voice input. If a valid user command sequence can be generated, then enter the user command parsing step.
  • the S3 command processing module parses and executes the received commands, generates the building information model data or view, the filtering command sequence of the viewpoint, and the three-dimensional display command call sequence, constructs the data processing and command call command flow of the building information model; performs data processing and Command calls the command stream, automatically obtains and queries data related to the command stream, applies the command stream to the relevant data, and sends the command execution result to the three-dimensional display module;
  • S4 automatically updates the display result of the 3D display module based on the command execution result, realizing dynamic interaction with the user and the presentation of the display result;
  • the S5 command ends and waits for the next user voice input.
  • the purpose of the data input processing and conversion process is to convert the user's voice input into text and perform subsequent processing to ensure that the user input can generate an effective three-dimensional display control command sequence.
  • the main implementation methods of this step are as follows:
  • S2.1 Speech-to-text Using algorithm modules such as machine learning or deep learning to realize the automatic conversion of user's voice input to text, and generate text input corresponding to the voice input;
  • S2.3 Part-of-speech tagging Use natural speech processing models and algorithms to tag the word segmentation results, including verbs, nouns, quantifiers, etc.;
  • step S2.6 Import the data in the three-dimensional display command database and the building information model metadata database, adjust the word segmentation result in step S2.2 according to the data in the database, compare the word segmentation result and the grammar tree construction result with the data in the read database, and check it And confirm that all parts of the grammar tree can be converted into 3D display control commands and their operation object information; if they cannot be converted into 3D display control commands, the voice control process will be terminated and the next user voice input will be awaited.
  • the function of the user command parsing link is to combine the results of text segmentation, part-of-speech tagging and grammar tree output in step S2 to generate corresponding data object extraction commands and three-dimensional display control command sequences in combination with the characteristics of the BIM software.
  • the main implementation method of this step is as follows:
  • S3.1.2 Command operation object recognition According to the results of text segmentation and part-of-speech tagging, combined with the read BIM metadata and three-dimensional display command data, identify and determine which BIM data or view and viewpoint data objects need to be extracted;
  • S3.1.4 Display command recognition According to the results of text segmentation and part-of-speech tagging, combined with three-dimensional display command data, identify and determine the three-dimensional display command information that needs to be executed;
  • S3.1.6 Ordering sequence of commands According to the constructed syntax tree, combined with the dependency of each node of the syntax tree, sort the generated data object filtering command sequence and three-dimensional display command sequence to ensure the corresponding command and order execution.
  • the main function of user command execution and 3D display update is to update the display results of the 3D display module in real time.
  • the main implementation method of this step is as follows:
  • Extract command operation object group and classify the filtered data objects to form the extracted command operation object;
  • the 3D display module refreshes the display content according to the 3D display update command sent in step S3.2.4, and completes the execution of the user's voice command.
  • Voice input module This module uses the microphone of the desktop computer as the voice input module, and directly reads the voice commands input by the user in real time;
  • Speech recognition and processing module This module and method first use the built-in speech recognition module of the Windows system to realize the speech-to-text function, or the deep learning speech-to-text function realized by TensorFlow or pytorch; the subsequent text segmentation uses the python module jieba To achieve, stop words and synonyms are processed by self-editing algorithm, and syntax tree analysis is implemented by Stanford NLP parser of Stanford University;
  • This module uses the graph database neo4j to store three-dimensional display commands.
  • the main storage content includes Revit display, hide components, and open and switch views. It also includes the data object type and attributes corresponding to the corresponding command. And other information; this module can also be implemented using relational databases, column databases, document databases, etc.;
  • the graph database neo 4j is also used to carry out BIM data related data types, attributes and their inheritance, and association relationships in Revit, in order to construct the correspondence between BIM data objects and text generated by user voice input Provide support; the database can also be implemented using relational databases, column databases, document databases, etc.;
  • User command analysis module This module and method adopts C# voice to carry out the secondary development of Revit, and through the corresponding API and voice input module, voice recognition and processing module, three-dimensional display command database, BIM metadata database for mutual call and data Exchange, and finally generate filtering commands and 3D display control command sequences for Revit views, viewpoint elements and component elements;
  • Command execution module The module and method are also implemented in the secondary development of Revit using C# voice, so as to realize the filtering and command execution of the corresponding Revit data, and call the command of Revit 3D display update;
  • Three-dimensional display module This module is directly composed of the display of the desktop computer and the three-dimensional display interface of Revit.
  • the three-dimensional display effect of the display can be automatically refreshed, so as to realize the three-dimensional display based on the user's voice control. Display interactive control and update.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Computer Graphics (AREA)
  • Geometry (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Processing Or Creating Images (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

A three-dimensional display system and method for a sound control building information model. The system comprises: a voice input module, a voice recognition and processing module, a command processing module and a three-dimensional display module, wherein the voice input module is used for collecting a user voice and transmitting the user voice to the voice recognition and processing module in a standard voice data format; the voice recognition and processing module converts the user voice into a command that can be recognized by a BIM and transmits the command to the command processing module; the command processing module parses and executes the received command and transmits an execution result of the command to the three-dimensional display module; and according to the execution result of the received command, the three-dimensional display module updates a three-dimensional display effect of the BIM and outputs same in real time. The present invention realizes the automatic processing conversion of a user voice input in a BIM and the generation, execution and automatic update of a three-dimensional display command, and effectively improves the efficiency of a user interactively controlling the three-dimensional display of BIM data.

Description

一种声控建筑信息模型三维显示系统和方法Three-dimensional display system and method of voice-controlled building information model 技术领域Technical field
本发明是关于一种声控建筑信息模型三维显示系统和方法,属于建筑信息技术领域。The invention relates to a three-dimensional display system and method for a voice-controlled building information model, and belongs to the technical field of building information.
背景技术Background technique
建筑信息模型(Building Information Model,BIM)是以三维数字技术为基础,集成了建筑工程项目各种相关信息的工程数据模型,是对工程项目设施实体与功能特性的数字化表达。一个完善的建筑信息模型,能够连接建筑项目生命期不同阶段的数据、过程和工程资源,对工程对象的完整描述,可被建设项目业主、设计方、施工方与使用方等普遍使用。BIM是建设工程各阶段的统一工程数据源,完整集成工程设计、建造及运营各阶段设计的文本、图片、监测等各类数据,并支持各参与方在工作过程中查询、利用、更新和存储信息,辅助工程管理及决策。Building Information Model (BIM) is an engineering data model based on three-dimensional digital technology that integrates various related information of construction projects, and is a digital expression of the physical and functional characteristics of engineering project facilities. A complete building information model can connect data, processes, and engineering resources at different stages of the life of a building project, and a complete description of the engineering object, which can be commonly used by construction project owners, designers, constructors, and users. BIM is a unified engineering data source for all stages of construction projects, which fully integrates various data such as text, pictures, and monitoring designed at each stage of engineering design, construction and operation, and supports all parties to query, utilize, update and store in the process of work Information to assist project management and decision-making.
由于建设工程具有体量庞大、周期长、专业众多等特点,设计、施工及运营各阶段均使用和产生了大量数据,同时也涉及进度、成本、质量等各个专业领域,各阶段、各领域往往采用不同的软件、数据格式,各阶段和各领域相互依赖,因此,建设工程数据具有典型的大数据特征。另外,建设工程各专业业务均依赖其他专业数据,工程管理及运营阶段更涉及多个专业数据的综合分析利用,同时也要求有关工程人员掌握多专业知识,对工程数据的产生、管理及使用提出了很高的要求。Because construction projects have the characteristics of large volume, long period, and many specialties, a large amount of data is used and generated in each stage of design, construction and operation. At the same time, it also involves various professional fields such as schedule, cost, and quality. Each stage and field often Different software and data formats are adopted, and various stages and fields are interdependent. Therefore, construction project data has typical big data characteristics. In addition, each professional business of construction engineering relies on other professional data. The project management and operation stage involves the comprehensive analysis and utilization of multiple professional data. At the same time, it also requires relevant engineering personnel to master multiple professional knowledge, and put forward proposals for the generation, management and use of engineering data. High demands.
BIM包括建筑、结构、机电管线等各专业完整的设计数据,其中既有建筑各组成部分的三维形体,也包括相应形体的特征与参数。为实现项目业主、设计方、施工方及使用方之间直观、快捷的交互,BIM的三维显示或可视化至关重要。现有的BIM软件均依赖三维显示引擎进行三维模型显示与交互,为提升BIM的真实感程度,往往还需要利用模型渲染、烘焙等技术提升效果。与此同时,为结合不同场景需求,往往需要利用桌面电脑、虚拟现实头盔、增强现实及混合现实眼镜等不同硬件终端进行BIM的三维显示和可视化。因此,如何高效、快捷的控制BIM的三维显示与可视化效果至关重要。BIM includes complete design data for various disciplines such as architecture, structure, electromechanical pipeline, etc., including the three-dimensional shape of each component of the building, as well as the characteristics and parameters of the corresponding shape. In order to achieve intuitive and quick interaction between project owners, designers, construction parties and users, the three-dimensional display or visualization of BIM is essential. Existing BIM software all rely on a 3D display engine for 3D model display and interaction. In order to enhance the realism of BIM, it is often necessary to use model rendering, baking and other technologies to improve the effect. At the same time, in order to combine the needs of different scenarios, it is often necessary to use different hardware terminals such as desktop computers, virtual reality helmets, augmented reality and mixed reality glasses to perform the three-dimensional display and visualization of BIM. Therefore, how to efficiently and quickly control the 3D display and visualization effects of BIM is very important.
当前,在桌面电脑、头戴显示器、平板电脑、增强现实及混合现实眼镜等设备中采用的BIM三维显示与可视化控制与交互方式主要有以下四种:Currently, there are mainly four BIM three-dimensional display and visualization control and interaction methods used in desktop computers, head-mounted displays, tablet computers, augmented reality and mixed reality glasses and other devices:
(1)基于鼠标、键盘的三维显示交互控制方式(1) Three-dimensional display interactive control method based on mouse and keyboard
该方式主要用于桌面电脑设备上运行的BIM软件。该BIM软件的三维显示引擎往往通过鼠标点击、拖动与滚轮等操作实现BIM三维显示的旋转、缩放及平移等交互 操作。同时,也可通过键盘命令的方式实现第一人称视角的三维场景漫游。此外,BIM中各专业构件的选择、显示与隐藏可通过鼠标点选、框选相应构件,并基于点击菜单命令或键盘快捷键命令实现相应功能。该方式要求使用人员对BIM软件具有较好了解,并知道采用何种交互方式实现模型的三维显示控制。This method is mainly used for BIM software running on desktop computer equipment. The 3D display engine of the BIM software often implements interactive operations such as rotation, zooming, and translation of the BIM 3D display through operations such as mouse clicks, drags, and scroll wheels. At the same time, the first-person perspective of the three-dimensional scene roaming can also be realized through keyboard commands. In addition, the selection, display and hiding of professional components in BIM can be selected by mouse clicking, frame selection of corresponding components, and corresponding functions can be realized based on clicking menu commands or keyboard shortcut commands. This method requires the user to have a good understanding of BIM software and know which interactive method to use to realize the 3D display control of the model.
(2)基于触摸屏的三维显示交互控制方式(2) Three-dimensional display interactive control method based on touch screen
该方式与方式(1)类似,主要用于具有触摸屏的桌面电脑、平板电脑或手机等设备。此类设备可通过手指单击、双指开合、拖动等交互方式模拟鼠标单击、滚轮等操作,实现BIM中各专业构件的选择、缩放、平移等过程。与此同时,也可通过触摸屏点击菜单命令的形式实现BIM模型的显示、隐藏以及三维场景漫游交互。This method is similar to method (1), and is mainly used for devices such as desktop computers, tablet computers, or mobile phones with touch screens. This type of equipment can simulate mouse clicks, scroll wheels and other operations through interactive methods such as finger click, two-finger opening and closing, and dragging, so as to realize the process of selecting, zooming, and panning various professional components in BIM. At the same time, the display and hiding of the BIM model and the interaction of the three-dimensional scene roaming can also be realized through the touch screen clicking menu commands.
(3)基于手柄或专用输入设备的三维显示交互控制方式(3) Three-dimensional display interactive control method based on handle or dedicated input device
该方式主要用于虚拟现实头盔等头戴显示设备。用户可利用虚拟现实手柄等方式与BIM三维显示场景进行交互。如通过移动手柄实现虚拟场景光标的移动、通过点击手柄按键实现光标点击等功能。从而,可将相应操作等价为方式(1)鼠标移动、点击等操作,从而实现BIM模型的选择、菜单命令的点击等功能,也相应的可以实现BIM模型的选择、显示、隐藏等交互控制。This method is mainly used for head-mounted display devices such as virtual reality helmets. Users can interact with the BIM three-dimensional display scene by means of virtual reality handles and other methods. For example, the cursor can be moved in the virtual scene by moving the handle, and the cursor can be clicked by clicking the handle button. Therefore, the corresponding operation can be equivalent to the way (1) Mouse movement, click and other operations, so as to realize the selection of BIM model, the click of menu commands and other functions, and the corresponding interactive control of BIM model selection, display, and hiding can also be realized. .
(4)基于手势和身体姿态的三维显示交互控制方式(4) Three-dimensional display interactive control method based on gestures and body posture
该方式主要用于增强现实眼睛、混合现实眼睛以及虚拟现实头盔等设备。首先,用户可以通过在空中虚点等操作模拟类似方式(1)中的鼠标点击操作,实现BIM模型选择或菜单命令点击,从而实现BIM模型的显示、隐藏等交互;其次,用户可以通过挥动手掌、手臂实现模型平移或旋转等操作,还可通过双手开合或手掌捏合实现模型的缩放交互;最后,用户还可利用身体姿态的变化实现BIM三维显示的交互,如通过转身、前后移动等实现虚拟场景中第一人称视角的漫游功能。This method is mainly used for devices such as augmented reality eyes, mixed reality eyes, and virtual reality helmets. First, the user can simulate the mouse click operation in the similar way (1) by operating a virtual point in the air to realize BIM model selection or menu command click, so as to realize the interaction of BIM model display and hiding; second, the user can wave the palm of the hand , The arm realizes operations such as translation or rotation of the model, and the zooming interaction of the model can also be realized by opening and closing the hands or pinching the palm; finally, the user can use the change of the body posture to realize the interaction of the BIM three-dimensional display, such as by turning, moving forward and backward The roaming function of the first person perspective in the virtual scene.
随着BIM技术的不断普及,基于BIM的建设工程数据规模日益庞大,BIM的用户也迅猛增长,如何为用户提供更直观、便捷的三维显示交互控制,对用户高效利用BIM数据,基于BIM实现多方协作非常重要。然而,上述现有技术中显示方法仍存在交互方式不友好、命令或操作复杂、菜单选项繁多等问题,具体分析如下:With the continuous popularization of BIM technology, the scale of BIM-based construction engineering data has become increasingly large, and the number of BIM users has also grown rapidly. How to provide users with more intuitive and convenient three-dimensional display interactive control, efficient use of BIM data for users, and multi-party implementation based on BIM Collaboration is very important. However, the display method in the prior art described above still has problems such as unfriendly interaction, complicated commands or operations, and numerous menu options. The specific analysis is as follows:
(1)模型显示、隐藏操作繁琐:如需要显示或隐藏BIM中的部分构件,首先,需要通过鼠标点击/框选、手指点击等方式选中构件,接着还需要通过点击菜单切换命令,并最终点击显示、或隐藏命令实现模型显示、隐藏控制的功能。如果选择的模型构件较多,以及软件菜单命令较多,那么这个过程往往需要多次点击和菜单命令切换才能完成。然而,BIM往往包含多个专业数据,而典型用户场景通常只需要查看各个单专 业或区域的模型,这就意味着常常需要选择很多构件,并对各个构件进行显示、隐藏控制。同时,既有BIM软件主要以设计及施工管理软件为主,其界面与功能均涉及多个专业,功能多、界面布局复杂,有关工程人员需要较长时间才能找到或点击有关菜单命令。因此,对BIM软件用户来说,其模型显示、隐藏控制操作往往比较繁琐,而这个功能有时频繁使用的,因此会占用大量用户时间,严重影响BIM数据的查看与使用效率,制约各参与方协同工作。手势或身体姿态的交互方式往往并不适用于模型的显示、隐藏控制。(1) The model display and hide operation is cumbersome: If you need to show or hide some components in BIM, first, you need to select the components by mouse click/frame selection, finger click, etc., then you need to switch commands by clicking the menu, and finally click The display or hide command realizes the function of model display and hide control. If there are more model components and more software menu commands, this process often requires multiple clicks and menu command switching to complete. However, BIM often contains multiple professional data, and the typical user scenario usually only needs to view the model of each single professional or area, which means that it is often necessary to select many components, and control the display and hiding of each component. At the same time, the existing BIM software is mainly design and construction management software. Its interface and functions involve multiple majors, with multiple functions and complex interface layout. It takes a long time for relevant engineers to find or click on relevant menu commands. Therefore, for BIM software users, the model display and hide control operations are often cumbersome, and this function is sometimes used frequently, so it will take up a lot of user time, seriously affecting the efficiency of viewing and using BIM data, and restricting the collaboration of all participants. jobs. The interactive mode of gesture or body posture is often not suitable for the display and hide control of the model.
(2)视图及视点切换操作复杂:在BIM三维漫游及模型查看的过程中,用户往往会创建或记录不同的模型查看视角和视点,也会创建东立面、南立面、某层平面等不同的视图。但在进行视图切换时,用户往往需要通过点击菜单命令打开相应的视图、视点列表,然后再选择并切换到相应的视图或视点。当视点或视图很多时,用户将耗费大量时间进行数据的查找和搜索。因此,综合考虑菜单命令寻找、点击以及视图、视点查找的过程,也会占用用户较多时间,影响用户的工作与协同效率。采用手势交互的方式则与点击方式类似,而身体姿态则基本不适用于视图、视点切换。(2) View and viewpoint switching operations are complicated: In the process of BIM 3D roaming and model viewing, users often create or record different model viewing angles and viewpoints, and also create east facade, south facade, and a certain level of planes, etc. Different views. However, when switching views, users often need to open the corresponding view or viewpoint list by clicking a menu command, and then select and switch to the corresponding view or viewpoint. When there are many viewpoints or views, users will spend a lot of time searching and searching for data. Therefore, comprehensive consideration of the process of searching and clicking menu commands and searching for views and viewpoints will also take up a lot of users' time and affect the efficiency of users' work and collaboration. The way of using gesture interaction is similar to the way of clicking, and the body posture is basically not suitable for switching between views and viewpoints.
(3)高度依赖手掌及手臂运动:由以上两条可知,当前有关BIM软件和设备提供的交互方式高度依赖鼠标点击、键盘命令或手势等交互方式,这些交互方式既需要反复运动手掌或手臂,用户非常容易疲劳。因此,已有的三维显示交互控制模式难以适应长时间BIM数据查看、使用等工作场景,会给用户身体健康带来较大的不利影响。(3) Highly dependent on palm and arm movement: From the above two, it can be seen that the current interactive methods provided by relevant BIM software and equipment are highly dependent on interactive methods such as mouse clicks, keyboard commands or gestures. These interactive methods require repeated movement of the palm or arm. Users are very prone to fatigue. Therefore, the existing three-dimensional display interactive control mode is difficult to adapt to working scenarios such as long-term BIM data viewing and use, and it will bring greater adverse effects on the health of users.
发明内容Summary of the invention
针对上述问题,本发明的目的是提供一种声控建筑信息模型三维显示系统和方法,其实现了用户语音输入的自动处理转换以及三维显示命令生成、执行与三维显示效果自动更新,有效提高了设计师、管理人员、业主等用户交互式控制BIM数据三维显示的效率,降低了学习成本,并提升了BIM软件使用友好度。In view of the above problems, the purpose of the present invention is to provide a voice-activated building information model three-dimensional display system and method, which realizes the automatic processing and conversion of user voice input, the generation and execution of three-dimensional display commands, and the automatic update of three-dimensional display effects, which effectively improves the design Teachers, managers, owners and other users interactively control the efficiency of the three-dimensional display of BIM data, which reduces learning costs and improves the user-friendliness of BIM software.
本发明公开了一种声控建筑信息模型三维显示系统,包括:语音输入模块、语音识别及处理模块、命令处理模块和三维显示模块;语音输入模块用于采集用户语音,并将用户语音以标准语音数据格式传输给语音识别及处理模块;语音识别及处理模块将语音输入模块采集的用户语音转换成建筑信息模型能够识别的命令,并将命令传输至命令处理模块;命令处理模块对接收的命令进行解析和执行,并将命令的执行结果传输至三维显示模块;三维显示模块根据接收的命令的执行结果,对建筑信息模型的三维显示效果进行更新与实时输出。The invention discloses a three-dimensional display system for a voice-controlled building information model, which includes: a voice input module, a voice recognition and processing module, a command processing module, and a three-dimensional display module; the voice input module is used to collect user voice and convert the user voice to standard voice The data format is transmitted to the voice recognition and processing module; the voice recognition and processing module converts the user's voice collected by the voice input module into commands that can be recognized by the building information model, and transmits the commands to the command processing module; the command processing module performs commands on the received commands Analyze and execute, and transmit the execution result of the command to the 3D display module; the 3D display module updates and outputs the 3D display effect of the building information model in real time according to the execution result of the received command.
进一步,语音识别及处理模块包括三维显示命令数据库,三维显示命令数据库存 储有关建筑信息模型的三维显示命令及命令相应的操作对象。Further, the voice recognition and processing module includes a three-dimensional display command database, and the three-dimensional display command database stores three-dimensional display commands related to the building information model and the corresponding operation objects of the commands.
进一步,语音识别及处理模块包括建筑信息模型元数据库,建筑信息模型元数据库存储建筑信息模型数据的元数据模型,以及建筑信息模型元数据与三维显示命令数据库中三维显示命令的对应关系,元数据模型包括建筑信息模型中包含的数据实体及其类型、属性。Further, the speech recognition and processing module includes a building information model metadata database, a metadata model for storing building information model data in the building information model metadata database, and the correspondence between the metadata of the building information model and the three-dimensional display commands in the three-dimensional display command database. Metadata The model includes the data entities and their types and attributes contained in the building information model.
进一步,语音识别及处理模块通用机器学习或深度学习的语音识别模型,实现用户语音输入向文字的转换,并结合建筑信息模型元数据及三维显示命令数据对语音输入生成的文本进行处理;将处理后的文本进行语法分析,并对生成的语法树进行初步检核,确保用户语音输入生成有效的三维显示控制及交互命令。Furthermore, the voice recognition and processing module is a general machine learning or deep learning voice recognition model, which realizes the conversion of user voice input to text, and combines the metadata of the building information model and the three-dimensional display command data to process the text generated by the voice input; The latter text is grammatically analyzed, and the generated grammar tree is preliminarily checked to ensure that the user's voice input generates effective three-dimensional display control and interactive commands.
进一步,命令处理模块包括命令解析模块和命令执行模块,命令解析模块以语音识别及处理模块的输出命令为基础,根据三维显示命令及其操作对象的相互关系,生成建筑信息模型数据或视图、视点的过滤命令序列,同时生成三维显示命令调用序列,并将过滤命令序列和三维显示命令调用序列构建为建筑信息模型的数据处理及命令调用命令流,并将命令流传输至命令执行模块。Further, the command processing module includes a command parsing module and a command execution module. The command parsing module is based on the output commands of the voice recognition and processing module, and generates building information model data or views and viewpoints based on the relationship between the three-dimensional display commands and their operating objects At the same time, a three-dimensional display command call sequence is generated, and the filter command sequence and the three-dimensional display command call sequence are constructed as the data processing and command call command stream of the building information model, and the command stream is transmitted to the command execution module.
进一步,命令执行模块将执行命令解析模块输出的命令流,获取和查询与命令流相关数据,并将命令流作用于相关数据,将命令执行结果发送至三维显示模块。Further, the command execution module executes the command stream output by the command analysis module, obtains and queries data related to the command stream, applies the command stream to the relevant data, and sends the command execution result to the three-dimensional display module.
本发明还提供了一种声控建筑信息模型三维显示方法,通过上述任一种声控建筑信息模型三维显示系统实现,包括如下步骤:S1实时获取语音输入,并将语音输入转换为标准语音数据格式传输到下一步;S2将语音输入模块采集的用户语音转换成建筑信息模型能够识别的命令,并将命令传输至命令处理模块;S3命令处理模块对接收的命令进行解析和执行,生成建筑信息模型数据或视图、视点的过滤命令序列以及三维显示命令调用序列,构建建筑信息模型的数据处理及命令调用命令流;执行数据处理及命令调用命令流,自动获取和查询与命令流相关数据,并将命令流作用于相关数据,将命令执行结果发送至三维显示模块;S4基于命令执行结果,自动更新三维显示模块的显示结果,实现与用户的动态交互与显示结果呈现。The present invention also provides a voice-activated building information model three-dimensional display method, which is implemented by any of the above-mentioned voice-activated building information model three-dimensional display systems, including the following steps: S1 acquires voice input in real time, and converts the voice input into a standard voice data format for transmission Go to the next step; S2 converts the user's voice collected by the voice input module into commands that can be recognized by the building information model, and transmits the commands to the command processing module; S3 command processing module parses and executes the received commands to generate building information model data Or view, viewpoint filtering command sequence and three-dimensional display command call sequence, construct the data processing and command call command flow of building information model; execute data processing and command call command flow, automatically obtain and query data related to the command flow, and send commands The flow acts on the relevant data and sends the command execution result to the 3D display module; S4 automatically updates the display result of the 3D display module based on the command execution result, realizing dynamic interaction with the user and presentation of the display result.
进一步,其中S2具体包括以下步骤:S2.1采用机器学习或深度学习等算法模块实现用户语音输入向文本的自动转换,生成语音输入相对应的文本输入;S2.2采用统计模型、机器学习模型或深度学习模型对文本进行分词;S2.3采用自然语音处理模型和算法对分词结果进行词性标注;S2.4文本消歧:对停用词进行清理,并将同义词统一为一个标准词;S2.5基于步骤S2.3中词性标注与步骤S2.4中文本消歧结果,采用自然语音处理算法构建用户输入文本的语法树。Further, S2 specifically includes the following steps: S2.1 uses algorithm modules such as machine learning or deep learning to realize the automatic conversion of user voice input to text, and generates text input corresponding to the voice input; S2.2 uses statistical models and machine learning models Or deep learning model to segment the text; S2.3 uses natural speech processing models and algorithms to tag the word segmentation results; S2.4 text disambiguation: clean up stop words and unify synonyms into a standard word; S2 .5 Based on the part-of-speech tagging in step S2.3 and the result of text disambiguation in step S2.4, a natural speech processing algorithm is used to construct a grammar tree of the user input text.
进一步,其中S2还包括:S2.6导入三维显示命令数据库和建筑信息模型元数据库中数据,根据数据库中数据调整步骤S2.2中分词结果,将分词结果及语法树构建结果与读入的数据库中数据进行对比分析,核对并确认语法树各部分均可转换为三维显示控制命令及其操作对象信息;如不能转换为三维显示控制命令则终止语音控制流程,等待下一次用户语音输入。Furthermore, S2 also includes: S2.6 importing the data in the 3D display command database and the building information model metadata database, adjusting the word segmentation result in step S2.2 according to the data in the database, and combining the word segmentation result and the grammar tree construction result with the read database Compare and analyze the data in the middle, check and confirm that all parts of the grammar tree can be converted into 3D display control commands and their operation object information; if they cannot be converted into 3D display control commands, the voice control process will be terminated and the next user voice input will be awaited.
进一步,其中步骤S3具体包括以下步骤:S3.1接收语音输入模块中产生的命令,并导入三维显示命令数据库和建筑信息模型元数据库中数据;S3.2识别并确定需要提取哪些建筑信息模型数据或视图、视点;S3.3根据步骤S3.2中识别结果,生成用户语音命令对应的数据对象过滤命令序列;S3.4识别并确定需要执行的三维显示命令信息;S3.5根据步骤S3.4中三维显示命令识别结果,生成用户语音命令对应的三维显示命令调用序列;S3.6对生成的数据对象过滤命令序列和三维显示命令序列进行排序,确保相关命令和执行顺序,形成数据处理及命令调用命令流。Further, step S3 specifically includes the following steps: S3.1 receives the command generated in the voice input module, and imports the data in the three-dimensional display command database and the building information model metadata database; S3.2 identifies and determines which building information model data needs to be extracted Or view, viewpoint; S3.3 generates a data object filtering command sequence corresponding to the user's voice command according to the recognition result in step S3.2; S3.4 recognizes and determines the three-dimensional display command information that needs to be executed; S3.5 according to step S3. 4, the three-dimensional display command recognition result is generated, and the three-dimensional display command call sequence corresponding to the user's voice command is generated; S3.6 sorts the generated data object filtering command sequence and the three-dimensional display command sequence to ensure related commands and execution order, forming data processing and The command invokes the command flow.
本发明由于采取以上技术方案,其具有以下优点:本文中的系统和方法实现了用户语音输入的自动处理转换以及三维显示命令生成、执行与三维显示效果自动更新,有效提高了设计师、管理人员、业主等用户交互式控制BIM数据三维显示的效率,降低了学习成本,并提升了BIM软件使用友好度。本文中的系统和方法能够支持多种基于语音识别的BIM三维显示交互式控制,可采用不同语音输入终端、不同语音识别算法模块、不同BIM元数据及三维显示命令存储方式,并支持不同BIM软件、三维显示模块以及虚拟现实、增强现实、混合现实环境,为业主、设计师、管理人员等高效、交互式的控制BIM三维显示效果提供了方法。The present invention has the following advantages due to the above technical scheme: The system and method in this article realize the automatic processing and conversion of user voice input, the generation and execution of three-dimensional display commands, and the automatic update of three-dimensional display effects, which effectively improves designers and management personnel. , Owners and other users interactively control the efficiency of the three-dimensional display of BIM data, which reduces learning costs and improves the user-friendliness of BIM software. The system and method in this article can support a variety of BIM three-dimensional display interactive control based on voice recognition, and can use different voice input terminals, different voice recognition algorithm modules, different BIM metadata and three-dimensional display command storage methods, and support different BIM software , 3D display module and virtual reality, augmented reality, and mixed reality environments provide a method for owners, designers, managers, etc. to efficiently and interactively control the BIM 3D display effect.
附图说明Description of the drawings
图1为本发明一实施例中声控建筑信息模型三维显示系统的结构示意图;FIG. 1 is a schematic structural diagram of a three-dimensional display system of a voice-controlled building information model in an embodiment of the present invention;
图2为本发明一实施例中声控建筑信息模型三维显示方法的流程图;FIG. 2 is a flowchart of a three-dimensional display method of a voice-controlled building information model in an embodiment of the present invention;
图3为本发明一实施例中语音识别及处理模块的工作方法流程图;3 is a flowchart of a working method of a speech recognition and processing module in an embodiment of the present invention;
图4为本发明一实施例中命令解析模块的工作方法流程图;4 is a flowchart of a working method of a command parsing module in an embodiment of the present invention;
图5为本发明一实施例中命令执行模块和三维显示模块的工作方法流程图。FIG. 5 is a flowchart of the working method of the command execution module and the three-dimensional display module in an embodiment of the present invention.
具体实施方式Detailed ways
以下结合附图来对本发明进行详细的描绘。然而应当理解,附图的提供仅为了更好地理解本发明,它们不应该理解成对本发明的限制。在本发明的描述中,需要理解的是,术语仅仅是用于描述的目的,而不能理解为指示或暗示相对重要性。Hereinafter, the present invention will be described in detail with reference to the accompanying drawings. However, it should be understood that the drawings are only provided for a better understanding of the present invention, and they should not be construed as limiting the present invention. In the description of the present invention, it should be understood that the terms are only used for descriptive purposes, and cannot be understood as indicating or implying relative importance.
实施例一Example one
本实施例公开了一种声控建筑信息模型三维显示系统,如图1所示,其包括:This embodiment discloses a three-dimensional display system of a voice-controlled building information model, as shown in FIG. 1, which includes:
(1)语音输入模块:该模块主要是桌面电脑、平板电脑、头戴显示器等计算机设备的麦克风等语音输入设备,但也可以是除了麦克风以外的语音输入设备。该语音输入模块可以在不同环境下对用户语音输入进行实时采集,将采集到的语音输入转换为标准语音数据格式,并以标准语音数据格式将语音输入传输给语音识别及处理模块。(1) Voice input module: This module is mainly a voice input device such as a microphone of a computer device such as a desktop computer, a tablet computer, and a head-mounted display, but it can also be a voice input device other than a microphone. The voice input module can collect user voice input in real time in different environments, convert the collected voice input into a standard voice data format, and transmit the voice input to the voice recognition and processing module in the standard voice data format.
(2)语音识别及处理模块:该模块基于通用机器学习或深度学习的语音识别模型,将用户语音输入转换为文本形式,并结合BIM元数据及三维显示命令数据对语音输入生成的文本进行处理。本模块将对处理后的文本进行语法分析,并对生成的语法树进行初步检核,确保用户语音输入生成有效的三维显示控制及交互命令。该模块还可以通过不断学习用户语音习惯进一步优化文本处理和语法分析的结果,使用户无需使用专业术语可以实现BIM操作。(2) Speech recognition and processing module: Based on general machine learning or deep learning speech recognition models, this module converts user voice input into text form, and combines BIM metadata and three-dimensional display command data to process the text generated by voice input . This module will perform grammatical analysis on the processed text and conduct a preliminary check on the generated grammar tree to ensure that the user's voice input generates effective three-dimensional display control and interactive commands. This module can further optimize the results of text processing and grammatical analysis by continuously learning the user's voice habits, so that users can achieve BIM operations without using professional terminology.
(3)命令处理模块:该模块中BIM元数据及三维显示命令数据对接收到的语音识别及处理模块的命令进行解析和执行,生成建筑信息模型数据或视图、视点的过滤命令序列以及三维显示命令调用序列,构建建筑信息模型的数据处理及命令调用命令流;执行数据处理及命令调用命令流,自动获取和查询与命令流相关数据,并将命令流作用于相关数据,将命令执行结果发送至三维显示模块。(3) Command processing module: The BIM metadata and 3D display command data in this module parse and execute the commands received from the voice recognition and processing module, and generate building information model data or views, filtering command sequences of viewpoints, and 3D display Command call sequence, build the data processing and command call command flow of building information model; execute data processing and command call command flow, automatically obtain and query data related to the command flow, and apply the command flow to related data, and send the command execution result To the three-dimensional display module.
(4)三维显示模块:该模块主要是不同计算机设备的显示屏、虚拟现实头盔显示器以及其他投影、显示设备等,该模块将基于命令处理模块的命令执行结果以及显示更新或变换操作对模型的三维显示效果进行更新与实时输出,从而实现与用户的动态交互式三维显示控制。(4) Three-dimensional display module: This module is mainly the display screen of different computer equipment, virtual reality helmet display and other projection and display equipment, etc. This module will be based on the command execution result of the command processing module and the display update or transformation operation on the model. The 3D display effect is updated and output in real time, so as to realize the dynamic interactive 3D display control with the user.
其中,语音识别及处理模块包括三维显示命令数据库和BIM元数据库:三维显示命令数据库采用图数据库或本体库存储,有关BIM软件的三维显示命令及命令相应的操作对象,操作对象包括命令对应的视图名称、视点名称或构件类别等。三维显示命令数据库一方面为语音识别及处理模块提供支持,确保用户语音输入能够转换为相应的三维显示命令或命令的操作对象,另一方面也为后续命令解析和转换为BIM软件的三维显示命令序列提供支持。三维显示命令数据库也可采用关系型数据库、文档数据库、列数据库等其他数据库形式。BIM元数据库主要采用图数据库或本体库的形式,存储BIM软件相应BIM数据的元数据模型,也就是BIM中包含的数据实体及其类型、属性等信息,以及BIM元数据与三维显示命令数据库中三维显示命令的对应关系。BIM元数据库一方面为语音识别及处理模块提供支持,确保用户语音输入正确转换BIM数据对象及其对应的三维显示命令,另一方面也为后续命令解析和转换为BIM软件的三 维显示命令序列奠定基础。BIM元数据库也可采用关系型数据库、文档数据库、列数据库等其他数据库形式。Among them, the voice recognition and processing module includes a three-dimensional display command database and a BIM metadata database: the three-dimensional display command database is stored in a graph database or an ontology database, and the three-dimensional display commands related to BIM software and the corresponding operation objects of the commands. The operation objects include the views corresponding to the commands. Name, viewpoint name, or component category, etc. On the one hand, the 3D display command database provides support for the voice recognition and processing module to ensure that the user’s voice input can be converted into the corresponding 3D display command or the operating object of the command. On the other hand, it also analyzes and converts the subsequent commands to the 3D display commands of the BIM software. Sequence provides support. The three-dimensional display command database can also adopt other database forms such as relational database, document database, column database and so on. The BIM metadata database mainly adopts the form of graph database or ontology database, which stores the metadata model of the corresponding BIM data of BIM software, that is, the data entities contained in BIM and their types, attributes and other information, as well as the BIM metadata and three-dimensional display command database Correspondence of three-dimensional display commands. On the one hand, the BIM metadata database provides support for speech recognition and processing modules to ensure that the user’s voice input correctly converts BIM data objects and their corresponding three-dimensional display commands. On the other hand, it also lays the foundation for subsequent command analysis and conversion into the three-dimensional display command sequence of BIM software. basis. The BIM metadata database can also adopt other database forms such as relational database, document database, column database and so on.
本实施例中,命令处理模块包括命令解析模块和命令执行模块,命令解析模块以语音识别及处理模块的输出命令为基础,结合三维显示命令数据库和BIM元数据库,根据三维显示命令及其操作对象的相互关系,生成建筑信息模型数据或视图、视点的过滤命令序列,同时生成三维显示命令调用序列,并将过滤命令序列和三维显示命令调用序列构建为建筑信息模型的数据处理及命令调用命令流,并将命令流传输至命令执行模块。命令执行模块将执行命令解析模块输出的命令流,获取和查询与命令流相关数据,并将命令流作用于相关数据,将命令执行结果发送至三维显示模块。In this embodiment, the command processing module includes a command parsing module and a command execution module. The command parsing module is based on the output commands of the voice recognition and processing module, combined with the three-dimensional display command database and the BIM metadata database, according to the three-dimensional display commands and their operation objects The relationship between the building information model data or the view and viewpoint filtering command sequence is generated, and the 3D display command calling sequence is generated at the same time, and the filtering command sequence and the 3D display command calling sequence are constructed as the data processing and command calling command flow of the building information model , And transmit the command stream to the command execution module. The command execution module executes the command stream output by the command analysis module, obtains and queries data related to the command stream, applies the command stream to the relevant data, and sends the command execution result to the three-dimensional display module.
实施例二Example two
基于同一发明构思,本实施例公开了一种声控建筑信息模型三维显示方法,如图2-5所示,包括如下步骤:Based on the same inventive concept, this embodiment discloses a three-dimensional display method of a voice-activated building information model, as shown in Fig. 2-5, including the following steps:
S1利用语音输入模块,实时获取语音输入,并将语音输入转换为标准语音数据格式传输到下一步;S1 uses the voice input module to obtain voice input in real time, and convert the voice input into a standard voice data format for transmission to the next step;
S2将语音输入模块采集的用户语音转换成建筑信息模型能够识别的命令,并将命令传输至命令处理模块;利用三维显示命令数据库和BIM元数据库存储的三维显示命令及BIM元数据将语音输入模块输出的标准语音格式数据转换为文本,并进行文本分词、停用词清晰以及同义词消歧等处理,然后构建相应语法树,并对语法树进行校验,以确定是否可以生成有效的命令序列。如果不能生成有效的用户命令序列,则输出给用户的提示,并等待下一次用户语音输入。如果可以生成有效的用户命令序列,则进入用户命令解析步骤。S2 converts the user's voice collected by the voice input module into commands that can be recognized by the building information model, and transmits the commands to the command processing module; uses the three-dimensional display command database and the three-dimensional display commands and BIM metadata stored in the BIM metadata database to input the voice into the module The output standard speech format data is converted into text, and the text segmentation, stop word clarity, and synonym disambiguation are processed, and then the corresponding grammar tree is constructed, and the grammar tree is checked to determine whether a valid command sequence can be generated. If a valid user command sequence cannot be generated, output a prompt to the user and wait for the next user voice input. If a valid user command sequence can be generated, then enter the user command parsing step.
S3命令处理模块对接收的命令进行解析和执行,生成建筑信息模型数据或视图、视点的过滤命令序列以及三维显示命令调用序列,构建建筑信息模型的数据处理及命令调用命令流;执行数据处理及命令调用命令流,自动获取和查询与命令流相关数据,并将命令流作用于相关数据,将命令执行结果发送至三维显示模块;The S3 command processing module parses and executes the received commands, generates the building information model data or view, the filtering command sequence of the viewpoint, and the three-dimensional display command call sequence, constructs the data processing and command call command flow of the building information model; performs data processing and Command calls the command stream, automatically obtains and queries data related to the command stream, applies the command stream to the relevant data, and sends the command execution result to the three-dimensional display module;
S4基于命令执行结果,自动更新三维显示模块的显示结果,实现与用户的动态交互与显示结果呈现;S4 automatically updates the display result of the 3D display module based on the command execution result, realizing dynamic interaction with the user and the presentation of the display result;
S5命令结束,并等待下一次用户语音输入。The S5 command ends and waits for the next user voice input.
数据输入处理与转换过程的目的是将用户语音输入转换为文本并进行后续处理,确保用户输入能够生成有效的三维显示控制命令序列。如图3所示,该步骤主要实现方法如下:The purpose of the data input processing and conversion process is to convert the user's voice input into text and perform subsequent processing to ensure that the user input can generate an effective three-dimensional display control command sequence. As shown in Figure 3, the main implementation methods of this step are as follows:
S2.1语音转文本:采用机器学习或深度学习等算法模块实现用户语音输入向文本的自动转换,生成语音输入相对应的文本输入;S2.1 Speech-to-text: Using algorithm modules such as machine learning or deep learning to realize the automatic conversion of user's voice input to text, and generate text input corresponding to the voice input;
S2.2文本分词:采用统计模型、机器学习模型或深度学习模型对文本进行分词;S2.2 Text segmentation: Use statistical models, machine learning models or deep learning models to segment texts;
S2.3词性标注:采用自然语音处理模型和算法对分词结果进行词性标注,具体包括动词、名词、数量词等;S2.3 Part-of-speech tagging: Use natural speech processing models and algorithms to tag the word segmentation results, including verbs, nouns, quantifiers, etc.;
S2.4文本消歧:基于停用词词典、同义词词典以及其他自然语音处理方法对停用词进行清理,并将同义词统一为标准形式,为后续分析奠定基础;S2.4 Text disambiguation: clean up stop words based on stop word dictionaries, synonym dictionaries and other natural speech processing methods, and unify synonyms into standard forms, laying the foundation for subsequent analysis;
S2.5语法树构建:基于步骤S2.3中词性标注与步骤S2.4中文本消歧结果,采用自然语音处理算法构建用户输入文本的语法树;S2.5 grammar tree construction: based on the results of part-of-speech tagging in step S2.3 and text disambiguation in step S2.4, a natural speech processing algorithm is used to construct a grammar tree of user input text;
S2.6导入三维显示命令数据库和建筑信息模型元数据库中数据,根据数据库中数据调整步骤S2.2中分词结果,将分词结果及语法树构建结果与读入的数据库中数据进行对比分析,核对并确认语法树各部分均可转换为三维显示控制命令及其操作对象信息;如不能转换为三维显示控制命令则终止语音控制流程,等待下一次用户语音输入。S2.6 Import the data in the three-dimensional display command database and the building information model metadata database, adjust the word segmentation result in step S2.2 according to the data in the database, compare the word segmentation result and the grammar tree construction result with the data in the read database, and check it And confirm that all parts of the grammar tree can be converted into 3D display control commands and their operation object information; if they cannot be converted into 3D display control commands, the voice control process will be terminated and the next user voice input will be awaited.
用户命令解析环节的作用是将步骤S2中输出的文本分词、词性标注及语法树结果,结合BIM软件特点生成相应的数据对象提取命令及三维显示控制命令序列。如图4所示,该步骤主要实现方法如下:The function of the user command parsing link is to combine the results of text segmentation, part-of-speech tagging and grammar tree output in step S2 to generate corresponding data object extraction commands and three-dimensional display control command sequences in combination with the characteristics of the BIM software. As shown in Figure 4, the main implementation method of this step is as follows:
S3.1.1支撑数据读入:读入三维显示命令数据库及BIM元数据库的数据,作为后续用户命令解析过程的支撑数据;S3.1.1 Support data reading: Read in the data of the three-dimensional display command database and the BIM metadata database as supporting data for the subsequent user command parsing process;
S3.1.2命令操作对象识别:根据文本分词与词性标注结果,结合读入的BIM元数据及三维显示命令数据,识别并确定需要提取哪些BIM数据或视图、视点数据对象;S3.1.2 Command operation object recognition: According to the results of text segmentation and part-of-speech tagging, combined with the read BIM metadata and three-dimensional display command data, identify and determine which BIM data or view and viewpoint data objects need to be extracted;
S3.1.3生成对象过滤命令序列:根据命令操作对象识别结果,结合BIM软件相应数据对象的数据提取方式,生成用户语音命令对应的数据对象过滤命令序列;S3.1.3 Generate object filtering command sequence: According to the recognition result of the command operation object, combined with the data extraction method of the corresponding data object of the BIM software, generate the data object filtering command sequence corresponding to the user's voice command;
S3.1.4显示命令识别:根据文本分词与词性标注结果,结合三维显示命令数据,识别并确定需要执行的三维显示命令信息;S3.1.4 Display command recognition: According to the results of text segmentation and part-of-speech tagging, combined with three-dimensional display command data, identify and determine the three-dimensional display command information that needs to be executed;
S3.1.5生成显示命令序列:根据三维显示命令识别结果,结合BIM软件相应命令的实现方式和信息,生成用户语音命令对应的三维显示命令序列;S3.1.5 Generate a display command sequence: According to the recognition result of the three-dimensional display command, combined with the implementation method and information of the corresponding command of the BIM software, generate the three-dimensional display command sequence corresponding to the user's voice command;
S3.1.6命令序列排序:根据构建的语法树,结合语法树各节点依赖关系,对生成的数据对象过滤命令序列和三维显示命令序列进行排序,确保相应的命令和顺序执行。S3.1.6 Ordering sequence of commands: According to the constructed syntax tree, combined with the dependency of each node of the syntax tree, sort the generated data object filtering command sequence and three-dimensional display command sequence to ensure the corresponding command and order execution.
用户命令执行与三维显示更新的主要功能是实时更新三维显示模块的显示结果。如图5所示,该步骤的主要实现方法如下:The main function of user command execution and 3D display update is to update the display results of the 3D display module in real time. As shown in Figure 5, the main implementation method of this step is as follows:
S3.2.1执行对象过滤命令:在BIM软件中执行步骤S3.1.3中生成的数据对象过滤 命令,并输出过滤的数据对象列表;S3.2.1 Execute the object filtering command: execute the data object filtering command generated in step S3.1.3 in the BIM software, and output the filtered data object list;
S3.2.2提取命令操作对象:将过滤后的数据对象进行分组、分类处理,形成提取后的命令操作对象;S3.2.2 Extract command operation object: group and classify the filtered data objects to form the extracted command operation object;
S3.2.3执行三维显示命令:针对命令操作对象执行相应的三维显示命令;S3.2.3 Execute 3D display command: execute the corresponding 3D display command for the command operation object;
S3.2.4向三维显示模块发送三维显示更新命令,以触发三维显示模块的效果更新;S3.2.4 Send a 3D display update command to the 3D display module to trigger the effect update of the 3D display module;
S3.2.5更新设备三维显示效果:三维显示模块根据步骤S3.2.4中发送的三维显示更新命令,对显示内容进行刷新,并完成用户语音命令的执行。S3.2.5 Update the 3D display effect of the device: The 3D display module refreshes the display content according to the 3D display update command sent in step S3.2.4, and completes the execution of the user's voice command.
实施例三Example three
下面结合本发明在BIM软件AutodeskRevit以及桌面端电脑上的实际操作方法对The following is a comparison of the actual operation method of the present invention on the BIM software Autodesk Revit and the desktop computer
实施例一、二中的系统和方法进行进一步说明。The systems and methods in the first and second embodiments are further described.
(1)语音输入模块:该模块采用桌面端电脑的麦克风作为语音输入模块,并直接实时读入用户输入的语音命令;(1) Voice input module: This module uses the microphone of the desktop computer as the voice input module, and directly reads the voice commands input by the user in real time;
(2)语音识别及处理模块:该模块及方法首先采用Windows系统内置的语音识别模块实现语音转文本功能,也可采用TensorFlow或pytorch实现的深度学习语音转文本功能;后续文本分词采用python模块jieba实现,停用词、同义词处理采用自编算法实现,语法树分析则采用斯坦福大学的stanford NLP parser实现;(2) Speech recognition and processing module: This module and method first use the built-in speech recognition module of the Windows system to realize the speech-to-text function, or the deep learning speech-to-text function realized by TensorFlow or pytorch; the subsequent text segmentation uses the python module jieba To achieve, stop words and synonyms are processed by self-editing algorithm, and syntax tree analysis is implemented by Stanford NLP parser of Stanford University;
(3)三维显示命令数据库:该模块采用图数据库neo 4j进行三维显示命令存储,主要存储内容包括Revit显示、隐藏构件以及打开、切换视图等命令,同时也包括相应命令对应的数据对象类型、属性等信息;该模块也可采用关系型数据库、列数据库、文档数据库等进行实现;(3) Three-dimensional display command database: This module uses the graph database neo4j to store three-dimensional display commands. The main storage content includes Revit display, hide components, and open and switch views. It also includes the data object type and attributes corresponding to the corresponding command. And other information; this module can also be implemented using relational databases, column databases, document databases, etc.;
(4)BIM元数据库:该同样采用图数据库neo 4j进行Revit内BIM数据相关数据类型、属性及其继承、关联关系等内容,为构建BIM数据对象与用户语音输入生成的文本之间的对应关系提供支撑;该数据库也可采用关系型数据库、列数据库、文档数据库等进行实现;(4) BIM meta-database: The graph database neo 4j is also used to carry out BIM data related data types, attributes and their inheritance, and association relationships in Revit, in order to construct the correspondence between BIM data objects and text generated by user voice input Provide support; the database can also be implemented using relational databases, column databases, document databases, etc.;
(5)用户命令解析模块:该模块及方法采用C#语音进行Revit二次开发实现,并通过相应API与语音输入模块、语音识别及处理模块、三维显示命令数据库、BIM元数据库进行相互调用和数据交换,最终生成Revit视图、视点元素及构件元素的过滤命令和三维显示控制命令序列;(5) User command analysis module: This module and method adopts C# voice to carry out the secondary development of Revit, and through the corresponding API and voice input module, voice recognition and processing module, three-dimensional display command database, BIM metadata database for mutual call and data Exchange, and finally generate filtering commands and 3D display control command sequences for Revit views, viewpoint elements and component elements;
(6)命令执行模块:该模块及方法同样采用C#语音在Revit上二次开发实现,从而实现相应Revit数据的过滤和命令执行,并调用Revit三维显示更新的命令;(6) Command execution module: The module and method are also implemented in the secondary development of Revit using C# voice, so as to realize the filtering and command execution of the corresponding Revit data, and call the command of Revit 3D display update;
(7)三维显示模块:该模块直接由桌面端电脑的显示器与Revit的三维显示界面组 成,当用户调用Revit三维显示更新命令后,可自动刷新显示器三维显示效果,从而实现基于用户语音控制的三维显示交互控制与更新。(7) Three-dimensional display module: This module is directly composed of the display of the desktop computer and the three-dimensional display interface of Revit. When the user calls the Revit three-dimensional display update command, the three-dimensional display effect of the display can be automatically refreshed, so as to realize the three-dimensional display based on the user's voice control. Display interactive control and update.
上述各实施例仅用于说明本发明,其中各部件的结构、连接方式和制作工艺等都是可以有所变化的,凡是在本发明技术方案的基础上进行的等同变换和改进,均不应排除在本发明的保护范围之外。The foregoing embodiments are only used to illustrate the present invention. The structure, connection mode, and manufacturing process of each component can be changed. Any equivalent transformation and improvement based on the technical solution of the present invention should not be used. Excluded from the protection scope of the present invention.

Claims (10)

  1. 一种声控建筑信息模型三维显示系统,其特征在于,包括:语音输入模块、语音识别及处理模块、命令处理模块和三维显示模块;A voice-controlled building information model three-dimensional display system, which is characterized by comprising: a voice input module, a voice recognition and processing module, a command processing module, and a three-dimensional display module;
    所述语音输入模块用于采集用户语音,并将所述用户语音以标准语音数据格式传输给所述语音识别及处理模块;The voice input module is used to collect user voice, and transmit the user voice to the voice recognition and processing module in a standard voice data format;
    所述语音识别及处理模块将所述语音输入模块采集的所述用户语音转换成建筑信息模型能够识别的命令,并将所述命令传输至所述命令处理模块;The voice recognition and processing module converts the user voice collected by the voice input module into a command that can be recognized by the building information model, and transmits the command to the command processing module;
    所述命令处理模块对接收的所述命令进行解析和执行,并将所述命令的执行结果传输至所述三维显示模块;The command processing module parses and executes the received command, and transmits the execution result of the command to the three-dimensional display module;
    所述三维显示模块根据接收的所述命令的执行结果,对所述建筑信息模型的三维显示效果进行更新与实时输出。The three-dimensional display module updates and outputs the three-dimensional display effect of the building information model in real time according to the execution result of the received command.
  2. 根据权利要求1所述的声控建筑信息模型三维显示系统,其特征在于,所述语音识别及处理模块包括三维显示命令数据库,所述三维显示命令数据库存储有关建筑信息模型的三维显示命令及命令相应的操作对象。The voice-controlled building information model three-dimensional display system according to claim 1, wherein the voice recognition and processing module includes a three-dimensional display command database, and the three-dimensional display command database stores three-dimensional display commands and command responses related to the building information model. The object of operation.
  3. 根据权利要求2所述的声控建筑信息模型三维显示系统,其特征在于,所述语音识别及处理模块包括建筑信息模型元数据库,所述建筑信息模型元数据库存储建筑信息模型数据的元数据模型,以及建筑信息模型元数据与所述三维显示命令数据库中三维显示命令的对应关系,所述元数据模型包括建筑信息模型中包含的数据实体及其类型、属性。The voice-controlled building information model three-dimensional display system according to claim 2, wherein the voice recognition and processing module includes a building information model metadata database, and the building information model metadata database stores metadata models of building information model data, And the correspondence between the metadata of the building information model and the three-dimensional display command in the three-dimensional display command database, the metadata model including the data entities contained in the building information model and their types and attributes.
  4. 根据权利要求3所述的声控建筑信息模型三维显示系统,其特征在于,所述语音识别及处理模块通用机器学习或深度学习的语音识别模型,实现用户语音输入向文字的转换,并结合建筑信息模型元数据及三维显示命令数据对语音输入生成的文本进行处理;将处理后的文本进行语法分析,并对生成的语法树进行初步检核,确保用户语音输入生成有效的三维显示控制及交互命令。The voice-controlled building information model three-dimensional display system according to claim 3, wherein the voice recognition and processing module is a general-purpose machine learning or deep learning voice recognition model, which realizes the conversion of user voice input to text, and combines building information Model metadata and three-dimensional display command data process the text generated by voice input; perform grammatical analysis on the processed text, and perform a preliminary check on the generated syntax tree to ensure that the user’s voice input generates effective three-dimensional display control and interactive commands .
  5. 根据权利要求2-4任一项所述的声控建筑信息模型三维显示系统,其特征在于,所述命令处理模块包括命令解析模块和命令执行模块,所述命令解析模块以所述语音识别及处理模块的输出命令为基础,根据三维显示命令及其操作对象的相互关系,生成建筑信息模型数据或视图、视点的过滤命令序列,同时生成三维显示命令调用序列,并将所述过滤命令序列和所述三维显示命令调用序列构建为建筑信息模型的数据处理及命令调用命令流,并将所述命令流传输至所述命令执行模块。The voice-controlled building information model three-dimensional display system according to any one of claims 2-4, wherein the command processing module includes a command parsing module and a command execution module, and the command parsing module uses the voice recognition and processing Based on the output commands of the module, according to the relationship between the three-dimensional display commands and their operating objects, a sequence of filtering commands for building information model data or views and viewpoints is generated, and a sequence of three-dimensional display commands is generated at the same time. The three-dimensional display command call sequence is constructed as a data processing and command call command stream of the building information model, and the command stream is transmitted to the command execution module.
  6. 根据权利要求5所述的声控建筑信息模型三维显示系统,其特征在于,所述命令执行模块将执行所述命令解析模块输出的所述命令流,获取和查询与所述命令流相关数据,并将所述命令流作用于所述相关数据,将命令执行结果发送至所述三维显示模块。The voice-controlled building information model three-dimensional display system according to claim 5, wherein the command execution module executes the command stream output by the command analysis module, obtains and queries data related to the command stream, and The command stream is applied to the related data, and the command execution result is sent to the three-dimensional display module.
  7. 一种声控建筑信息模型三维显示方法,其特征在于,通过如权利要求1-6任一项所述声控建筑信息模型三维显示系统实现,包括如下步骤:A method for three-dimensional display of a voice-activated building information model, characterized in that it is implemented by the three-dimensional display system of a voice-activated building information model according to any one of claims 1-6, and comprises the following steps:
    S1实时获取语音输入,并将所述语音输入转换为标准语音数据格式传输到下一步;S1 obtains voice input in real time, converts the voice input into a standard voice data format and transmits it to the next step;
    S2将所述语音输入模块采集的所述用户语音转换成建筑信息模型能够识别的命令,并将所述命令传输至所述命令处理模块;S2 converts the user voice collected by the voice input module into a command that can be recognized by the building information model, and transmits the command to the command processing module;
    S3所述命令处理模块对接收的所述命令进行解析和执行,生成建筑信息模型数据或视图、视点的过滤命令序列以及三维显示命令调用序列,构建建筑信息模型的数据处理及命令调用命令流;执行所述数据处理及命令调用命令流,自动获取和查询与所述命令流相关数据,并将所述命令流作用于所述相关数据,将命令执行结果发送至所述三维显示模块;S3 The command processing module parses and executes the received commands, generates building information model data or views, a filtering command sequence of viewpoints, and a three-dimensional display command calling sequence, and constructs a building information model data processing and command calling command flow; Executing the data processing and command invoking a command stream, automatically acquiring and querying data related to the command stream, applying the command stream to the relevant data, and sending the command execution result to the three-dimensional display module;
    S4基于所述命令执行结果,自动更新所述三维显示模块的显示结果,实现与用户的动态交互与显示结果呈现。S4 automatically updates the display result of the three-dimensional display module based on the command execution result, so as to realize dynamic interaction with the user and presentation of the display result.
  8. 根据权利要求7所述的声控建筑信息模型三维显示方法,其特征在于,其中所述S2具体包括以下步骤:The 3D display method of a voice-controlled building information model according to claim 7, wherein the S2 specifically includes the following steps:
    S2.1采用机器学习或深度学习等算法模块实现用户语音输入向文本的自动转换,生成语音输入相对应的文本输入;S2.1 uses algorithm modules such as machine learning or deep learning to realize the automatic conversion of user voice input to text, and generate text input corresponding to the voice input;
    S2.2采用统计模型、机器学习模型或深度学习模型对文本进行分词;S2.2 Use statistical models, machine learning models or deep learning models to segment the text;
    S2.3采用自然语音处理模型和算法对分词结果进行词性标注;S2.3 Use natural speech processing models and algorithms to tag the word segmentation results;
    S2.4文本消歧:对停用词进行清理,并将同义词统一为一个标准词;S2.4 Text disambiguation: clean up stop words and unify synonyms into a standard word;
    S2.5基于步骤S2.3中词性标注与步骤S2.4中文本消歧结果,采用自然语音处理算法构建用户输入文本的语法树。In S2.5, based on the part-of-speech tagging in step S2.3 and the text disambiguation result in step S2.4, a natural speech processing algorithm is used to construct a grammar tree of the user input text.
    S2.6导入三维显示命令数据库和建筑信息模型元数据库中数据,根据所述数据库中数据调整所述步骤S2.2中分词结果,将所述分词结果及所述语法树构建结果与读入的所述数据库中数据进行对比分析,核对并确认所述语法树各部分均可转换为三维显示控制命令及其操作对象信息;如不能转换为三维显示控制命令则终止语音控制流程,等待下一次用户语音输入。S2.6 Import the data in the three-dimensional display command database and the building information model metadata database, adjust the word segmentation result in step S2.2 according to the data in the database, and compare the word segmentation result and the grammar tree construction result with the read in Perform comparative analysis on the data in the database, check and confirm that each part of the syntax tree can be converted into three-dimensional display control commands and their operation object information; if they cannot be converted into three-dimensional display control commands, terminate the voice control process and wait for the next user Voice input.
  9. 根据权利要求7或8所述的声控建筑信息模型三维显示方法,其特征在于,其 中所述步骤S3中解析过程具体包括以下步骤:The three-dimensional display method of a voice-activated building information model according to claim 7 or 8, wherein the analysis process in step S3 specifically includes the following steps:
    S3.1.1接收所述语音输入模块中产生的命令,并导入三维显示命令数据库和建筑信息模型元数据库中数据;S3.1.1 receives the command generated in the voice input module, and imports the data in the three-dimensional display command database and the building information model metadata database;
    S3.1.2识别并确定需要提取哪些建筑信息模型数据或视图、视点;S3.1.2 Identify and determine which building information model data or views and viewpoints need to be extracted;
    S3.1.3根据所述步骤S3.2中识别结果,生成用户语音命令对应的数据对象过滤命令序列;S3.1.3 According to the recognition result in step S3.2, generate a data object filtering command sequence corresponding to the user's voice command;
    S3.1.4识别并确定需要执行的三维显示命令信息;S3.1.4 Identify and determine the three-dimensional display command information that needs to be executed;
    S3.1.5根据所述步骤S3.4中三维显示命令识别结果,生成用户语音命令对应的三维显示命令调用序列;S3.1.5 Generate a three-dimensional display command call sequence corresponding to the user's voice command according to the recognition result of the three-dimensional display command in step S3.4;
    S3.1.6对生成的所述数据对象过滤命令序列和所述三维显示命令序列进行排序,确保相关命令和执行顺序,形成所述数据处理及命令调用命令流。S3.1.6 Sort the generated data object filtering command sequence and the three-dimensional display command sequence to ensure related commands and execution sequence to form the data processing and command invocation command flow.
  10. 根据权利要求9所述的声控建筑信息模型三维显示方法,其特征在于,其中所述步骤S3中执行过程具体包括以下步骤:The three-dimensional display method of a sound-controlled building information model according to claim 9, wherein the execution process in step S3 specifically includes the following steps:
    S3.2.1执行对象过滤命令:在BIM软件中执行步骤S3.1.3中生成的数据对象过滤命令,并输出过滤的数据对象列表;S3.2.1 Execute the object filtering command: execute the data object filtering command generated in step S3.1.3 in the BIM software, and output the filtered data object list;
    S3.2.2提取命令操作对象:将过滤后的数据对象进行分组、分类处理,形成提取后的命令操作对象;S3.2.2 Extract command operation object: group and classify the filtered data objects to form the extracted command operation object;
    S3.2.3执行三维显示命令:针对所述命令操作对象执行相应的三维显示命令;S3.2.3 Execute 3D display command: execute the corresponding 3D display command for the command operation object;
    S3.2.4向三维显示模块发送三维显示更新命令,以触发三维显示模块的效果更新;S3.2.4 Send a 3D display update command to the 3D display module to trigger the effect update of the 3D display module;
    S3.2.5更新设备三维显示效果:三维显示模块根据步骤S3.2.4中发送的三维显示更新命令,对显示内容进行刷新,并完成用户语音命令的执行。S3.2.5 Update the 3D display effect of the device: The 3D display module refreshes the display content according to the 3D display update command sent in step S3.2.4, and completes the execution of the user's voice command.
PCT/CN2020/076321 2019-12-11 2020-02-24 Three-dimensional display system and method for sound control building information model WO2021114479A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201911265714.4 2019-12-11
CN201911265714.4A CN110889161B (en) 2019-12-11 2019-12-11 Three-dimensional display system and method for sound control building information model

Publications (1)

Publication Number Publication Date
WO2021114479A1 true WO2021114479A1 (en) 2021-06-17

Family

ID=69751437

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/076321 WO2021114479A1 (en) 2019-12-11 2020-02-24 Three-dimensional display system and method for sound control building information model

Country Status (2)

Country Link
CN (1) CN110889161B (en)
WO (1) WO2021114479A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113867773A (en) * 2021-08-18 2021-12-31 华建数创(上海)科技有限公司 Real-time updating mechanism of building model data for illusion engine
CN115454432A (en) * 2022-11-09 2022-12-09 中建八局第三建设有限公司 Examination development method and system based on BIM platform
CN116051729A (en) * 2022-12-15 2023-05-02 北京百度网讯科技有限公司 Three-dimensional content generation method and device and electronic equipment
CN116303697A (en) * 2023-05-18 2023-06-23 深圳鹏锐信息技术股份有限公司 Model display system based on artificial intelligence
CN117290379A (en) * 2023-11-27 2023-12-26 华南理工大学 System for converting natural language into SQL and Revit data interaction
CN118193760A (en) * 2024-03-11 2024-06-14 北京鸿鹄云图科技股份有限公司 Drawing voice annotation method and system based on natural language understanding

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111415412B (en) * 2020-03-18 2023-08-04 北京山维科技股份有限公司 System and method for collecting and editing stereo map
CN112215933B (en) * 2020-10-19 2024-04-30 南京大学 Three-dimensional solid geometry drawing system based on pen type interaction and voice interaction
CN115357959B (en) * 2022-10-20 2023-06-30 广东时谛智能科技有限公司 Shoe model design method and device based on voice instruction
CN116843107B (en) * 2023-09-04 2023-11-10 如是(北京)工程设计有限公司 Building information intelligent management system based on BIM technology

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN205068298U (en) * 2015-11-04 2016-03-02 山东女子学院 Interaction system is wandered to three -dimensional scene
CN107801413A (en) * 2016-06-28 2018-03-13 华为技术有限公司 The terminal and its processing method being controlled to electronic equipment
CN108648599A (en) * 2018-05-08 2018-10-12 同济大学 A kind of mechanical arm 3D pseudo operation experiment porch based on voice control

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7949529B2 (en) * 2005-08-29 2011-05-24 Voicebox Technologies, Inc. Mobile systems and methods of supporting natural language human-machine interactions
US11137659B2 (en) * 2009-12-22 2021-10-05 View, Inc. Automated commissioning of controllers in a window network
US9263032B2 (en) * 2013-10-24 2016-02-16 Honeywell International Inc. Voice-responsive building management system
CN106648057A (en) * 2016-10-09 2017-05-10 大道网络(上海)股份有限公司 Information showing method and system based on virtual reality technology
CN109564706B (en) * 2016-12-01 2023-03-10 英特吉姆股份有限公司 User interaction platform based on intelligent interactive augmented reality
CN106875494A (en) * 2017-03-22 2017-06-20 朱海涛 Model VR operating methods based on image and positioning
CN110164218A (en) * 2018-01-26 2019-08-23 郑州升达经贸管理学院 A kind of Project Management System platform based on BIM network technology
CN108388711A (en) * 2018-02-07 2018-08-10 深圳群伦项目管理有限公司 A kind of engineering design based on BIM and management system
CN110134724A (en) * 2019-05-15 2019-08-16 清华大学 A kind of the data intelligence extraction and display system and method for Building Information Model

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN205068298U (en) * 2015-11-04 2016-03-02 山东女子学院 Interaction system is wandered to three -dimensional scene
CN107801413A (en) * 2016-06-28 2018-03-13 华为技术有限公司 The terminal and its processing method being controlled to electronic equipment
CN108648599A (en) * 2018-05-08 2018-10-12 同济大学 A kind of mechanical arm 3D pseudo operation experiment porch based on voice control

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
1 April 2018, article WANG, MEILING: "Research on Multi-interaction System of Virtual Prototype Based on Augmented Reality", pages: 1 - 60, XP055820273 *
BLINN, NATHAN ET AL. ,: "Visualizing Construction Projects Using Holography,", CONSTRUCTION RESEARCH CONGRESS 2018, 27 June 2018 (2018-06-27) *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113867773A (en) * 2021-08-18 2021-12-31 华建数创(上海)科技有限公司 Real-time updating mechanism of building model data for illusion engine
CN115454432A (en) * 2022-11-09 2022-12-09 中建八局第三建设有限公司 Examination development method and system based on BIM platform
CN116051729A (en) * 2022-12-15 2023-05-02 北京百度网讯科技有限公司 Three-dimensional content generation method and device and electronic equipment
CN116051729B (en) * 2022-12-15 2024-02-13 北京百度网讯科技有限公司 Three-dimensional content generation method and device and electronic equipment
CN116303697A (en) * 2023-05-18 2023-06-23 深圳鹏锐信息技术股份有限公司 Model display system based on artificial intelligence
CN116303697B (en) * 2023-05-18 2023-08-08 深圳鹏锐信息技术股份有限公司 Model display system based on artificial intelligence
CN117290379A (en) * 2023-11-27 2023-12-26 华南理工大学 System for converting natural language into SQL and Revit data interaction
CN117290379B (en) * 2023-11-27 2024-03-15 华南理工大学 System for converting natural language into SQL and Revit data interaction
CN118193760A (en) * 2024-03-11 2024-06-14 北京鸿鹄云图科技股份有限公司 Drawing voice annotation method and system based on natural language understanding

Also Published As

Publication number Publication date
CN110889161A (en) 2020-03-17
CN110889161B (en) 2022-02-18

Similar Documents

Publication Publication Date Title
WO2021114479A1 (en) Three-dimensional display system and method for sound control building information model
Nizam et al. A review of multimodal interaction technique in augmented reality environment
Srinivasan et al. Orko: Facilitating multimodal interaction for visual exploration and analysis of networks
Sharma et al. Speech-gesture driven multimodal interfaces for crisis management
CN104461525B (en) A kind of intelligent consulting platform generation system that can customize
JP5706137B2 (en) Method and computer program for displaying a plurality of posts (groups of data) on a computer screen in real time along a plurality of axes
US20180314882A1 (en) Sorting and displaying digital notes on a digital whiteboard
JP2021120863A (en) Method and device for generating information, electronic apparatus, storage medium, and computer program
CN104090652A (en) Voice input method and device
US11817096B2 (en) Issue tracking system having a voice interface system for facilitating a live meeting directing status updates and modifying issue records
US20080147725A1 (en) Object Verification Enabled Network (OVEN)
Duke Reasoning about gestural interaction
US11017015B2 (en) System for creating interactive media and method of operating the same
JP2022050309A (en) Information processing method, device, system, electronic device, storage medium, and computer program
Li et al. Research on voice interaction technology in vr environment
US20120256926A1 (en) Gesture, text, and shape recognition based data visualization
CN117540028A (en) Platform for intelligent interaction of science popularization knowledge
US12067970B2 (en) Method and apparatus for mining feature information, and electronic device
CN115756161A (en) Multi-modal interactive structure mechanics analysis method, system, computer equipment and medium
US9870063B2 (en) Multimodal interaction using a state machine and hand gestures discrete values
CN115203162A (en) WYSIWYG graph data construction method
Chen Immersive analytics interaction: User preferences and agreements by task type
Masunaga et al. Design and implementation of a multi-modal user interface of the virtual world database system (VWDB)
Wasfy et al. An interrogative visualization environment for large-scale engineering simulations
Bottoni et al. Multilevel modelling and design of visual interactive systems

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20897974

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20897974

Country of ref document: EP

Kind code of ref document: A1