WO2021208372A1 - Indoor visual navigation method, apparatus, and system, and electronic device - Google Patents

Indoor visual navigation method, apparatus, and system, and electronic device Download PDF

Info

Publication number
WO2021208372A1
WO2021208372A1 PCT/CN2020/119479 CN2020119479W WO2021208372A1 WO 2021208372 A1 WO2021208372 A1 WO 2021208372A1 CN 2020119479 W CN2020119479 W CN 2020119479W WO 2021208372 A1 WO2021208372 A1 WO 2021208372A1
Authority
WO
WIPO (PCT)
Prior art keywords
indoor
mobile device
coordinate system
image
camera pose
Prior art date
Application number
PCT/CN2020/119479
Other languages
French (fr)
Chinese (zh)
Inventor
王金戈
谢航
庹东成
陈南
李正权
刘诗文
刘骁
Original Assignee
北京迈格威科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京迈格威科技有限公司 filed Critical 北京迈格威科技有限公司
Priority to JP2022566506A priority Critical patent/JP2023509099A/en
Publication of WO2021208372A1 publication Critical patent/WO2021208372A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T19/00Manipulating 3D models or images for computer graphics
    • G06T19/003Navigation within 3D models or images
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01CMEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
    • G01C21/00Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/20Instruments for performing navigational calculations
    • G01C21/206Instruments for performing navigational calculations specially adapted for indoor navigation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • G06Q10/047Optimisation of routes or paths, e.g. travelling salesman problem
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T19/00Manipulating 3D models or images for computer graphics
    • G06T19/006Mixed reality
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30244Camera pose

Definitions

  • the present disclosure relates to the field of image processing technology, and in particular to an indoor visual navigation method, device, system and electronic equipment.
  • the purpose of the present disclosure is to provide an indoor visual navigation method, device, system and electronic equipment, which can provide users with indoor navigation services and guide users to reach their destinations conveniently.
  • the embodiments of the present disclosure provide an indoor visual navigation method, the method is executed by a mobile device, and the method includes: if an indoor image to be positioned is collected, uploading the indoor image to a server, so that The server determines the camera pose when the mobile device collects the indoor image; receives the camera pose corresponding to the indoor image returned by the server, and establishes a world coordinate based on the camera pose corresponding to the indoor image
  • the AR coordinate system is aligned; the shortest route is planned in the pre-imported indoor topology map based on the destination information set by the user; the current preview image collected by the mobile device is displayed on the interface of the mobile device, and based on all
  • the AR coordinate system and the shortest route are superimposed on the current preview image with a three-dimensional mark indicating the direction of the route.
  • the step of establishing an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image includes: establishing an initial AR coordinate system; The camera pose adjusts the initial AR coordinate system so that the AR coordinate system is aligned with the world coordinate system.
  • the step of planning the shortest route in the pre-imported indoor topology map based on the destination information set by the user includes: using the destination information set by the user, using a path planning algorithm in advance Plan the shortest route in the imported indoor topology map.
  • the step of superimposing a three-dimensional mark indicating the direction of the route on the current preview image includes: detecting a ground plane on the current preview image; determining the shortest route The three-dimensional coordinates in the AR coordinate system are used to generate a three-dimensional mark indicating the direction of the route based on the determined three-dimensional coordinates; the three-dimensional mark is drawn on the ground plane of the current preview image.
  • the method further includes: if the current camera pose issued by the server is received during the navigation process, correcting the AR coordinate system based on the current camera pose to Keep the corrected AR coordinate system aligned with the world coordinate system.
  • the embodiments of the present disclosure also provide an indoor visual navigation method, the method is executed by a server, and the method includes: if an indoor image to be positioned uploaded by a mobile device is received, determining that the mobile device collects the The camera pose of the indoor image; the camera pose corresponding to the indoor image is sent to the mobile device, so that the mobile device establishes an AR aligned with the world coordinate system based on the camera pose corresponding to the indoor image Coordinate system, and based on the AR coordinate system and the shortest route, superimpose a three-dimensional mark indicating the direction of the route on the current preview image collected by the mobile device; wherein, the shortest route is the mobile device based on user settings.
  • the predetermined destination information is planned in the indoor topological map.
  • the step of determining the camera pose when the mobile device collects the indoor image includes: feature matching the indoor image with a visual map in a pre-established visual map library , Obtain the camera pose when the mobile device collects the indoor image; wherein the visual map is characterized by a sparse point cloud model of the indoor scene.
  • the step of performing feature matching of the indoor image with a visual map in a pre-established visual map library to obtain the camera pose when the mobile device collects the indoor image includes : Calculate the global image descriptor of the indoor image using a deep hash algorithm; search for multiple key frame images similar to the global image descriptor in the visual map library; obtain the key of each key frame image Frame information; divide the multiple key frame images into multiple clusters according to the key frame information; traverse each of the clusters to obtain the local feature points of the indoor image; calculate the local descriptors and compare them with all Matching the local feature points in the cluster; obtain the 3D map points corresponding to the successfully matched local feature points; if the number of the 3D map points is greater than the preset number, then the camera pose corresponding to the indoor image is obtained.
  • the process of establishing the visual map library includes: acquiring multiple scene images collected by a mobile device in an indoor scene; performing three-dimensional reconstruction on the multiple scene images based on the SFM algorithm to obtain all Describes a visual map library of sparse point cloud models corresponding to multiple scene images.
  • the method further includes: aligning the visual map with a pre-imported indoor plan distribution map.
  • the method further includes: compressing the visual map in the visual map library.
  • the step of compressing the visual map in the visual map library includes: encoding the original feature in the visual map; saving the encoded original feature, and clearing the code The original feature before.
  • the method further includes: periodically acquiring the current preview image collected by the mobile device during the navigation process, and determining the current camera pose when the mobile device collects the current preview image; Send the current camera pose to the mobile device, so that the mobile device corrects the AR coordinate system based on the current camera pose.
  • an embodiment of the present disclosure provides an indoor visual navigation device, the device is set on the side of a mobile device, and the device includes: an image upload module configured to collect an indoor image to be positioned, The image is uploaded to the server so that the server determines the camera pose when the mobile device collects the indoor image; the coordinate system establishment module is configured to receive the camera pose corresponding to the indoor image returned by the server, and The AR coordinate system aligned with the world coordinate system is established based on the camera pose corresponding to the indoor image; the route planning module is configured to plan the shortest route in the pre-imported indoor topology map based on the destination information set by the user; the navigation display module , Configured to display the current preview image collected by the mobile device on the interface of the mobile device, and based on the AR coordinate system and the shortest route, superimpose a three-dimensional mark indicating the direction of the route on the On the current preview image.
  • an image upload module configured to collect an indoor image to be positioned, The image is uploaded to the server so that the server determines the camera pose when
  • an embodiment of the present disclosure provides an indoor visual navigation device, the device is set on the server side, and the device includes: a pose determination module configured to receive an indoor image to be positioned uploaded by a mobile device, Determine the camera pose when the mobile device collects the indoor image; the device navigation module is configured to send the camera pose corresponding to the indoor image to the mobile device, so that the mobile device is based on the indoor image.
  • the camera pose corresponding to the image establishes an AR coordinate system aligned with the world coordinate system, and based on the AR coordinate system and the shortest route, superimposes a three-dimensional mark indicating the direction of the route on the current preview image collected by the mobile device;
  • the shortest route is obtained by planning the mobile device in the indoor topology map according to the destination information set by the user.
  • an embodiment of the present disclosure provides an indoor visual navigation system, the system includes a mobile device and a server that are communicatively connected; wherein the mobile device is configured to execute the method according to any one of the first aspect, The server is configured to perform the method according to any one of the second aspects.
  • an embodiment of the present disclosure provides an electronic device, including: a processor and a storage device; the storage device stores a computer program, and the computer program is executed when being run by the processor as in the first aspect The method of any one, or the method of any one of the second aspect.
  • the embodiments of the present disclosure provide a computer-readable storage medium having a computer program stored on the computer-readable storage medium. The steps of the method or the steps of the method described in any one of the second aspects above.
  • the embodiments of the present disclosure provide an indoor visual navigation method, device, system, and electronic equipment.
  • the collected indoor image to be positioned is uploaded to a server through the mobile device, and the server determines the camera pose when the mobile device collects the indoor image. And send the camera pose corresponding to the indoor image to the mobile device, and then the mobile device establishes an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image, and imports it in advance based on the destination information set by the user.
  • the shortest route is planned in the indoor topology map, and finally the current preview image collected by the mobile device is displayed on the interface.
  • a three-dimensional mark indicating the direction of the route is superimposed on the current preview image to realize the indoor Visual navigation.
  • the above method provided in this embodiment can guide the user to follow the shortest route to the destination in the AR mode indoors, which better improves the user experience.
  • FIG. 1 shows a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure
  • Fig. 2 shows a flowchart of an indoor visual navigation method provided by an embodiment of the present disclosure
  • Fig. 3 shows a flowchart of another indoor visual navigation method provided by an embodiment of the present disclosure
  • FIG. 4 shows a flowchart of another indoor visual navigation method provided by an embodiment of the present disclosure
  • FIG. 5 shows a structural block diagram of an indoor visual navigation device provided by an embodiment of the present disclosure
  • Fig. 6 shows a structural block diagram of another indoor visual navigation device provided by an embodiment of the present disclosure.
  • the embodiments of the present disclosure provide an indoor visual navigation method, device, system, and electronic equipment.
  • the technology can be applied to any occasion where indoor navigation is required.
  • the disclosed embodiments are described in detail.
  • FIG. 1 is a schematic structural diagram of an electronic device.
  • the electronic device 100 includes one or more processors 102, one or more storage devices 104, an input device 106, an output device 108, and an image acquisition device 110. These components pass through The bus system 112 and/or other forms of connection mechanisms (not shown) are interconnected. It should be noted that the components and structure of the electronic device 100 shown in FIG. 1 are only exemplary and not restrictive, and the electronic device may also have other components and structures as required.
  • the processor 102 may be implemented in a hardware form of at least one of a digital signal processor (DSP), a field programmable gate array (FPGA), and a programmable logic array (PLA), and the processor 102 may be a central processing unit.
  • DSP digital signal processor
  • FPGA field programmable gate array
  • PLA programmable logic array
  • a unit CPU
  • GPU graphics processing unit
  • processing units with data processing capabilities and/or instruction execution capabilities, or a combination of several, and can control other components in the electronic device 100 To perform the desired function.
  • the storage device 104 may include one or more computer program products, and the computer program products may include various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory.
  • the volatile memory may include random access memory (RAM) and/or cache memory (cache), for example.
  • the non-volatile memory may include, for example, read-only memory (ROM), hard disk, flash memory, and the like.
  • One or more computer program instructions may be stored on the computer-readable storage medium, and the processor 102 may run the program instructions to implement the client functions (implemented by the processor) in the embodiments of the present disclosure described below. And/or other desired functions.
  • Various application programs and various data such as various data used and/or generated by the application program, can also be stored in the computer-readable storage medium.
  • the input device 106 may be a device used by a user to input instructions, and may include one or more of a keyboard, a mouse, a microphone, and a touch screen.
  • the output device 108 may output various information (for example, images or sounds) to the outside (for example, a user), and may include one or more of a display, a speaker, and the like.
  • the image capture device 110 may capture images (for example, photos, videos, etc.) desired by the user, and store the captured images in the storage device 104 for use by other components.
  • the example electronic equipment used to implement the indoor visual navigation method, apparatus, system, and electronic equipment according to the embodiments of the present disclosure may be implemented as smart terminals such as smart phones, tablets, wearable electronic equipment, computers, servers, etc. .
  • the embodiments of the present disclosure can provide an indoor visual navigation method on the side of a mobile device.
  • the method can be executed by a mobile device such as a mobile phone, a tablet computer, a wearable electronic device, etc., see the flowchart of the indoor visual navigation method shown in FIG. 2 ,
  • the method mainly includes the following steps S202 to S208:
  • Step S202 If an indoor image to be located is collected, the indoor image is uploaded to the server so that the server can determine the camera pose when the mobile device collects the indoor image.
  • the user When the user is in an indoor scene, he may not know his current location, let alone how to get to the destination from his current location, so he can take a picture first with his mobile phone. Zhang indoor images in the current scene, and then upload the indoor images to the server, through the server to perform visual positioning based on the indoor images.
  • Step S204 Receive the camera pose corresponding to the indoor image returned by the server, and establish an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image.
  • AR Augmented Reality
  • AR navigation is adopted. Since virtual graphics need to be displayed in the real scene, it is necessary to establish an AR coordinate system and adjust the AR coordinate system based on the camera pose corresponding to the indoor image. Align the AR coordinate system with the world coordinate system. Among them, the establishment of the AR coordinate system can be realized by using the existing SLAM algorithm (Simultaneous Localization and Mapping, real-time positioning and map construction), which will not be repeated here.
  • SLAM Simultaneous Localization and Mapping, real-time positioning and map construction
  • Step S206 Plan the shortest route in the pre-imported indoor topology map based on the destination information set by the user.
  • the user can enter the name of the destination in the mobile APP, or directly perform the operation of tapping the destination on the indoor floor plan presented by the mobile APP, which is not limited here.
  • the mobile phone After the mobile phone obtains the destination information set by the user, it can plan the route in the pre-imported indoor topology map.
  • each store can be regarded as a node in the map, and indoor paths can be regarded as an edge path in the map.
  • areas such as shopping malls, libraries, museums and other areas that need to provide indoor navigation services can provide the server with an indoor topology map in advance, and the server can also directly convert it from the indoor floor plan.
  • step S208 the current preview image collected by the mobile device is displayed on the interface of the mobile device, and based on the AR coordinate system and the shortest route, a three-dimensional mark indicating the direction of the route is superimposed on the current preview image.
  • the camera of the mobile device starts to be in a shooting state and continues to collect images, and displays the collected images on the screen of the mobile terminal. It can also be said that the camera of the mobile device is in an image preview mode.
  • the mobile phone camera is in preview mode, and the user can see through the mobile phone interface that the current three-dimensional mark indicating the direction of the route is superimposed and marked in the indoor scene (such as marking the arrow on the ground).
  • the mobile device can guide the user to the destination according to the shortest route in the AR mode indoors, which better improves the user experience.
  • An AR system can be installed in the mobile phone.
  • the AR system is usually a Visual-Inertial Odometry (VIO) that includes loop detection.
  • VIO Visual-Inertial Odometry
  • the AR coordinate system can also be referred to as the VIO coordinate system.
  • the AR system can be the AR Kit that comes with the iOS system, or the AR Core that comes with the Android, or any third-party system that can implement the navigation function of the mobile device, which is not limited here.
  • the AR coordinate system established at the initial stage usually takes the first frame image as the coordinate origin.
  • the camera pose adjusts the initial AR coordinate system so that the mobile phone can smoothly convert the planned path to the world coordinate system, so that the virtual graphics and the real scene image are better combined.
  • the embodiment of the present disclosure provides a specific implementation manner of planning the shortest route in a pre-imported indoor topology map based on the destination information set by the user: based on the destination information set by the user, using The path planning algorithm plans the shortest route in the pre-imported indoor topology map.
  • the path planning algorithm can be an A* algorithm, of course, it can also be other path planning algorithms, which is not limited here.
  • this embodiment can refer to the following steps when performing the above step S208: (1) Detect the ground plane on the current preview image; (2) Determine the shortest route in the AR coordinate system Three-dimensional coordinates, and based on the determined three-dimensional coordinates, a three-dimensional mark is generated for indicating the direction of the route; (3) the three-dimensional mark is drawn on the ground plane of the current preview image.
  • the three-dimensional mark may be a virtual arrow, a dotted path mark, and the like. The user can walk in the direction indicated by the three-dimensional logo, and finally reach the destination with the shortest path.
  • the mobile device in this embodiment receives the current camera pose issued by the server during the navigation process, it will correct the AR coordinate system based on the current camera pose to make the corrected AR coordinate
  • the system remains aligned with the world coordinate system. That is, the server will periodically obtain the current preview image collected by the mobile device during the navigation process, and determine the current camera pose when the mobile device collects the current preview image, and send the current camera pose to the mobile device, and then the mobile device can Correct the AR coordinate system based on the current camera pose to improve navigation accuracy.
  • This embodiment can also provide an indoor visual navigation method on the server side.
  • the method can be executed by, for example, a cloud server.
  • the method mainly includes the following steps S302 to S304 :
  • Step S302 If the indoor image to be located uploaded by the mobile device is received, the camera pose when the mobile device collects the indoor image is determined.
  • the camera pose may include XY coordinates and camera orientation.
  • step S304 the camera pose corresponding to the indoor image is sent to the mobile device, so that the mobile device establishes an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image, and will use the AR coordinate system and the shortest route based on the AR coordinate system and the shortest route.
  • the three-dimensional mark indicating the direction of the route is superimposed on the current preview image collected by the mobile device; among them, the shortest route is planned by the mobile device in the indoor topology map based on the destination information set by the user.
  • the server can undertake the large calculation steps required in the navigation process, such as camera pose calculation, and then use the mobile device to quickly and conveniently instruct the user to follow the instructions in the room.
  • the shortest route to the destination improves the user experience.
  • the more time-consuming and space-consuming calculation process can be executed by the server side, such as the server determining the camera pose when the mobile device collects indoor images.
  • the server can set the indoor The image is feature-matched with the visual map in the pre-established visual map library to obtain the camera pose when the mobile device collects indoor images; among them, the visual map is represented by the sparse point cloud model of the indoor scene, which can be understood as a large number of visual features of the scene .
  • the PNP (Perspective-n-Point) problem can be solved according to the 3D-2D matching relationship, that is, the 3D to 2D point pair movement method is solved, so as to solve the camera pose according to the feature point pair movement .
  • This embodiment provides a possible implementation of matching indoor images with visual maps in a pre-established visual map library to obtain the camera pose when the mobile device collects indoor images, mainly including coarse positioning and fine positioning.
  • Two links of positioning are introduced as follows:
  • the deep hash algorithm is used to calculate the global image descriptor of the indoor image, the k most similar key frame images are searched in the visual map library, and the key frame ID is obtained, and then the stored key frame ID is searched according to the key frame ID.
  • Key frame information including the pose of the key frame, local feature points, local descriptors and the coordinates of the corresponding map points in the world coordinate system.
  • the k key frames are clustered according to the position and pose of the key frames, and the key frames with similar positions are grouped into one category. For each cluster, its cluster center provides preliminary coarse positioning results for subsequent fine positioning.
  • the fine positioning process it is first necessary to traverse each cluster, extract local feature points from indoor images, calculate local descriptors and match them with the local features of all key frames in the cluster, and then extract the 3D corresponding to the successfully matched feature points.
  • the PNP problem can be solved to obtain the camera pose corresponding to the indoor image.
  • the pose obtained by the PNP can be used as the initial value to further construct the Bundle Adjustment graph optimization problem, so that the pose corresponding to the indoor image can be optimized to minimize the reprojection error.
  • the edges whose reprojection error is still large are removed, and the Bundle Adjustment graph optimization problem is constructed again with the remaining edges, and finally a more accurate pose corresponding to the indoor image is obtained. If the number of 3D-2D point pairs is too small or the reprojection error is too large after optimization, the key frame in the current cluster is considered to be a mismatch, and the cluster is abandoned. If the reprojection error is small after optimization, it is considered that the pose solution is correct, and the result is output directly, without entering the next clustering cycle.
  • the server can obtain a more accurate camera pose corresponding to the indoor image after the above-mentioned two-step positioning from coarse to fine.
  • the above is only one method for determining the pose provided by this embodiment, and any other method for determining the pose of the camera can also be used, and it is not limited here.
  • the server will also align the visual map with the pre-imported indoor plan distribution map, so that the mobile device used for indoor navigation can smoothly convert the planned path to the world coordinate system.
  • the indoor floor plan can be understood as a building structure drawing, and the indoor floor plan can be uploaded to the server in advance by, for example, a shopping mall.
  • the server may construct a visual map library in advance, and the process of establishing the visual map library includes the following steps: (1) Obtain multiple scene images collected by the mobile device in the indoor scene. In a possible implementation manner, a large number of images of various indoor scenes can be collected in advance, so as to construct a more accurate sparse point cloud model. (2) Based on the SFM (Structure From Motion) algorithm, three-dimensional reconstruction of multiple scene images is performed, and a visual map library containing sparse point cloud models corresponding to multiple scene images is obtained.
  • the SFM mapping process can be implemented using open source algorithms such as COLMAP, Theia, VisualSfM, OpenMVG, etc., and there is no restriction here.
  • the visual map in the visual map library can also be compressed.
  • the original features in the visual map can be encoded by the method of product quantization, and the visual map Only the encoded result is saved in, and the original features are not saved, thus greatly compressing the map size.
  • the server uses the visual database for visual positioning, the encoded visual features can be decoded, and the decoded features can be used for matching and pose estimation.
  • the server can undertake the construction of a large amount of calculation of the visual map library and the determination of the camera pose, thereby reducing the hardware requirements of the mobile device, and also enabling the mobile device to provide users with navigation based on the calculation results of the server more quickly service.
  • the server can also periodically obtain the mobile device The current preview image collected during the navigation process, and determine the current camera pose when the mobile device collects the current preview image; send the current camera pose to the mobile device, so that the mobile device can perform the AR coordinate system based on the current camera pose Fix.
  • the embodiments of the present disclosure also provide a possible implementation of an indoor visual navigation method based on standing on the mobile device side and standing on the server side.
  • an indoor visual navigation method based on standing on the mobile device side and standing on the server side.
  • Step S410 Collect multiple indoor images
  • Step S412 Perform SFM mapping based on the indoor image to generate a visual map database.
  • Step S414 align the visual map in the visual map database with the indoor plan distribution map.
  • the indoor plane distribution map can also be called the architectural structure map.
  • Step S420 receiving the indoor image to be located uploaded by the mobile phone
  • Step S422 Extract the image features of the indoor image to be located, match the extracted image features with the visual map, and estimate the camera pose when the mobile phone collects the indoor image;
  • Step S424 Return the estimated camera pose to the mobile phone
  • Step S430 Establish an AR coordinate system
  • Step S432 align the AR coordinate system with the camera pose when the mobile phone collects indoor images
  • Step S434 Receive destination information set by the user, and plan the shortest route in the indoor topology map;
  • Step S436 Detect the ground plane
  • Step S438 Convert the shortest route into three-dimensional coordinates in the AR coordinate system, and draw the route on the ground plane with arrows.
  • the specific implementation operations of the above steps can refer to the content of the indoor visual navigation method standing on the mobile device side and the indoor visual navigation method standing on the server side, which will not be repeated here.
  • steps S410 to S414 can be collectively referred to as a visual map construction operation
  • steps S420 to S424 can be collectively referred to as a cloud visual positioning operation.
  • Both the visual map construction operation and the cloud visual positioning operation can be performed by the server.
  • Steps S430 to S438 can be collectively referred to as AR navigation operations on the mobile terminal can be performed by mobile devices such as mobile phones.
  • the aforementioned indoor visual navigation method allows users to use their mobile phone to determine their location by photographing the surrounding environment anytime and anywhere. After selecting a destination, they can see the best route planned by the mobile phone through the path selection algorithm through the mobile phone screen. Walk along the route to reach the destination. Moreover, the aforementioned indoor visual navigation method puts a relatively time-consuming and space-consuming calculation process on the server side, so that users can realize real-time positioning and navigation on mobile devices.
  • the embodiment of the present disclosure also provides an indoor visual navigation device arranged on the side of the mobile device.
  • an indoor visual navigation device arranged on the side of the mobile device. Refer to the structural block diagram of the indoor visual navigation device shown in FIG. 5, which includes the following modules:
  • the image upload module 502 is configured to upload the indoor image to the server if the indoor image to be located is collected, so that the server can determine the camera pose when the mobile device collects the indoor image;
  • the coordinate system establishment module 504 is configured to receive the camera pose corresponding to the indoor image returned by the server, and establish an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image;
  • the route planning module 506 is configured to plan the shortest route in the pre-imported indoor topology map based on the destination information set by the user;
  • the navigation display module 508 is configured to display the current preview image collected by the mobile device on the interface of the mobile device, and based on the AR coordinate system and the shortest route, superimpose a three-dimensional mark indicating the direction of the route on the current preview image.
  • the mobile device can guide the user to the destination according to the shortest route in the AR mode indoors, which better improves the user experience.
  • the coordinate system establishing module 504 is configured to establish an initial AR coordinate system; adjust the initial AR coordinate system based on the camera pose corresponding to the indoor image, so that the AR coordinate system is aligned with the world coordinate system.
  • the route planning module 506 is configured to use a route planning algorithm to plan the shortest route in the pre-imported indoor topology map based on the destination information.
  • the navigation display module 508 is configured to detect the ground plane on the current preview image; determine the three-dimensional coordinates of the shortest route in the AR coordinate system, and generate the three-dimensional coordinates configured to indicate the direction of the route based on the determined three-dimensional coordinates.
  • Three-dimensional logo draw the three-dimensional logo on the ground plane of the current preview image.
  • the above-mentioned device further includes a coordinate system correction module configured to correct the AR coordinate system based on the current camera pose if the current camera pose issued by the server is received during the navigation process, so that The revised AR coordinate system remains aligned with the world coordinate system.
  • a coordinate system correction module configured to correct the AR coordinate system based on the current camera pose if the current camera pose issued by the server is received during the navigation process, so that The revised AR coordinate system remains aligned with the world coordinate system.
  • the embodiment of the present disclosure also provides an indoor visual navigation device arranged on the server side.
  • an indoor visual navigation device arranged on the server side. Refer to the structural block diagram of the indoor visual navigation device shown in FIG. 6, which includes the following modules:
  • the pose determination module 602 is configured to determine the camera pose when the mobile device collects the indoor image if an indoor image to be positioned uploaded by the mobile device is received;
  • the device navigation module 604 is configured to send the camera pose corresponding to the indoor image to the mobile device, so that the mobile device establishes an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image, and based on the AR coordinate system and
  • the shortest route superimposes a three-dimensional mark used to indicate the direction of the route on the current preview image collected by the mobile device; among them, the shortest route is planned by the mobile device in the indoor topology map according to the destination information set by the user.
  • the server can undertake the large calculation steps required in the navigation process, such as the calculation of camera pose, and then use the mobile device to quickly and conveniently instruct the user to follow the instructions in the room.
  • the shortest route to the destination improves the user experience.
  • the pose determination module 602 is configured to perform feature matching of the indoor image with the visual map in the pre-established visual map library to obtain the camera pose when the mobile device collects the indoor image; wherein, the visual map passes the indoor image. Sparse point cloud model representation of the scene.
  • the above-mentioned device further includes a map building module configured to obtain multiple scene images collected by the mobile device in an indoor scene; and perform three-dimensional reconstruction on the multiple scene images based on the SFM algorithm to obtain a corresponding map containing multiple scene images.
  • a visual map library of the sparse point cloud model configured to obtain multiple scene images collected by the mobile device in an indoor scene; and perform three-dimensional reconstruction on the multiple scene images based on the SFM algorithm to obtain a corresponding map containing multiple scene images.
  • the above-mentioned device further includes: an alignment module configured to align the visual map with a pre-imported indoor plan distribution map.
  • the above-mentioned device further includes: a current pose determination module configured to periodically obtain the current preview image collected by the mobile device during the navigation process, and determine the current camera pose when the mobile device collects the current preview image; The module is configured to send the current camera pose to the mobile device, so that the mobile device corrects the AR coordinate system based on the current camera pose.
  • a current pose determination module configured to periodically obtain the current preview image collected by the mobile device during the navigation process, and determine the current camera pose when the mobile device collects the current preview image
  • the module is configured to send the current camera pose to the mobile device, so that the mobile device corrects the AR coordinate system based on the current camera pose.
  • the embodiment of the present disclosure also provides an indoor visual navigation system, which includes a mobile device and a server.
  • the embodiments of the present disclosure also provide an electronic device, including: a processor and a storage device; the storage device stores a computer program, and the computer program executes an indoor visual navigation method or a station on the side of a mobile device when the computer program is run by the processor. Indoor visual navigation method on the server side.
  • the embodiment of the present disclosure also provides a computer-readable storage medium, and a computer program is stored on the computer-readable storage medium.
  • Indoor visual navigation method a computer-readable storage medium
  • the computer program product of the indoor visual navigation method, device, system, and electronic equipment provided by the embodiments of the present disclosure includes a computer-readable storage medium storing program code, and the instructions included in the program code can be used to execute the previous method embodiments
  • the instructions included in the program code can be used to execute the previous method embodiments
  • the terms “installed”, “connected”, and “connected” should be interpreted broadly, for example, they may be fixed connections or detachable connections. , Or integrally connected; it can be a mechanical connection or an electrical connection; it can be directly connected or indirectly connected through an intermediate medium, and it can be the internal communication between two components.
  • installed should be interpreted broadly, for example, they may be fixed connections or detachable connections. , Or integrally connected; it can be a mechanical connection or an electrical connection; it can be directly connected or indirectly connected through an intermediate medium, and it can be the internal communication between two components.
  • the function is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium.
  • the technical solution of the present disclosure essentially or the part that contributes to the prior art or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in the various implementations of the present disclosure.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disks or optical disks and other media that can store program codes. .
  • the embodiments of the present disclosure provide an indoor visual navigation method, device, system, and electronic equipment.
  • the collected indoor image to be positioned is uploaded to a server through the mobile device, and the server determines the camera pose when the mobile device collects the indoor image. And send the camera pose corresponding to the indoor image to the mobile device, and then the mobile device establishes an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image, and imports it in advance based on the destination information set by the user.
  • the shortest route is planned in the indoor topology map, and finally the current preview image collected by the mobile device is displayed on the interface, and based on the AR coordinate system and the shortest route, a three-dimensional mark configured to indicate the direction of the route is superimposed on the current preview image to achieve indoor Visual navigation.
  • a three-dimensional mark configured to indicate the direction of the route is superimposed on the current preview image to achieve indoor Visual navigation.

Abstract

Provided are an indoor visual navigation method, apparatus, and system, and an electronic device. The method comprises: upload a collected indoor image to be positioned to a server by means of a mobile device; determine a camera pose by means of the server when the mobile device collects the indoor image, and send the camera pose corresponding to the indoor image to the mobile device; then the mobile device establishes an AR coordinate system aligned with a world coordinate system according to the camera pose corresponding to the indoor image, and plans a shortest route in a pre-imported indoor topological map according to destination information set by a user; and finally display, on an interface, the current preview image collected by the mobile device, and according to the AR coordinate system and the shortest route, superimpose, on the current preview image, a three-dimensional identifier for indicating a route travel direction. The present application can provide indoor navigation service for the user, and guide the user to conveniently arrive at a destination.

Description

室内视觉导航方法、装置、系统及电子设备Indoor visual navigation method, device, system and electronic equipment
相关申请的交叉引用Cross-references to related applications
本公开要求于2020年04月14日提交中国专利局的申请号为202010292954X、名称为“室内视觉导航方法、装置、系统及电子设备”的中国专利申请的优先权,其全部内容通过引用结合在本公开中。This disclosure claims the priority of a Chinese patent application filed with the Chinese Patent Office on April 14, 2020, with the application number 202010292954X, titled "Indoor visual navigation method, device, system and electronic equipment", the entire content of which is incorporated by reference In this disclosure.
技术领域Technical field
本公开涉及图像处理技术领域,尤其是涉及一种室内视觉导航方法、装置、系统及电子设备。The present disclosure relates to the field of image processing technology, and in particular to an indoor visual navigation method, device, system and electronic equipment.
背景技术Background technique
电子地图导航已成为人们在外出行时主要依靠的寻路方式,然而,现有的导航技术主要是结合GPS技术进行室外导航,当用户位于诸如商场等室内时,只能依靠商场入口处提供的平面图获知所需店铺(目的地)的位置,但在用户前往目的地的过程中,随着自身位置的变化,往往无法清楚地获知自己从当前位置到达目的地的导航路线,因此通常达到目的地需要较多的时间和精力寻路。Electronic map navigation has become the way people mainly rely on wayfinding when traveling. However, the existing navigation technology mainly combines GPS technology for outdoor navigation. When users are indoors such as shopping malls, they can only rely on the floor plan provided at the entrance of the shopping mall. Know the location of the store (destination) you need, but when the user goes to the destination, as the user's location changes, it is often impossible to clearly know the navigation route from the current location to the destination, so it is usually necessary to reach the destination More time and energy to find the way.
发明内容Summary of the invention
有鉴于此,本公开的目的在于提供一种室内视觉导航方法、装置、系统及电子设备,能够为用户提供室内导航服务,指导用户便捷地到达目的地。In view of this, the purpose of the present disclosure is to provide an indoor visual navigation method, device, system and electronic equipment, which can provide users with indoor navigation services and guide users to reach their destinations conveniently.
为了实现上述目的,本公开实施例采用的技术方案如下:In order to achieve the foregoing objectives, the technical solutions adopted in the embodiments of the present disclosure are as follows:
第一方面,本公开实施例提供了一种室内视觉导航方法,所述方法由移动设备执行,所述方法包括:如果采集到待定位的室内图像,将所述室内图像上传至服务器,以使所述服务器确定所述移动设备采集所述室内图像时的相机位姿;接收所述服务器返回的所述室内图像对应的相机位姿,并基于所述室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系;基于用户设定的目的地信息在预先导入的室内拓扑地图中规划最短路线;将所述移动设备采集的当前预览图像展示在所述移动设备的界面上,并基于所述AR坐标系和所述最短路线,将用于指示路线行径方向的三维标识叠加在所述当前预览图像上。In the first aspect, the embodiments of the present disclosure provide an indoor visual navigation method, the method is executed by a mobile device, and the method includes: if an indoor image to be positioned is collected, uploading the indoor image to a server, so that The server determines the camera pose when the mobile device collects the indoor image; receives the camera pose corresponding to the indoor image returned by the server, and establishes a world coordinate based on the camera pose corresponding to the indoor image The AR coordinate system is aligned; the shortest route is planned in the pre-imported indoor topology map based on the destination information set by the user; the current preview image collected by the mobile device is displayed on the interface of the mobile device, and based on all The AR coordinate system and the shortest route are superimposed on the current preview image with a three-dimensional mark indicating the direction of the route.
在一种可能的实现方式中,所述基于所述室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系的步骤,包括:建立初始的AR坐标系;基于所述室内图像对应的相机位姿调整所述初始的AR坐标系,以使所述AR坐标系与世界坐标系对齐。In a possible implementation, the step of establishing an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image includes: establishing an initial AR coordinate system; The camera pose adjusts the initial AR coordinate system so that the AR coordinate system is aligned with the world coordinate system.
在一种可能的实现方式中,所述基于用户设定的目的地信息在预先导入的室内拓扑地图中规划最短路线的步骤,包括:基于用户设定的目的地信息,利用路径规划算法在预先导入的室内拓扑地图中规划最短路线。In a possible implementation manner, the step of planning the shortest route in the pre-imported indoor topology map based on the destination information set by the user includes: using the destination information set by the user, using a path planning algorithm in advance Plan the shortest route in the imported indoor topology map.
在一种可能的实现方式中,所述将用于指示路线行径方向的三维标识叠加在所述当前预览图像上的步骤,包括:检测所述当前预览图像上的地平面;确定所述最短路线在所述AR坐标系中的三维坐标,并基于确定的所述三维坐标生成用于指示路线行径方向的三维标识;将所述三维标识绘制在所述当前预览图像上的地平面。In a possible implementation manner, the step of superimposing a three-dimensional mark indicating the direction of the route on the current preview image includes: detecting a ground plane on the current preview image; determining the shortest route The three-dimensional coordinates in the AR coordinate system are used to generate a three-dimensional mark indicating the direction of the route based on the determined three-dimensional coordinates; the three-dimensional mark is drawn on the ground plane of the current preview image.
在一种可能的实现方式中,所述方法还包括:在导航过程中如果接收到所述服务器下发的当前相机位姿,基于所述当前相机位姿对所述AR坐标系进行修正,以使修正后的所述AR坐标系与所述世界坐标系保持对齐。In a possible implementation, the method further includes: if the current camera pose issued by the server is received during the navigation process, correcting the AR coordinate system based on the current camera pose to Keep the corrected AR coordinate system aligned with the world coordinate system.
第二方面,本公开实施例还提供一种室内视觉导航方法,所述方法由服务器执行,所述方法包括:如果接收到移动设备上传的待定位的室内图像,确定所述移动设备采集所述室内图像时的相机位姿;将所述室内图像对应的相机位姿下发给所述移动设备,以使所述移动设备基于所述室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系,并基于所述AR坐标系和最短路线将用于指示路线行径方向的三维标识叠加在所述移动设备采集的当前预览图像上;其中,所述最短路线是所述移动设备基于用户设定的目的地信息在室内拓扑地图中规划得到的。In a second aspect, the embodiments of the present disclosure also provide an indoor visual navigation method, the method is executed by a server, and the method includes: if an indoor image to be positioned uploaded by a mobile device is received, determining that the mobile device collects the The camera pose of the indoor image; the camera pose corresponding to the indoor image is sent to the mobile device, so that the mobile device establishes an AR aligned with the world coordinate system based on the camera pose corresponding to the indoor image Coordinate system, and based on the AR coordinate system and the shortest route, superimpose a three-dimensional mark indicating the direction of the route on the current preview image collected by the mobile device; wherein, the shortest route is the mobile device based on user settings. The predetermined destination information is planned in the indoor topological map.
在一种可能的实现方式中,所述确定所述移动设备采集所述室内图像时的相机位姿的步骤,包括:将所述室内图像与预先建立的视觉地图库中的视觉地图进行特征匹配,得到所述移动设备采集所述室内图像时的相机位姿;其中,所述视觉地图通过室内场景的稀疏点云模型表征。In a possible implementation, the step of determining the camera pose when the mobile device collects the indoor image includes: feature matching the indoor image with a visual map in a pre-established visual map library , Obtain the camera pose when the mobile device collects the indoor image; wherein the visual map is characterized by a sparse point cloud model of the indoor scene.
在一种可能的实现方式中,所述将所述室内图像与预先建立的视觉地图库中的视觉地图进行特征匹配,得到所述移动设备采集所述室内图像时的相机位姿的步骤,包括:利用深度哈希算法计算所述室内图像的全局图像描述符;在所述视觉地图库中查找与所述全局图像描述符相似的多个关键帧图像;获取每个所述关键帧图像的关键帧信息;依据所述关键帧信息将所述多个关键帧图像分为多个聚类;遍历每个所述聚类,获得所述室内图像的局部特征点;计算局部描述子,并与所述聚类中的局部特征点匹配;得到匹配成功的局部特征点对应的3D地图点;若所述3D地图点个数大于预设个数,则得到所述室内图像对应的相机位姿。In a possible implementation manner, the step of performing feature matching of the indoor image with a visual map in a pre-established visual map library to obtain the camera pose when the mobile device collects the indoor image includes : Calculate the global image descriptor of the indoor image using a deep hash algorithm; search for multiple key frame images similar to the global image descriptor in the visual map library; obtain the key of each key frame image Frame information; divide the multiple key frame images into multiple clusters according to the key frame information; traverse each of the clusters to obtain the local feature points of the indoor image; calculate the local descriptors and compare them with all Matching the local feature points in the cluster; obtain the 3D map points corresponding to the successfully matched local feature points; if the number of the 3D map points is greater than the preset number, then the camera pose corresponding to the indoor image is obtained.
在一种可能的实现方式中,所述视觉地图库的建立过程包括:获取移动设备在室内场景采集得到的多张场景图像;基于SFM算法对所述多张场景图像进行三维重建,得到包含所述多张场景图像对应的稀疏点云模型的视觉地图库。In a possible implementation, the process of establishing the visual map library includes: acquiring multiple scene images collected by a mobile device in an indoor scene; performing three-dimensional reconstruction on the multiple scene images based on the SFM algorithm to obtain all Describes a visual map library of sparse point cloud models corresponding to multiple scene images.
在一种可能的实现方式中,所述方法还包括:将所述视觉地图与预先导入的室内平面分布图对齐。In a possible implementation manner, the method further includes: aligning the visual map with a pre-imported indoor plan distribution map.
在一种可能的实现方式中,所述方法还包括:将所述视觉地图库中的视觉地图进行 压缩。In a possible implementation manner, the method further includes: compressing the visual map in the visual map library.
在一种可能的实现方式中,所述将所述视觉地图库中的视觉地图进行压缩的步骤,包括:对所述视觉地图中的原始特征进行编码;保存编码后的原始特征,并清除编码前的原始特征。In a possible implementation manner, the step of compressing the visual map in the visual map library includes: encoding the original feature in the visual map; saving the encoded original feature, and clearing the code The original feature before.
在一种可能的实现方式中,所述方法还包括:定时获取所述移动设备在导航过程中采集的当前预览图像,并确定所述移动设备采集所述当前预览图像时的当前相机位姿;将所述当前相机位姿下发给所述移动设备,以使所述移动设备基于所述当前相机位姿对所述AR坐标系进行修正。In a possible implementation, the method further includes: periodically acquiring the current preview image collected by the mobile device during the navigation process, and determining the current camera pose when the mobile device collects the current preview image; Send the current camera pose to the mobile device, so that the mobile device corrects the AR coordinate system based on the current camera pose.
第三方面,本公开实施例提供了一种室内视觉导航装置,所述装置设置在移动设备侧,所述装置包括:图像上传模块,配置成如果采集到待定位的室内图像,将所述室内图像上传至服务器,以使所述服务器确定所述移动设备采集所述室内图像时的相机位姿;坐标系建立模块,配置成接收所述服务器返回的所述室内图像对应的相机位姿,并基于所述室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系;路线规划模块,配置成基于用户设定的目的地信息在预先导入的室内拓扑地图中规划最短路线;导航展示模块,配置成将所述移动设备采集的当前预览图像展示在所述移动设备的界面上,并基于所述AR坐标系和所述最短路线,将用于指示路线行径方向的三维标识叠加在所述当前预览图像上。In a third aspect, an embodiment of the present disclosure provides an indoor visual navigation device, the device is set on the side of a mobile device, and the device includes: an image upload module configured to collect an indoor image to be positioned, The image is uploaded to the server so that the server determines the camera pose when the mobile device collects the indoor image; the coordinate system establishment module is configured to receive the camera pose corresponding to the indoor image returned by the server, and The AR coordinate system aligned with the world coordinate system is established based on the camera pose corresponding to the indoor image; the route planning module is configured to plan the shortest route in the pre-imported indoor topology map based on the destination information set by the user; the navigation display module , Configured to display the current preview image collected by the mobile device on the interface of the mobile device, and based on the AR coordinate system and the shortest route, superimpose a three-dimensional mark indicating the direction of the route on the On the current preview image.
第四方面,本公开实施例提供了一种室内视觉导航装置,所述装置设置在服务器侧,所述装置包括:位姿确定模块,配置成如果接收到移动设备上传的待定位的室内图像,确定所述移动设备采集所述室内图像时的相机位姿;设备导航模块,配置成将所述室内图像对应的相机位姿下发给所述移动设备,以使所述移动设备基于所述室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系,并基于所述AR坐标系和最短路线将用于指示路线行径方向的三维标识叠加在所述移动设备采集的当前预览图像上;其中,所述最短路线是所述移动设备根据用户设定的目的地信息在室内拓扑地图中规划得到的。In a fourth aspect, an embodiment of the present disclosure provides an indoor visual navigation device, the device is set on the server side, and the device includes: a pose determination module configured to receive an indoor image to be positioned uploaded by a mobile device, Determine the camera pose when the mobile device collects the indoor image; the device navigation module is configured to send the camera pose corresponding to the indoor image to the mobile device, so that the mobile device is based on the indoor image. The camera pose corresponding to the image establishes an AR coordinate system aligned with the world coordinate system, and based on the AR coordinate system and the shortest route, superimposes a three-dimensional mark indicating the direction of the route on the current preview image collected by the mobile device; Wherein, the shortest route is obtained by planning the mobile device in the indoor topology map according to the destination information set by the user.
第五方面,本公开实施例提供了一种室内视觉导航系统,所述系统包括通信连接的移动设备和服务器;其中,所述移动设备配置成执行如第一方面任一项所述的方法,所述服务器配置成执行如第二方面任一项所述的方法。In a fifth aspect, an embodiment of the present disclosure provides an indoor visual navigation system, the system includes a mobile device and a server that are communicatively connected; wherein the mobile device is configured to execute the method according to any one of the first aspect, The server is configured to perform the method according to any one of the second aspects.
第六方面,本公开实施例提供了一种电子设备,包括:处理器和存储装置;所述存储装置上存储有计算机程序,所述计算机程序在被所述处理器运行时执行如第一方面任一项所述的方法,或如第二方面任一项所述的方法。In a sixth aspect, an embodiment of the present disclosure provides an electronic device, including: a processor and a storage device; the storage device stores a computer program, and the computer program is executed when being run by the processor as in the first aspect The method of any one, or the method of any one of the second aspect.
第七方面,本公开实施例提供了一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机程序,所述计算机程序被处理器运行时执行上述第一方面任一项所述的方法的步骤或上述第二方面任一项所述的方法的步骤。In a seventh aspect, the embodiments of the present disclosure provide a computer-readable storage medium having a computer program stored on the computer-readable storage medium. The steps of the method or the steps of the method described in any one of the second aspects above.
本公开实施例提供了一种室内视觉导航方法、装置、系统及电子设备,通过移动设备将采集到的待定位的室内图像上传至服务器,通过服务器确定移动设备采集室内图像时的相机位姿,并将室内图像对应的相机位姿下发给移动设备,然后移动设备基于室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系,并基于用户设定的目的地信息在预先导入的室内拓扑地图中规划最短路线,最后将移动设备采集的当前预览图像展示在界面上,并基于AR坐标系和最短路线将用于指示路线行径方向的三维标识叠加在当前预览图像上,从而实现室内视觉导航。本实施例提供的上述方式能够在室内以AR方式指导用户按照最短路线前往目的地,较好地提高了用户体验。The embodiments of the present disclosure provide an indoor visual navigation method, device, system, and electronic equipment. The collected indoor image to be positioned is uploaded to a server through the mobile device, and the server determines the camera pose when the mobile device collects the indoor image. And send the camera pose corresponding to the indoor image to the mobile device, and then the mobile device establishes an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image, and imports it in advance based on the destination information set by the user. The shortest route is planned in the indoor topology map, and finally the current preview image collected by the mobile device is displayed on the interface. Based on the AR coordinate system and the shortest route, a three-dimensional mark indicating the direction of the route is superimposed on the current preview image to realize the indoor Visual navigation. The above method provided in this embodiment can guide the user to follow the shortest route to the destination in the AR mode indoors, which better improves the user experience.
本公开实施例的其他特征和优点将在随后的说明书中阐述,或者,部分特征和优点可以从说明书推知或毫无疑义地确定,或者通过实施本公开实施例的上述技术即可得知。Other features and advantages of the embodiments of the present disclosure will be described in the following specification, or part of the features and advantages can be inferred from the specification or determined without doubt, or can be learned by implementing the above-mentioned technology of the embodiments of the present disclosure.
为使本公开的上述目的、特征和优点能更明显易懂,下文特举较佳实施例,并配合所附附图,作详细说明如下。In order to make the above objectives, features and advantages of the present disclosure more comprehensible, preferred embodiments accompanied with accompanying drawings are described in detail as follows.
附图说明Description of the drawings
为了更清楚地说明本公开具体实施方式或现有技术中的技术方案,下面将对具体实施方式或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本公开的一些实施方式,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the specific embodiments of the present disclosure or the technical solutions in the prior art, the following will briefly introduce the drawings that need to be used in the specific embodiments or the description of the prior art. Obviously, the appendix in the following description The drawings are some embodiments of the present disclosure. For those of ordinary skill in the art, without creative work, other drawings can be obtained based on these drawings.
图1示出了本公开实施例所提供的一种电子设备的结构示意图;FIG. 1 shows a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure;
图2示出了本公开实施例所提供的一种室内视觉导航方法流程图;Fig. 2 shows a flowchart of an indoor visual navigation method provided by an embodiment of the present disclosure;
图3示出了本公开实施例所提供的另一种室内视觉导航方法流程图;Fig. 3 shows a flowchart of another indoor visual navigation method provided by an embodiment of the present disclosure;
图4示出了本公开实施例所提供的另一种室内视觉导航方法流程图;FIG. 4 shows a flowchart of another indoor visual navigation method provided by an embodiment of the present disclosure;
图5示出了本公开实施例所提供的一种室内视觉导航装置的结构框图;FIG. 5 shows a structural block diagram of an indoor visual navigation device provided by an embodiment of the present disclosure;
图6示出了本公开实施例所提供的另一种室内视觉导航装置的结构框图。Fig. 6 shows a structural block diagram of another indoor visual navigation device provided by an embodiment of the present disclosure.
具体实施方式Detailed ways
为使本公开实施例的目的、技术方案和优点更加清楚,下面将结合附图对本公开的技术方案进行描述,显然,所描述的本实施例中介绍的各实现方式仅是一部分可能的实现方式,而不是全部的实现方式。In order to make the objectives, technical solutions, and advantages of the embodiments of the present disclosure clearer, the technical solutions of the present disclosure will be described below in conjunction with the accompanying drawings. Obviously, the described implementations in this embodiment are only a part of the possible implementations. , Not the full implementation.
基于用户无法利用诸如手机等移动终端进行室内导航的问题,本公开实施例提供的一种室内视觉导航方法、装置、系统及电子设备,该技术可应用于任何需要进行室内导航的场合,以下对本公开实施例进行详细介绍。Based on the problem that users cannot use mobile terminals such as mobile phones for indoor navigation, the embodiments of the present disclosure provide an indoor visual navigation method, device, system, and electronic equipment. The technology can be applied to any occasion where indoor navigation is required. The disclosed embodiments are described in detail.
首先,参照图1来描述用于实现本公开实施例的一种室内视觉导航方法、装置、系统及电子设备的示例电子设备100。First, with reference to FIG. 1, an example electronic device 100 for implementing an indoor visual navigation method, apparatus, system, and electronic device according to an embodiment of the present disclosure will be described.
如图1所示的一种电子设备的结构示意图,电子设备100包括一个或多个处理器102、一个或多个存储装置104、输入装置106、输出装置108以及图像采集装置110,这些组件通过总线系统112和/或其它形式的连接机构(未示出)互连。应当注意,图1所示的电子设备100的组件和结构只是示例性的,而非限制性的,根据需要,所述电子设备也可以具有其他组件和结构。As shown in FIG. 1 is a schematic structural diagram of an electronic device. The electronic device 100 includes one or more processors 102, one or more storage devices 104, an input device 106, an output device 108, and an image acquisition device 110. These components pass through The bus system 112 and/or other forms of connection mechanisms (not shown) are interconnected. It should be noted that the components and structure of the electronic device 100 shown in FIG. 1 are only exemplary and not restrictive, and the electronic device may also have other components and structures as required.
所述处理器102可以采用数字信号处理器(DSP)、现场可编程门阵列(FPGA)、可编程逻辑阵列(PLA)中的至少一种硬件形式来实现,所述处理器102可以是中央处理单元(CPU)、图形处理单元(GPU)或者具有数据处理能力和/或指令执行能力的其它形式的处理单元中的一种或几种的组合,并且可以控制所述电子设备100中的其它组件以执行期望的功能。The processor 102 may be implemented in a hardware form of at least one of a digital signal processor (DSP), a field programmable gate array (FPGA), and a programmable logic array (PLA), and the processor 102 may be a central processing unit. A unit (CPU), a graphics processing unit (GPU), or other forms of processing units with data processing capabilities and/or instruction execution capabilities, or a combination of several, and can control other components in the electronic device 100 To perform the desired function.
所述存储装置104可以包括一个或多个计算机程序产品,所述计算机程序产品可以包括各种形式的计算机可读存储介质,例如易失性存储器和/或非易失性存储器。所述易失性存储器例如可以包括随机存取存储器(RAM)和/或高速缓冲存储器(cache)等。所述非易失性存储器例如可以包括只读存储器(ROM)、硬盘、闪存等。在所述计算机可读存储介质上可以存储一个或多个计算机程序指令,处理器102可以运行所述程序指令,以实现下文所述的本公开实施例中(由处理器实现)的客户端功能以及/或者其它期望的功能。在所述计算机可读存储介质中还可以存储各种应用程序和各种数据,例如所述应用程序使用和/或产生的各种数据等。The storage device 104 may include one or more computer program products, and the computer program products may include various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory. The volatile memory may include random access memory (RAM) and/or cache memory (cache), for example. The non-volatile memory may include, for example, read-only memory (ROM), hard disk, flash memory, and the like. One or more computer program instructions may be stored on the computer-readable storage medium, and the processor 102 may run the program instructions to implement the client functions (implemented by the processor) in the embodiments of the present disclosure described below. And/or other desired functions. Various application programs and various data, such as various data used and/or generated by the application program, can also be stored in the computer-readable storage medium.
所述输入装置106可以是用户用来输入指令的装置,并且可以包括键盘、鼠标、麦克风和触摸屏等中的一个或多个。The input device 106 may be a device used by a user to input instructions, and may include one or more of a keyboard, a mouse, a microphone, and a touch screen.
所述输出装置108可以向外部(例如,用户)输出各种信息(例如,图像或声音),并且可以包括显示器、扬声器等中的一个或多个。The output device 108 may output various information (for example, images or sounds) to the outside (for example, a user), and may include one or more of a display, a speaker, and the like.
所述图像采集装置110可以拍摄用户期望的图像(例如照片、视频等),并且将所拍摄的图像存储在所述存储装置104中以供其它组件使用。The image capture device 110 may capture images (for example, photos, videos, etc.) desired by the user, and store the captured images in the storage device 104 for use by other components.
示例性地,用于实现根据本公开实施例的室内视觉导航方法、装置、系统及电子设备的示例电子设备可以被实现为诸如智能手机、平板电脑、可穿戴电子设备、计算机、服务器等智能终端。Exemplarily, the example electronic equipment used to implement the indoor visual navigation method, apparatus, system, and electronic equipment according to the embodiments of the present disclosure may be implemented as smart terminals such as smart phones, tablets, wearable electronic equipment, computers, servers, etc. .
本公开实施例可以站在移动设备侧提供了一种室内视觉导航方法,该方法可以由诸如手机、平板电脑、可穿戴电子设备等移动设备执行,参见图2所示的室内视觉导航方法流程图,该方法主要包括如下步骤S202~步骤S208:The embodiments of the present disclosure can provide an indoor visual navigation method on the side of a mobile device. The method can be executed by a mobile device such as a mobile phone, a tablet computer, a wearable electronic device, etc., see the flowchart of the indoor visual navigation method shown in FIG. 2 , The method mainly includes the following steps S202 to S208:
步骤S202,如果采集到待定位的室内图像,将室内图像上传至服务器,以使服务器确 定移动设备采集室内图像时的相机位姿。Step S202: If an indoor image to be located is collected, the indoor image is uploaded to the server so that the server can determine the camera pose when the mobile device collects the indoor image.
以下均以移动设备是手机为例进行说明,用户置身于室内场景时可能不知道自己当前所处的位置,更不知道从当前所处的位置如何前往目的地,因此可以先通过手机先拍摄一张当前场景下的室内图像,然后将室内图像上传给服务器,通过服务器基于室内图像进行视觉定位。The following is an example of the mobile device being a mobile phone. When the user is in an indoor scene, he may not know his current location, let alone how to get to the destination from his current location, so he can take a picture first with his mobile phone. Zhang indoor images in the current scene, and then upload the indoor images to the server, through the server to perform visual positioning based on the indoor images.
步骤S204,接收服务器返回的室内图像对应的相机位姿,并基于室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系。Step S204: Receive the camera pose corresponding to the indoor image returned by the server, and establish an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image.
AR(Augmented Reality,增强现实)技术是一种将虚拟信息与真实世界进行融合的技术,可以将虚拟信息应用到真实世界,通过将虚拟信息(诸如虚拟图形)与真实场景进行叠加,可以呈现出虚拟图形与真实场景图像在同一个画面或同一个空间的效果,从而使用户体验虚拟与现实结合的场景。在一种可能的实现方式中,采用AR导航方式实现,由于需要将虚拟图形显示于真实场景中,因此需要建立AR坐标系,并基于室内图像对应的相机位姿对AR坐标系进行调整,以使AR坐标系与世界坐标系对齐。其中,AR坐标系的建立可以利用现有的SLAM算法(Simultaneous Localization and Mapping,即时定位与地图构建)实现,在此不再赘述。AR (Augmented Reality) technology is a technology that integrates virtual information with the real world. It can apply virtual information to the real world. By superimposing virtual information (such as virtual graphics) with real scenes, it can show The effect of virtual graphics and real scene images on the same screen or in the same space, so that users can experience a scene that combines virtual and reality. In one possible implementation method, AR navigation is adopted. Since virtual graphics need to be displayed in the real scene, it is necessary to establish an AR coordinate system and adjust the AR coordinate system based on the camera pose corresponding to the indoor image. Align the AR coordinate system with the world coordinate system. Among them, the establishment of the AR coordinate system can be realized by using the existing SLAM algorithm (Simultaneous Localization and Mapping, real-time positioning and map construction), which will not be repeated here.
步骤S206,基于用户设定的目的地信息在预先导入的室内拓扑地图中规划最短路线。Step S206: Plan the shortest route in the pre-imported indoor topology map based on the destination information set by the user.
用户可以在手机APP中输入目的地名称,或者直接在手机APP呈现的室内平面图上执行点选目的地的操作,在此不进行限定。当手机获取到用户设定的目的地信息后,则可以在预先导入的室内拓扑地图中进行路线规划。在室内拓扑地图中,可以将各店铺作为地图中的节点,将室内路径作为地图中的边径。在一种可能的实现方式中,诸如商场、图书馆、博物馆等需要提供室内导航服务的地区可以事先向服务器提供室内拓扑地图,服务器也可以直接根据室内平面图转换而来。The user can enter the name of the destination in the mobile APP, or directly perform the operation of tapping the destination on the indoor floor plan presented by the mobile APP, which is not limited here. After the mobile phone obtains the destination information set by the user, it can plan the route in the pre-imported indoor topology map. In an indoor topology map, each store can be regarded as a node in the map, and indoor paths can be regarded as an edge path in the map. In a possible implementation, areas such as shopping malls, libraries, museums and other areas that need to provide indoor navigation services can provide the server with an indoor topology map in advance, and the server can also directly convert it from the indoor floor plan.
步骤S208,将移动设备采集的当前预览图像展示在移动设备的界面上,并基于AR坐标系和最短路线,将用于指示路线行径方向的三维标识叠加在当前预览图像上。In step S208, the current preview image collected by the mobile device is displayed on the interface of the mobile device, and based on the AR coordinate system and the shortest route, a three-dimensional mark indicating the direction of the route is superimposed on the current preview image.
移动设备的相机开始处于拍摄状态并持续采集图像,并将采集到的图像展示在移动终端的屏幕上,也可称为该移动设备的相机处于图像预览模式。用户在手持手机进行导航时,手机摄像头处于预览模式,用户可以通过手机界面看到当前叠加有用于指示路线行径方向的三维标识标注于室内场景内(诸如将箭头标注于地面上)。The camera of the mobile device starts to be in a shooting state and continues to collect images, and displays the collected images on the screen of the mobile terminal. It can also be said that the camera of the mobile device is in an image preview mode. When the user is holding the mobile phone while navigating, the mobile phone camera is in preview mode, and the user can see through the mobile phone interface that the current three-dimensional mark indicating the direction of the route is superimposed and marked in the indoor scene (such as marking the arrow on the ground).
通过本实施例提供的上述室内视觉导航方法,移动设备可以在室内以AR方式指导用户按照最短路线前往目的地,较好地提高了用户体验。Through the indoor visual navigation method provided in this embodiment, the mobile device can guide the user to the destination according to the shortest route in the AR mode indoors, which better improves the user experience.
在基于室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系时,可以首先建立初始的AR坐标系,然后基于室内图像对应的相机位姿调整初始的AR坐标系,以使AR 坐标系与世界坐标系对齐。When establishing an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image, you can first establish the initial AR coordinate system, and then adjust the initial AR coordinate system based on the camera pose corresponding to the indoor image to make the AR coordinate The system is aligned with the world coordinate system.
在手机内可以设置有AR系统,AR系统通常是一个包含回环检测的视觉惯性里程计(Visual-Inertial Odometry,VIO),其中,AR坐标系又可称为VIO坐标系。AR系统可以是iOS系统中自带的AR Kit,或者Android自带的AR Core,还可以是任意可实现移动设备导航功能的第三方系统,在此不进行限制。An AR system can be installed in the mobile phone. The AR system is usually a Visual-Inertial Odometry (VIO) that includes loop detection. The AR coordinate system can also be referred to as the VIO coordinate system. The AR system can be the AR Kit that comes with the iOS system, or the AR Core that comes with the Android, or any third-party system that can implement the navigation function of the mobile device, which is not limited here.
初期建立的AR坐标系通常以采集的第一帧图像为坐标原点,为了能够使基于AR坐标系绘制的虚拟图像能够与世界坐标系下的真实场景图像较好的融合,因此需要利用室内图像对应的相机位姿对初始的AR坐标系进行调整,以便于手机能够将规划好的路径顺利转换至世界坐标系处,从而使得虚拟图形与真实场景图像更好的相结合。The AR coordinate system established at the initial stage usually takes the first frame image as the coordinate origin. In order to enable the virtual image drawn based on the AR coordinate system to be better integrated with the real scene image in the world coordinate system, it is necessary to use the corresponding indoor image The camera pose adjusts the initial AR coordinate system so that the mobile phone can smoothly convert the planned path to the world coordinate system, so that the virtual graphics and the real scene image are better combined.
在一种可能的实现方式中,本公开实施例给出了基于用户设定的目的地信息在预先导入的室内拓扑地图中规划最短路线的具体实施方式:基于用户设定的目的地信息,利用路径规划算法在预先导入的室内拓扑地图中规划最短路线。路径规划算法可以为A*算法,当然也可以为其它路径规划算法,在此不进行限定。In a possible implementation manner, the embodiment of the present disclosure provides a specific implementation manner of planning the shortest route in a pre-imported indoor topology map based on the destination information set by the user: based on the destination information set by the user, using The path planning algorithm plans the shortest route in the pre-imported indoor topology map. The path planning algorithm can be an A* algorithm, of course, it can also be other path planning algorithms, which is not limited here.
为了便于清楚地为用户提供导航方向,本实施例在执行上述步骤S208时,可以参照如下步骤实现:(1)检测当前预览图像上的地平面;(2)确定最短路线在AR坐标系中的三维坐标,并基于确定的三维坐标生成用于指示路线行径方向的三维标识;(3)将三维标识绘制在当前预览图像上的地平面。诸如,三维标识可以为虚拟箭头、虚线路径标识等。用户可以按照三维标识指示的方向进行行走,可最终以最短路径到达目的地。In order to clearly provide the user with the navigation direction, this embodiment can refer to the following steps when performing the above step S208: (1) Detect the ground plane on the current preview image; (2) Determine the shortest route in the AR coordinate system Three-dimensional coordinates, and based on the determined three-dimensional coordinates, a three-dimensional mark is generated for indicating the direction of the route; (3) the three-dimensional mark is drawn on the ground plane of the current preview image. For example, the three-dimensional mark may be a virtual arrow, a dotted path mark, and the like. The user can walk in the direction indicated by the three-dimensional logo, and finally reach the destination with the shortest path.
考虑到在AR导航过程中,由于初始视觉定位位姿的误差和AR系统自身的漂移等原因,导航路线可能会逐渐变得不准确,例如表现为导航路线开始与建筑物相交。因此为了保证全程的准确导航,本实施例中的移动设备在导航过程中如果接收到服务器下发的当前相机位姿,基于当前相机位姿对AR坐标系进行修正,以使修正后的AR坐标系与世界坐标系保持对齐。也即,服务器会定时获取移动设备在导航过程中采集的当前预览图像,并确定移动设备采集当前预览图像时的当前相机位姿,并将当前相机位姿下发给移动设备,然后移动设备可以基于当前相机位姿对AR坐标系进行修正,从而提升导航准确性。Considering that in the process of AR navigation, the navigation route may gradually become inaccurate due to the error of the initial visual positioning pose and the drift of the AR system itself, for example, the navigation route begins to intersect the building. Therefore, in order to ensure accurate navigation throughout the entire process, if the mobile device in this embodiment receives the current camera pose issued by the server during the navigation process, it will correct the AR coordinate system based on the current camera pose to make the corrected AR coordinate The system remains aligned with the world coordinate system. That is, the server will periodically obtain the current preview image collected by the mobile device during the navigation process, and determine the current camera pose when the mobile device collects the current preview image, and send the current camera pose to the mobile device, and then the mobile device can Correct the AR coordinate system based on the current camera pose to improve navigation accuracy.
本实施例还可以站在服务器侧提供了一种室内视觉导航方法,该方法可以由诸如云服务器执行,参见图3所示的室内视觉导航方法流程图,该方法主要包括如下步骤S302~步骤S304:This embodiment can also provide an indoor visual navigation method on the server side. The method can be executed by, for example, a cloud server. Refer to the flowchart of the indoor visual navigation method shown in FIG. 3, the method mainly includes the following steps S302 to S304 :
步骤S302,如果接收到移动设备上传的待定位的室内图像,确定移动设备采集室内图像时的相机位姿。其中,相机位姿可以包括XY坐标和相机方向朝向。Step S302: If the indoor image to be located uploaded by the mobile device is received, the camera pose when the mobile device collects the indoor image is determined. Among them, the camera pose may include XY coordinates and camera orientation.
步骤S304,将室内图像对应的相机位姿下发给移动设备,以使移动设备基于室内图像 对应的相机位姿建立与世界坐标系对齐的AR坐标系,并基于AR坐标系和最短路线将用于指示路线行径方向的三维标识叠加在移动设备采集的当前预览图像上;其中,最短路线是移动设备基于用户设定的目的地信息在室内拓扑地图中规划得到的。In step S304, the camera pose corresponding to the indoor image is sent to the mobile device, so that the mobile device establishes an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image, and will use the AR coordinate system and the shortest route based on the AR coordinate system and the shortest route. The three-dimensional mark indicating the direction of the route is superimposed on the current preview image collected by the mobile device; among them, the shortest route is planned by the mobile device in the indoor topology map based on the destination information set by the user.
通过本实施例提供的上述室内视觉导航方法,可以由服务器承担导航过程中所需的诸如相机位姿计算等计算量较大的步骤,然后通过移动设备快速便捷地在室内以AR方式指导用户按照最短路线前往目的地,较好地提高了用户体验。Through the above-mentioned indoor visual navigation method provided by this embodiment, the server can undertake the large calculation steps required in the navigation process, such as camera pose calculation, and then use the mobile device to quickly and conveniently instruct the user to follow the instructions in the room. The shortest route to the destination improves the user experience.
在本实施例中,可以将较为耗费时间和空间的计算过程都由服务器侧执行,诸如由服务器确定移动设备采集室内图像时的相机位姿,在一种可能的实现方式中,服务器可以将室内图像与预先建立的视觉地图库中的视觉地图进行特征匹配,得到移动设备采集室内图像时的相机位姿;其中,视觉地图通过室内场景的稀疏点云模型表征,可以理解为场景的大量视觉特征。在一种可能的实现方式中,可以根据3D-2D匹配关系来求解PNP(Perspective-n-Point)问题,也即求解3D到2D点对的运动方法,以便根据特征点对运动求解相机位姿。In this embodiment, the more time-consuming and space-consuming calculation process can be executed by the server side, such as the server determining the camera pose when the mobile device collects indoor images. In a possible implementation manner, the server can set the indoor The image is feature-matched with the visual map in the pre-established visual map library to obtain the camera pose when the mobile device collects indoor images; among them, the visual map is represented by the sparse point cloud model of the indoor scene, which can be understood as a large number of visual features of the scene . In a possible implementation, the PNP (Perspective-n-Point) problem can be solved according to the 3D-2D matching relationship, that is, the 3D to 2D point pair movement method is solved, so as to solve the camera pose according to the feature point pair movement .
本实施例给出一种将室内图像与预先建立的视觉地图库中的视觉地图进行特征匹配,得到移动设备采集室内图像时的相机位姿的一种可能的实现方式,主要包括粗定位和细定位两个环节,分别介绍如下:This embodiment provides a possible implementation of matching indoor images with visual maps in a pre-established visual map library to obtain the camera pose when the mobile device collects indoor images, mainly including coarse positioning and fine positioning. Two links of positioning are introduced as follows:
在粗定位环节中,利用深度哈希算法计算室内图像的全局图像描述符,在视觉地图库中查找与其最相似的k个关键帧图像,得到关键帧ID,然后根据关键帧ID查找已存储的关键帧信息,包括关键帧的位姿、局部特征点、局部描述子以及对应的地图点在世界坐标系中的坐标。根据关键帧的位姿对这k个关键帧聚类,将位置相似的关键帧聚为一类。对于每个聚类,其聚类中心为后续的细定位提供了初步的粗定位结果。In the rough positioning link, the deep hash algorithm is used to calculate the global image descriptor of the indoor image, the k most similar key frame images are searched in the visual map library, and the key frame ID is obtained, and then the stored key frame ID is searched according to the key frame ID. Key frame information, including the pose of the key frame, local feature points, local descriptors and the coordinates of the corresponding map points in the world coordinate system. The k key frames are clustered according to the position and pose of the key frames, and the key frames with similar positions are grouped into one category. For each cluster, its cluster center provides preliminary coarse positioning results for subsequent fine positioning.
在细定位环节中,首先需要遍历每个聚类,对室内图像提取局部特征点,计算局部描述子并与聚类中的所有关键帧的局部特征匹配,然后取出匹配成功的特征点对应的3D地图点,若匹配成功的3D-2D点对数大于预设个数(诸如大于5),则可以求解PNP问题,得到室内图像对应的相机位姿。可以将PNP求得的位姿作为初始值,进一步构建Bundle Adjustment图优化问题,从而可以优化室内图像对应的位姿,使得重投影误差最小。在位姿优化后剔除重投影误差仍然较大的边,用其余边再次构建Bundle Adjustment图优化问题,最后得到较精确的室内图像对应的位姿。如果该过程中出现3D-2D点对数过少,或优化后重投影误差过大的现象,则认为当前聚类中的关键帧为错误匹配,放弃该聚类。如果优化后重投影误差较小,则认为位姿求解是正确的,直接输出结果,不再进入下个聚类的循环。In the fine positioning process, it is first necessary to traverse each cluster, extract local feature points from indoor images, calculate local descriptors and match them with the local features of all key frames in the cluster, and then extract the 3D corresponding to the successfully matched feature points. For map points, if the number of 3D-2D point pairs successfully matched is greater than the preset number (such as greater than 5), the PNP problem can be solved to obtain the camera pose corresponding to the indoor image. The pose obtained by the PNP can be used as the initial value to further construct the Bundle Adjustment graph optimization problem, so that the pose corresponding to the indoor image can be optimized to minimize the reprojection error. After the pose optimization, the edges whose reprojection error is still large are removed, and the Bundle Adjustment graph optimization problem is constructed again with the remaining edges, and finally a more accurate pose corresponding to the indoor image is obtained. If the number of 3D-2D point pairs is too small or the reprojection error is too large after optimization, the key frame in the current cluster is considered to be a mismatch, and the cluster is abandoned. If the reprojection error is small after optimization, it is considered that the pose solution is correct, and the result is output directly, without entering the next clustering cycle.
服务器在经过上述由粗到细的两步定位之后,可以得到较为精确的室内图像对应的相机位姿。当然,以上仅为本实施例提供的一种位姿确定方法,还可以采用其它任何确定相 机位姿的方式实现,在此不进行限制。The server can obtain a more accurate camera pose corresponding to the indoor image after the above-mentioned two-step positioning from coarse to fine. Of course, the above is only one method for determining the pose provided by this embodiment, and any other method for determining the pose of the camera can also be used, and it is not limited here.
在一种可能的实现方式中,服务器还会将视觉地图与预先导入的室内平面分布图对齐,以便在用于室内导航的移动设备将规划的路径顺利转换至世界坐标系。其中,室内平面分布图又可以理解为建筑结构图,可以由诸如商场预先将室内平面分布图上传给服务器。In a possible implementation, the server will also align the visual map with the pre-imported indoor plan distribution map, so that the mobile device used for indoor navigation can smoothly convert the planned path to the world coordinate system. Among them, the indoor floor plan can be understood as a building structure drawing, and the indoor floor plan can be uploaded to the server in advance by, for example, a shopping mall.
服务器可以预先构建视觉地图库,视觉地图库的建立过程包括如下步骤:(1)获取移动设备在室内场景采集得到的多张场景图像。在一种可能的实现方式中,可以预先采集室内各个场景的大量图像,以便构建较为精确的稀疏点云模型。(2)基于SFM(Structure From Motion)算法对多张场景图像进行三维重建,得到包含多张场景图像对应的稀疏点云模型的视觉地图库。SFM建图过程可以采用诸如COLMAP、Theia、VisualSfM、OpenMVG等开源算法实现,在此不进行限制。The server may construct a visual map library in advance, and the process of establishing the visual map library includes the following steps: (1) Obtain multiple scene images collected by the mobile device in the indoor scene. In a possible implementation manner, a large number of images of various indoor scenes can be collected in advance, so as to construct a more accurate sparse point cloud model. (2) Based on the SFM (Structure From Motion) algorithm, three-dimensional reconstruction of multiple scene images is performed, and a visual map library containing sparse point cloud models corresponding to multiple scene images is obtained. The SFM mapping process can be implemented using open source algorithms such as COLMAP, Theia, VisualSfM, OpenMVG, etc., and there is no restriction here.
在一种可能的实现方式中,为了节约服务器的磁盘存储空间,还可以将视觉地图库中的视觉地图进行压缩,诸如,可采用乘积量化等方法对视觉地图中的原始特征进行编码,视觉地图中只保存编码后的结果,不保存原始特征,从而大幅压缩地图大小。在服务器利用视觉数据库进行视觉定位时,可以对编码后的视觉特征进行解码,用解码后的特征进行匹配和位姿估计。In a possible implementation, in order to save the disk storage space of the server, the visual map in the visual map library can also be compressed. For example, the original features in the visual map can be encoded by the method of product quantization, and the visual map Only the encoded result is saved in, and the original features are not saved, thus greatly compressing the map size. When the server uses the visual database for visual positioning, the encoded visual features can be decoded, and the decoded features can be used for matching and pose estimation.
通过上述方式,服务器可以承担较大计算量的视觉地图库构建以及相机位姿确定环节,从而降低了移动设备的硬件要求,也使得移动设备能够更为快速地基于服务器的计算结果为用户提供导航服务。Through the above method, the server can undertake the construction of a large amount of calculation of the visual map library and the determination of the camera pose, thereby reducing the hardware requirements of the mobile device, and also enabling the mobile device to provide users with navigation based on the calculation results of the server more quickly service.
在一种可能的实现方式中,考虑到AR导航过程中,由于初始视觉定位位姿的误差和AR系统自身的漂移等原因,导航路线可能会逐渐变得不准确,服务器还可以定时获取移动设备在导航过程中采集的当前预览图像,并确定移动设备采集当前预览图像时的当前相机位姿;将当前相机位姿下发给移动设备,以使移动设备基于当前相机位姿对AR坐标系进行修正。In a possible implementation, considering that during the AR navigation process, the navigation route may gradually become inaccurate due to the error of the initial visual positioning pose and the drift of the AR system itself, and the server can also periodically obtain the mobile device The current preview image collected during the navigation process, and determine the current camera pose when the mobile device collects the current preview image; send the current camera pose to the mobile device, so that the mobile device can perform the AR coordinate system based on the current camera pose Fix.
本公开实施例还提供了一种以站在移动设备侧和站在服务器侧为基础的室内视觉导航方法的可能的实现方式,具体可参照如图4所示的一种室内视觉导航方法流程图,包括如下步骤:The embodiments of the present disclosure also provide a possible implementation of an indoor visual navigation method based on standing on the mobile device side and standing on the server side. For details, please refer to the flowchart of an indoor visual navigation method as shown in FIG. 4 , Including the following steps:
步骤S410:采集多张室内图像;Step S410: Collect multiple indoor images;
步骤S412:基于室内图像进行SFM建图,生成视觉地图数据库。Step S412: Perform SFM mapping based on the indoor image to generate a visual map database.
步骤S414:将视觉地图数据库中的视觉地图与室内平面分布图对齐。其中,室内平面分布图又可称为建筑结构图。Step S414: align the visual map in the visual map database with the indoor plan distribution map. Among them, the indoor plane distribution map can also be called the architectural structure map.
步骤S420:接收到手机上传的待定位的室内图像;Step S420: receiving the indoor image to be located uploaded by the mobile phone;
步骤S422:提取待定位的室内图像的图像特征,将提取的图像特征与视觉地图进行匹配,估计手机采集室内图像时的相机位姿;Step S422: Extract the image features of the indoor image to be located, match the extracted image features with the visual map, and estimate the camera pose when the mobile phone collects the indoor image;
步骤S424:将估计得到的相机位姿返回至手机;Step S424: Return the estimated camera pose to the mobile phone;
步骤S430:建立AR坐标系;Step S430: Establish an AR coordinate system;
步骤S432:将AR坐标系与手机采集室内图像时的相机位姿对齐;Step S432: align the AR coordinate system with the camera pose when the mobile phone collects indoor images;
步骤S434:接收用户设定的目的地信息,在室内拓扑地图中规划最短路线;Step S434: Receive destination information set by the user, and plan the shortest route in the indoor topology map;
步骤S436:检测地平面;Step S436: Detect the ground plane;
步骤S438:将最短路线转换为AR坐标系中的三维坐标,并将路线用箭头绘制在地平面上。Step S438: Convert the shortest route into three-dimensional coordinates in the AR coordinate system, and draw the route on the ground plane with arrows.
上述步骤的具体实施操作可以参见站在移动设备侧的室内视觉导航方法和站在服务器侧的室内视觉导航方法的内容,在此不再赘述。The specific implementation operations of the above steps can refer to the content of the indoor visual navigation method standing on the mobile device side and the indoor visual navigation method standing on the server side, which will not be repeated here.
其中,步骤S410~步骤S414可以统称为视觉地图构建操作,步骤S420~步骤S424可以统称为云端视觉定位操作,视觉地图构建操作和云端视觉定位操作均可由服务器执行,步骤S430~步骤S438可以统称为移动端AR导航操作,可以由手机等移动设备执行。Among them, steps S410 to S414 can be collectively referred to as a visual map construction operation, and steps S420 to S424 can be collectively referred to as a cloud visual positioning operation. Both the visual map construction operation and the cloud visual positioning operation can be performed by the server. Steps S430 to S438 can be collectively referred to as AR navigation operations on the mobile terminal can be performed by mobile devices such as mobile phones.
本实施例提供的上述室内视觉导航方法,允许用户随时随地使用手机通过拍摄周围环境的方式确定自身的位置,选择目的地后可以透过手机屏幕看到手机通过路径选择算法规划的最佳路线,沿着该路线行走即可到达目的地。而且上述室内视觉导航方法将比较耗费时间和空间的计算过程放在服务器端进行,使得用户可在移动设备上实现实时的定位和导航。The aforementioned indoor visual navigation method provided by this embodiment allows users to use their mobile phone to determine their location by photographing the surrounding environment anytime and anywhere. After selecting a destination, they can see the best route planned by the mobile phone through the path selection algorithm through the mobile phone screen. Walk along the route to reach the destination. Moreover, the aforementioned indoor visual navigation method puts a relatively time-consuming and space-consuming calculation process on the server side, so that users can realize real-time positioning and navigation on mobile devices.
本公开实施例还提供了一种设置在移动设备侧的室内视觉导航装置,参见图5所示的室内视觉导航装置的结构框图,包括如下模块:The embodiment of the present disclosure also provides an indoor visual navigation device arranged on the side of the mobile device. Refer to the structural block diagram of the indoor visual navigation device shown in FIG. 5, which includes the following modules:
图像上传模块502,配置成如果采集到待定位的室内图像,将室内图像上传至服务器,以使服务器确定移动设备采集室内图像时的相机位姿;The image upload module 502 is configured to upload the indoor image to the server if the indoor image to be located is collected, so that the server can determine the camera pose when the mobile device collects the indoor image;
坐标系建立模块504,配置成接收服务器返回的室内图像对应的相机位姿,并基于室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系;The coordinate system establishment module 504 is configured to receive the camera pose corresponding to the indoor image returned by the server, and establish an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image;
路线规划模块506,配置成基于用户设定的目的地信息在预先导入的室内拓扑地图中规划最短路线;The route planning module 506 is configured to plan the shortest route in the pre-imported indoor topology map based on the destination information set by the user;
导航展示模块508,配置成将移动设备采集的当前预览图像展示在移动设备的界面上,并基于AR坐标系和最短路线,将用于指示路线行径方向的三维标识叠加在当前预览图像上。The navigation display module 508 is configured to display the current preview image collected by the mobile device on the interface of the mobile device, and based on the AR coordinate system and the shortest route, superimpose a three-dimensional mark indicating the direction of the route on the current preview image.
通过本实施例提供的上述室内视觉导航装置,移动设备可以在室内以AR方式指导用 户按照最短路线前往目的地,较好地提高了用户体验。Through the aforementioned indoor visual navigation device provided in this embodiment, the mobile device can guide the user to the destination according to the shortest route in the AR mode indoors, which better improves the user experience.
在一种可能的实现方式中,坐标系建立模块504配置成建立初始的AR坐标系;基于室内图像对应的相机位姿调整初始的AR坐标系,以使AR坐标系与世界坐标系对齐。In a possible implementation, the coordinate system establishing module 504 is configured to establish an initial AR coordinate system; adjust the initial AR coordinate system based on the camera pose corresponding to the indoor image, so that the AR coordinate system is aligned with the world coordinate system.
在一种可能的实现方式中,路线规划模块506配置成基于目的地信息,利用路径规划算法在预先导入的室内拓扑地图中规划最短路线。In a possible implementation manner, the route planning module 506 is configured to use a route planning algorithm to plan the shortest route in the pre-imported indoor topology map based on the destination information.
在一种可能的实现方式中,导航展示模块508配置成检测当前预览图像上的地平面;确定最短路线在AR坐标系中的三维坐标,并基于确定的三维坐标生成配置成指示路线行径方向的三维标识;将三维标识绘制在当前预览图像上的地平面。In a possible implementation manner, the navigation display module 508 is configured to detect the ground plane on the current preview image; determine the three-dimensional coordinates of the shortest route in the AR coordinate system, and generate the three-dimensional coordinates configured to indicate the direction of the route based on the determined three-dimensional coordinates. Three-dimensional logo; draw the three-dimensional logo on the ground plane of the current preview image.
在一种可能的实现方式中,上述装置还包括坐标系修正模块,配置成在导航过程中如果接收到服务器下发的当前相机位姿,基于当前相机位姿对AR坐标系进行修正,以使修正后的AR坐标系与世界坐标系保持对齐。In a possible implementation, the above-mentioned device further includes a coordinate system correction module configured to correct the AR coordinate system based on the current camera pose if the current camera pose issued by the server is received during the navigation process, so that The revised AR coordinate system remains aligned with the world coordinate system.
本实施例所提供的装置,其实现原理及产生的技术效果和前述实施例相同,为简要描述,装置实施例部分未提及之处,可参考前述方法实施例中相应内容。The implementation principles and technical effects of the device provided in this embodiment are the same as those in the foregoing embodiment. For a brief description, for parts not mentioned in the device embodiment, please refer to the corresponding content in the foregoing method embodiment.
本公开实施例还提供了一种设置在服务器侧的室内视觉导航装置,参见图6所示的室内视觉导航装置的结构框图,包括如下模块:The embodiment of the present disclosure also provides an indoor visual navigation device arranged on the server side. Refer to the structural block diagram of the indoor visual navigation device shown in FIG. 6, which includes the following modules:
位姿确定模块602,配置成如果接收到移动设备上传的待定位的室内图像,确定移动设备采集室内图像时的相机位姿;The pose determination module 602 is configured to determine the camera pose when the mobile device collects the indoor image if an indoor image to be positioned uploaded by the mobile device is received;
设备导航模块604,配置成将室内图像对应的相机位姿下发给移动设备,以使移动设备基于室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系,并基于AR坐标系和最短路线将用于指示路线行径方向的三维标识叠加在移动设备采集的当前预览图像上;其中,最短路线是移动设备根据用户设定的目的地信息在室内拓扑地图中规划得到的。The device navigation module 604 is configured to send the camera pose corresponding to the indoor image to the mobile device, so that the mobile device establishes an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image, and based on the AR coordinate system and The shortest route superimposes a three-dimensional mark used to indicate the direction of the route on the current preview image collected by the mobile device; among them, the shortest route is planned by the mobile device in the indoor topology map according to the destination information set by the user.
通过本实施例提供的上述室内视觉导航装置,可以由服务器承担导航过程中所需的诸如相机位姿计算等计算量较大的步骤,然后通过移动设备快速便捷地在室内以AR方式指导用户按照最短路线前往目的地,较好地提高了用户体验。Through the above-mentioned indoor visual navigation device provided by this embodiment, the server can undertake the large calculation steps required in the navigation process, such as the calculation of camera pose, and then use the mobile device to quickly and conveniently instruct the user to follow the instructions in the room. The shortest route to the destination improves the user experience.
在一种实施方式中,位姿确定模块602配置成将室内图像与预先建立的视觉地图库中的视觉地图进行特征匹配,得到移动设备采集室内图像时的相机位姿;其中,视觉地图通过室内场景的稀疏点云模型表征。In one embodiment, the pose determination module 602 is configured to perform feature matching of the indoor image with the visual map in the pre-established visual map library to obtain the camera pose when the mobile device collects the indoor image; wherein, the visual map passes the indoor image. Sparse point cloud model representation of the scene.
在一种实施方式中,上述装置还包括地图建立模块,配置成获取移动设备在室内场景采集得到的多张场景图像;基于SFM算法对多张场景图像进行三维重建,得到包含多张场景图像对应的稀疏点云模型的视觉地图库。In one embodiment, the above-mentioned device further includes a map building module configured to obtain multiple scene images collected by the mobile device in an indoor scene; and perform three-dimensional reconstruction on the multiple scene images based on the SFM algorithm to obtain a corresponding map containing multiple scene images. A visual map library of the sparse point cloud model.
在一种实施方式中,上述装置还包括:对齐模块,配置成将视觉地图与预先导入的室内平面分布图对齐。In one embodiment, the above-mentioned device further includes: an alignment module configured to align the visual map with a pre-imported indoor plan distribution map.
在一种实施方式中,上述装置还包括:当前位姿确定模块,配置成定时获取移动设备在导航过程中采集的当前预览图像,并确定移动设备采集当前预览图像时的当前相机位姿;修正模块,配置成将当前相机位姿下发给移动设备,以使移动设备基于当前相机位姿对AR坐标系进行修正。In one embodiment, the above-mentioned device further includes: a current pose determination module configured to periodically obtain the current preview image collected by the mobile device during the navigation process, and determine the current camera pose when the mobile device collects the current preview image; The module is configured to send the current camera pose to the mobile device, so that the mobile device corrects the AR coordinate system based on the current camera pose.
本公开实施例所提供的装置,其实现原理及产生的技术效果和前述实施例相同,为简要描述,装置实施例部分未提及之处,可参考站在移动设备侧的室内视觉导航方法实施例中相应内容。The implementation principles and technical effects of the device provided by the embodiments of the present disclosure are the same as those of the previous embodiments. For a brief description, the parts not mentioned in the device embodiments can be implemented with reference to the indoor visual navigation method on the side of the mobile device. The corresponding content in the example.
本公开实施例还提供了一种室内视觉导航系统,该系统包括移动设备和服务器。The embodiment of the present disclosure also provides an indoor visual navigation system, which includes a mobile device and a server.
本公开实施例还提供了一种电子设备,包括:处理器和存储装置;存储装置上存储有计算机程序,计算机程序在被处理器运行时执行如站在移动设备侧的室内视觉导航方法或站在服务器侧的室内视觉导航方法。The embodiments of the present disclosure also provide an electronic device, including: a processor and a storage device; the storage device stores a computer program, and the computer program executes an indoor visual navigation method or a station on the side of a mobile device when the computer program is run by the processor. Indoor visual navigation method on the server side.
本公开实施例还提供了一种计算机可读存储介质,计算机可读存储介质上存储有计算机程序,计算机程序被处理器运行时执行站在移动设备侧的室内视觉导航方法或站在服务器侧的室内视觉导航方法。The embodiment of the present disclosure also provides a computer-readable storage medium, and a computer program is stored on the computer-readable storage medium. Indoor visual navigation method.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统具体工作过程,可以参考前述实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and conciseness of the description, the specific working process of the system described above can refer to the corresponding process in the foregoing embodiment, which will not be repeated here.
本公开实施例所提供的室内视觉导航方法、装置、系统及电子设备的计算机程序产品,包括存储了程序代码的计算机可读存储介质,所述程序代码包括的指令可用于执行前面方法实施例中所述的方法,具体实现可参见本公开实施例前述内容,在此不再赘述。The computer program product of the indoor visual navigation method, device, system, and electronic equipment provided by the embodiments of the present disclosure includes a computer-readable storage medium storing program code, and the instructions included in the program code can be used to execute the previous method embodiments For the specific implementation of the described method, please refer to the foregoing content of the embodiments of the present disclosure, and details are not described herein again.
另外,在本公开实施例的描述中,除非另有明确的规定和限定,术语“安装”、“相连”、“连接”应做广义理解,例如,可以是固定连接,也可以是可拆卸连接,或一体地连接;可以是机械连接,也可以是电连接;可以是直接相连,也可以通过中间媒介间接相连,可以是两个元件内部的连通。对于本领域的普通技术人员而言,可以具体情况理解上述术语在本公开中的具体含义。In addition, in the description of the embodiments of the present disclosure, unless otherwise clearly specified and limited, the terms "installed", "connected", and "connected" should be interpreted broadly, for example, they may be fixed connections or detachable connections. , Or integrally connected; it can be a mechanical connection or an electrical connection; it can be directly connected or indirectly connected through an intermediate medium, and it can be the internal communication between two components. For those of ordinary skill in the art, the specific meanings of the above-mentioned terms in the present disclosure can be understood in specific situations.
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本公开的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本公开各个实现方式所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。If the function is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium. Based on this understanding, the technical solution of the present disclosure essentially or the part that contributes to the prior art or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in the various implementations of the present disclosure. The aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disks or optical disks and other media that can store program codes. .
在本公开的描述中,需要说明的是,术语“中心”、“上”、“下”、“左”、“右”、“竖直”、“水平”、“内”、“外”等指示的方位或位置关系为基于附图所示的方位或位置关系,仅是为了便于描述本公开和简化描述,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本公开的限制。此外,术语“第一”、“第二”、“第三”仅用于描述目的,而不能理解为指示或暗示相对重要性。In the description of the present disclosure, it should be noted that the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", etc. The indicated orientation or positional relationship is based on the orientation or positional relationship shown in the drawings, and is only for the convenience of describing the present disclosure and simplifying the description, and does not indicate or imply that the pointed device or element must have a specific orientation or a specific orientation. The structure and operation cannot therefore be construed as a limitation of the present disclosure. In addition, the terms "first", "second", and "third" are only used for descriptive purposes, and cannot be understood as indicating or implying relative importance.
最后应说明的是:以上所述实施例,仅为本公开的可选地实施方式,用以说明本公开的技术方案,而非对其限制,本公开的保护范围并不局限于此,尽管参照前述实施例对本公开进行了详细的说明,本领域的普通技术人员应当理解:任何熟悉本技术领域的技术人员在本公开揭露的技术范围内,其依然可以对前述实施例所记载的技术方案进行修改或可轻易想到变化,或者对其中部分技术特征进行等同替换;而这些修改、变化或者替换,并不使相应技术方案的本质脱离本公开实施例技术方案的精神和范围,都应涵盖在本公开的保护范围之内。因此,本公开的保护范围应以所述权利要求的保护范围为准。Finally, it should be noted that the above-mentioned embodiments are only optional implementations of the present disclosure, which are used to illustrate the technical solutions of the present disclosure, but not to limit it. The protection scope of the present disclosure is not limited thereto, although The present disclosure has been described in detail with reference to the foregoing embodiments, and those of ordinary skill in the art should understand that any person skilled in the art within the technical scope disclosed in the present disclosure can still apply the technical solutions described in the foregoing embodiments. Modifications or changes can be easily conceived, or equivalent replacements of some of the technical features; and these modifications, changes or replacements do not cause the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions of the embodiments of the present disclosure, and should be covered in Within the protection scope of this disclosure. Therefore, the protection scope of the present disclosure should be subject to the protection scope of the claims.
工业实用性Industrial applicability
本公开实施例提供了一种室内视觉导航方法、装置、系统及电子设备,通过移动设备将采集到的待定位的室内图像上传至服务器,通过服务器确定移动设备采集室内图像时的相机位姿,并将室内图像对应的相机位姿下发给移动设备,然后移动设备基于室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系,并基于用户设定的目的地信息在预先导入的室内拓扑地图中规划最短路线,最后将移动设备采集的当前预览图像展示在界面上,并基于AR坐标系和最短路线将配置成指示路线行径方向的三维标识叠加在当前预览图像上,从而实现室内视觉导航。这样能够在室内以AR方式指导用户按照最短路线前往目的地,较好地提高了用户体验。The embodiments of the present disclosure provide an indoor visual navigation method, device, system, and electronic equipment. The collected indoor image to be positioned is uploaded to a server through the mobile device, and the server determines the camera pose when the mobile device collects the indoor image. And send the camera pose corresponding to the indoor image to the mobile device, and then the mobile device establishes an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image, and imports it in advance based on the destination information set by the user. The shortest route is planned in the indoor topology map, and finally the current preview image collected by the mobile device is displayed on the interface, and based on the AR coordinate system and the shortest route, a three-dimensional mark configured to indicate the direction of the route is superimposed on the current preview image to achieve indoor Visual navigation. In this way, the user can be guided to the destination according to the shortest route in the AR mode indoors, and the user experience is better improved.

Claims (18)

  1. 一种室内视觉导航方法,其中,所述方法由移动设备执行,所述方法包括:An indoor visual navigation method, wherein the method is executed by a mobile device, and the method includes:
    如果采集到待定位的室内图像,将所述室内图像上传至服务器,以使所述服务器确定所述移动设备采集所述室内图像时的相机位姿;If an indoor image to be located is collected, upload the indoor image to a server, so that the server can determine the camera pose when the mobile device collects the indoor image;
    接收所述服务器返回的所述室内图像对应的相机位姿,并基于所述室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系;Receiving the camera pose corresponding to the indoor image returned by the server, and establishing an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image;
    基于用户设定的目的地信息在预先导入的室内拓扑地图中规划最短路线;Plan the shortest route in the pre-imported indoor topology map based on the destination information set by the user;
    将所述移动设备采集的当前预览图像展示在所述移动设备的界面上,并基于所述AR坐标系和所述最短路线,将用于指示路线行径方向的三维标识叠加在所述当前预览图像上。Display the current preview image collected by the mobile device on the interface of the mobile device, and based on the AR coordinate system and the shortest route, superimpose a three-dimensional mark indicating the direction of the route on the current preview image superior.
  2. 根据权利要求1所述的方法,其中,所述基于所述室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系的步骤,包括:The method according to claim 1, wherein the step of establishing an AR coordinate system aligned with a world coordinate system based on the camera pose corresponding to the indoor image comprises:
    建立初始的AR坐标系;Establish the initial AR coordinate system;
    基于所述室内图像对应的相机位姿调整所述初始的AR坐标系,以使所述AR坐标系与世界坐标系对齐。The initial AR coordinate system is adjusted based on the camera pose corresponding to the indoor image, so that the AR coordinate system is aligned with the world coordinate system.
  3. 根据权利要求1或2所述的方法,其中,所述基于用户设定的目的地信息在预先导入的室内拓扑地图中规划最短路线的步骤,包括:The method according to claim 1 or 2, wherein the step of planning the shortest route in a pre-imported indoor topology map based on the destination information set by the user comprises:
    基于用户设定的目的地信息,利用路径规划算法在预先导入的室内拓扑地图中规划最短路线。Based on the destination information set by the user, the route planning algorithm is used to plan the shortest route in the pre-imported indoor topology map.
  4. 根据权利要求1-3任一项所述的方法,其中,所述将用于指示路线行径方向的三维标识叠加在所述当前预览图像上的步骤,包括:The method according to any one of claims 1 to 3, wherein the step of superimposing a three-dimensional mark for indicating the direction of the route on the current preview image comprises:
    检测所述当前预览图像上的地平面;Detecting the ground plane on the current preview image;
    确定所述最短路线在所述AR坐标系中的三维坐标,并基于确定的所述三维坐标生成用于指示路线行径方向的三维标识;Determine the three-dimensional coordinates of the shortest route in the AR coordinate system, and generate a three-dimensional identifier for indicating the direction of the route based on the determined three-dimensional coordinates;
    将所述三维标识绘制在所述当前预览图像上的地平面。Draw the three-dimensional mark on the ground plane of the current preview image.
  5. 根据权利要求1-4任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 1-4, wherein the method further comprises:
    在导航过程中如果接收到所述服务器下发的当前相机位姿,基于所述当前相机位姿对所述AR坐标系进行修正,以使修正后的所述AR坐标系与所述世界坐标系保持对齐。During the navigation process, if the current camera pose issued by the server is received, the AR coordinate system is corrected based on the current camera pose, so that the corrected AR coordinate system is the same as the world coordinate system. Keep it aligned.
  6. 一种室内视觉导航方法,其中,所述方法由服务器执行,所述方法包括:An indoor visual navigation method, wherein the method is executed by a server, and the method includes:
    如果接收到移动设备上传的待定位的室内图像,确定所述移动设备采集所述室内 图像时的相机位姿;If an indoor image to be located uploaded by a mobile device is received, determine the camera pose when the mobile device collects the indoor image;
    将所述室内图像对应的相机位姿下发给所述移动设备,以使所述移动设备基于所述室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系,并基于所述AR坐标系和最短路线将用于指示路线行径方向的三维标识叠加在所述移动设备采集的当前预览图像上;其中,所述最短路线是所述移动设备基于用户设定的目的地信息在室内拓扑地图中规划得到的。Send the camera pose corresponding to the indoor image to the mobile device, so that the mobile device establishes an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image, and based on the AR The coordinate system and the shortest route are superimposed on the current preview image collected by the mobile device with a three-dimensional mark indicating the direction of the route; wherein, the shortest route is the indoor topology of the mobile device based on the destination information set by the user. Planned in the map.
  7. 根据权利要求6所述的方法,其中,所述确定所述移动设备采集所述室内图像时的相机位姿的步骤,包括:The method according to claim 6, wherein the step of determining the camera pose when the mobile device collects the indoor image comprises:
    将所述室内图像与预先建立的视觉地图库中的视觉地图进行特征匹配,得到所述移动设备采集所述室内图像时的相机位姿;其中,所述视觉地图通过室内场景的稀疏点云模型表征。The indoor image is feature-matched with the visual map in the pre-established visual map library to obtain the camera pose when the mobile device collects the indoor image; wherein the visual map is based on the sparse point cloud model of the indoor scene Characterization.
  8. 根据权利要求7所述的方法,其中,所述将所述室内图像与预先建立的视觉地图库中的视觉地图进行特征匹配,得到所述移动设备采集所述室内图像时的相机位姿的步骤,包括:The method according to claim 7, wherein the step of matching the features of the indoor image with a visual map in a pre-established visual map library to obtain the camera pose when the mobile device collects the indoor image ,include:
    利用深度哈希算法计算所述室内图像的全局图像描述符;Calculating the global image descriptor of the indoor image by using a deep hash algorithm;
    在所述视觉地图库中查找与所述全局图像描述符相似的多个关键帧图像;Searching for multiple key frame images similar to the global image descriptor in the visual map library;
    获取每个所述关键帧图像的关键帧信息;Acquiring key frame information of each of the key frame images;
    依据所述关键帧信息将所述多个关键帧图像分为多个聚类;Dividing the plurality of key frame images into a plurality of clusters according to the key frame information;
    遍历每个所述聚类,获得所述室内图像的局部特征点;Traverse each of the clusters to obtain local feature points of the indoor image;
    计算局部描述子,并与所述聚类中的局部特征点匹配;Calculate local descriptors and match them with local feature points in the cluster;
    得到匹配成功的局部特征点对应的3D地图点;Obtain the 3D map points corresponding to the successfully matched local feature points;
    若所述3D地图点个数大于预设个数,则得到所述室内图像对应的相机位姿。If the number of 3D map points is greater than the preset number, the camera pose corresponding to the indoor image is obtained.
  9. 根据权利要求7或8所述的方法,其中,所述视觉地图库的建立过程包括:The method according to claim 7 or 8, wherein the process of establishing the visual map library comprises:
    获取移动设备在室内场景采集得到的多张场景图像;Acquire multiple scene images collected by the mobile device in the indoor scene;
    基于SFM算法对所述多张场景图像进行三维重建,得到包含所述多张场景图像对应的稀疏点云模型的视觉地图库。Performing three-dimensional reconstruction on the multiple scene images based on the SFM algorithm to obtain a visual map library containing sparse point cloud models corresponding to the multiple scene images.
  10. 根据权利要求7-9任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 7-9, wherein the method further comprises:
    将所述视觉地图与预先导入的室内平面分布图对齐。Align the visual map with the pre-imported indoor plan distribution map.
  11. 根据权利要求7-10任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 7-10, wherein the method further comprises:
    将所述视觉地图库中的视觉地图进行压缩。Compress the visual map in the visual map library.
  12. 根据权利要求11所述的方法,其中,所述将所述视觉地图库中的视觉地图进行压缩的步骤,包括:The method according to claim 11, wherein the step of compressing the visual map in the visual map library comprises:
    对所述视觉地图中的原始特征进行编码;Encoding the original features in the visual map;
    保存编码后的原始特征,并清除编码前的原始特征。Save the original features after encoding, and clear the original features before encoding.
  13. 根据权利要求6-12任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 6-12, wherein the method further comprises:
    定时获取所述移动设备在导航过程中采集的当前预览图像,并确定所述移动设备采集所述当前预览图像时的当前相机位姿;Periodically acquiring the current preview image collected by the mobile device during the navigation process, and determining the current camera pose when the mobile device collects the current preview image;
    将所述当前相机位姿下发给所述移动设备,以使所述移动设备基于所述当前相机位姿对所述AR坐标系进行修正。Send the current camera pose to the mobile device, so that the mobile device corrects the AR coordinate system based on the current camera pose.
  14. 一种室内视觉导航装置,其中,所述装置设置在移动设备侧,所述装置包括:An indoor visual navigation device, wherein the device is arranged on the side of a mobile device, and the device includes:
    图像上传模块,配置成如果采集到待定位的室内图像,将所述室内图像上传至服务器,以使所述服务器确定所述移动设备采集所述室内图像时的相机位姿;An image upload module configured to upload the indoor image to a server if the indoor image to be located is collected, so that the server determines the camera pose when the mobile device collects the indoor image;
    坐标系建立模块,配置成接收所述服务器返回的所述室内图像对应的相机位姿,并基于所述室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系;A coordinate system establishment module configured to receive the camera pose corresponding to the indoor image returned by the server, and establish an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image;
    路线规划模块,配置成基于用户设定的目的地信息在预先导入的室内拓扑地图中规划最短路线;The route planning module is configured to plan the shortest route in the pre-imported indoor topology map based on the destination information set by the user;
    导航展示模块,配置成将所述移动设备采集的当前预览图像展示在所述移动设备的界面上,并基于所述AR坐标系和所述最短路线,将用于指示路线行径方向的三维标识叠加在所述当前预览图像上。The navigation display module is configured to display the current preview image collected by the mobile device on the interface of the mobile device, and based on the AR coordinate system and the shortest route, superimpose a three-dimensional mark indicating the direction of the route On the current preview image.
  15. 一种室内视觉导航装置,其中,所述装置设置在服务器侧,所述装置包括:An indoor visual navigation device, wherein the device is arranged on the server side, and the device includes:
    位姿确定模块,配置成如果接收到移动设备上传的待定位的室内图像,确定所述移动设备采集所述室内图像时的相机位姿;The pose determination module is configured to determine the camera pose when the mobile device collects the indoor image if an indoor image to be positioned uploaded by the mobile device is received;
    设备导航模块,配置成将所述室内图像对应的相机位姿下发给所述移动设备,以使所述移动设备基于所述室内图像对应的相机位姿建立与世界坐标系对齐的AR坐标系,并基于所述AR坐标系和最短路线将用于指示路线行径方向的三维标识叠加在所述移动设备采集的当前预览图像上;其中,所述最短路线是所述移动设备根据用户设定的目的地信息在室内拓扑地图中规划得到的。A device navigation module configured to send the camera pose corresponding to the indoor image to the mobile device, so that the mobile device establishes an AR coordinate system aligned with the world coordinate system based on the camera pose corresponding to the indoor image , And based on the AR coordinate system and the shortest route, superimpose a three-dimensional mark indicating the direction of the route on the current preview image collected by the mobile device; wherein, the shortest route is set by the mobile device according to the user The destination information is planned in the indoor topology map.
  16. 一种室内视觉导航系统,其中,所述系统包括通信连接的移动设备和服务器;其中,所述移动设备配置成执行如权利要求1至5任一项所述的方法,所述服务器配置成执行如权利要求6至13任一项所述的方法。An indoor visual navigation system, wherein the system includes a mobile device and a server that are communicatively connected; wherein the mobile device is configured to execute the method according to any one of claims 1 to 5, and the server is configured to execute The method according to any one of claims 6 to 13.
  17. 一种电子设备,其中,包括:处理器和存储装置;An electronic device, including: a processor and a storage device;
    所述存储装置上存储有计算机程序,所述计算机程序在被所述处理器运行时执行如权利要求1至5任一项所述的方法,或如权利要求6至13任一项所述的方法。A computer program is stored on the storage device, and the computer program, when run by the processor, executes the method according to any one of claims 1 to 5, or the method according to any one of claims 6 to 13 method.
  18. 一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机程序, 其中,所述计算机程序被处理器运行时执行上述权利要求1至5任一项所述的方法的步骤或上述权利要求6至13任一项所述的方法的步骤。A computer-readable storage medium having a computer program stored on the computer-readable storage medium, wherein the computer program executes the steps of the method according to any one of claims 1 to 5 or the above when the computer program is run by a processor The steps of the method of any one of claims 6-13.
PCT/CN2020/119479 2020-04-14 2020-09-30 Indoor visual navigation method, apparatus, and system, and electronic device WO2021208372A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2022566506A JP2023509099A (en) 2020-04-14 2020-09-30 INDOOR VISUAL NAVIGATION METHOD, APPARATUS, SYSTEM AND ELECTRONIC DEVICE

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010292954.XA CN111627114A (en) 2020-04-14 2020-04-14 Indoor visual navigation method, device and system and electronic equipment
CN202010292954.X 2020-04-14

Publications (1)

Publication Number Publication Date
WO2021208372A1 true WO2021208372A1 (en) 2021-10-21

Family

ID=72273170

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/119479 WO2021208372A1 (en) 2020-04-14 2020-09-30 Indoor visual navigation method, apparatus, and system, and electronic device

Country Status (3)

Country Link
JP (1) JP2023509099A (en)
CN (1) CN111627114A (en)
WO (1) WO2021208372A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114071392A (en) * 2021-12-28 2022-02-18 智小途(上海)数字科技有限公司 UWB indoor high-precision three-dimensional live-action data construction method and system
CN114111803A (en) * 2022-01-26 2022-03-01 中国人民解放军战略支援部队航天工程大学 Visual navigation method of indoor satellite platform
CN115290110A (en) * 2022-08-25 2022-11-04 广东车卫士信息科技有限公司 AR navigation method, system and computer readable storage medium
CN115454055A (en) * 2022-08-22 2022-12-09 中国电子科技南湖研究院 Multilayer fusion map representation method for indoor autonomous navigation and operation
CN116313020A (en) * 2023-05-22 2023-06-23 合肥工业大学 Intelligent processing method and system for medical service
CN116402826A (en) * 2023-06-09 2023-07-07 深圳市天趣星空科技有限公司 Visual coordinate system correction method, device, equipment and storage medium
CN116858215A (en) * 2023-09-05 2023-10-10 武汉大学 AR navigation map generation method and device
CN117152245A (en) * 2023-01-31 2023-12-01 荣耀终端有限公司 Pose calculation method and device

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111627114A (en) * 2020-04-14 2020-09-04 北京迈格威科技有限公司 Indoor visual navigation method, device and system and electronic equipment
CN112543859B (en) * 2020-10-28 2022-07-15 华为技术有限公司 Positioning method, positioning device, electronic equipment and storage medium
CN112577492B (en) * 2020-12-15 2023-06-13 国科易讯(北京)科技有限公司 Path planning method and system
CN112729327A (en) * 2020-12-24 2021-04-30 浙江商汤科技开发有限公司 Navigation method, navigation device, computer equipment and storage medium
CN113063424B (en) * 2021-03-29 2023-03-24 湖南国科微电子股份有限公司 Method, device, equipment and storage medium for intra-market navigation
CN113240816B (en) * 2021-03-29 2022-01-25 泰瑞数创科技(北京)有限公司 AR and semantic model based city accurate navigation method and device
CN113159433B (en) * 2021-04-28 2022-02-22 中国科学院沈阳应用生态研究所 Dynamic navigation path searching method for integrated indoor mixed three-dimensional road network
CN113587928B (en) * 2021-07-28 2022-12-16 北京百度网讯科技有限公司 Navigation method, navigation device, electronic equipment, storage medium and computer program product
CN113865593A (en) * 2021-09-14 2021-12-31 山东新一代信息产业技术研究院有限公司 Indoor navigation method, equipment and medium
CN117308939A (en) * 2022-06-20 2023-12-29 中兴通讯股份有限公司 AR navigation method, terminal and storage medium
CN115578539B (en) * 2022-12-07 2023-09-19 深圳大学 Indoor space high-precision visual position positioning method, terminal and storage medium
CN117128959A (en) * 2023-04-18 2023-11-28 荣耀终端有限公司 Car searching navigation method, electronic equipment, server and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106791784A (en) * 2016-12-26 2017-05-31 深圳增强现实技术有限公司 Augmented reality display methods and device that a kind of actual situation overlaps
CN107782314A (en) * 2017-10-24 2018-03-09 张志奇 A kind of augmented reality indoor positioning air navigation aid based on barcode scanning
US20180373352A1 (en) * 2017-06-23 2018-12-27 Pixart Imaging Inc. Navagation device with fast frame rate upshift and operating method thereof
CN109272454A (en) * 2018-07-27 2019-01-25 阿里巴巴集团控股有限公司 A kind of the coordinate system calibration method and device of augmented reality equipment
CN111627114A (en) * 2020-04-14 2020-09-04 北京迈格威科技有限公司 Indoor visual navigation method, device and system and electronic equipment

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110288763A1 (en) * 2010-05-18 2011-11-24 Alpine Electronics, Inc. Method and apparatus for displaying three-dimensional route guidance
CN106679668B (en) * 2016-12-30 2018-08-03 百度在线网络技术(北京)有限公司 Air navigation aid and device
CN110019580B (en) * 2017-08-25 2022-07-12 腾讯科技(深圳)有限公司 Map display method, map display device, storage medium and terminal
CN110017841A (en) * 2019-05-13 2019-07-16 大有智能科技(嘉兴)有限公司 Vision positioning method and its air navigation aid
CN110260867B (en) * 2019-07-29 2021-06-15 浙江华睿科技有限公司 Method, equipment and device for determining and correcting neutral position in robot navigation

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106791784A (en) * 2016-12-26 2017-05-31 深圳增强现实技术有限公司 Augmented reality display methods and device that a kind of actual situation overlaps
US20180373352A1 (en) * 2017-06-23 2018-12-27 Pixart Imaging Inc. Navagation device with fast frame rate upshift and operating method thereof
CN107782314A (en) * 2017-10-24 2018-03-09 张志奇 A kind of augmented reality indoor positioning air navigation aid based on barcode scanning
CN109272454A (en) * 2018-07-27 2019-01-25 阿里巴巴集团控股有限公司 A kind of the coordinate system calibration method and device of augmented reality equipment
CN111627114A (en) * 2020-04-14 2020-09-04 北京迈格威科技有限公司 Indoor visual navigation method, device and system and electronic equipment

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114071392A (en) * 2021-12-28 2022-02-18 智小途(上海)数字科技有限公司 UWB indoor high-precision three-dimensional live-action data construction method and system
CN114071392B (en) * 2021-12-28 2023-07-25 智小途(上海)数字科技有限公司 UWB indoor high-precision three-dimensional live-action data construction method and system
CN114111803A (en) * 2022-01-26 2022-03-01 中国人民解放军战略支援部队航天工程大学 Visual navigation method of indoor satellite platform
CN115454055A (en) * 2022-08-22 2022-12-09 中国电子科技南湖研究院 Multilayer fusion map representation method for indoor autonomous navigation and operation
CN115454055B (en) * 2022-08-22 2023-09-19 中国电子科技南湖研究院 Multi-layer fusion map representation method for indoor autonomous navigation and operation
CN115290110A (en) * 2022-08-25 2022-11-04 广东车卫士信息科技有限公司 AR navigation method, system and computer readable storage medium
CN117152245A (en) * 2023-01-31 2023-12-01 荣耀终端有限公司 Pose calculation method and device
CN116313020A (en) * 2023-05-22 2023-06-23 合肥工业大学 Intelligent processing method and system for medical service
CN116313020B (en) * 2023-05-22 2023-08-18 合肥工业大学 Intelligent processing method and system for medical service
CN116402826A (en) * 2023-06-09 2023-07-07 深圳市天趣星空科技有限公司 Visual coordinate system correction method, device, equipment and storage medium
CN116402826B (en) * 2023-06-09 2023-09-26 深圳市天趣星空科技有限公司 Visual coordinate system correction method, device, equipment and storage medium
CN116858215A (en) * 2023-09-05 2023-10-10 武汉大学 AR navigation map generation method and device
CN116858215B (en) * 2023-09-05 2023-12-05 武汉大学 AR navigation map generation method and device

Also Published As

Publication number Publication date
CN111627114A (en) 2020-09-04
JP2023509099A (en) 2023-03-06

Similar Documents

Publication Publication Date Title
WO2021208372A1 (en) Indoor visual navigation method, apparatus, and system, and electronic device
WO2020259248A1 (en) Depth information-based pose determination method and device, medium, and electronic apparatus
US11494973B2 (en) Generating floor maps for buildings from automated analysis of visual data of the buildings' interiors
US9646406B2 (en) Position searching method and apparatus based on electronic map
CN110769240B (en) Photographing-based 3D modeling system and method, and automatic 3D modeling device and method
KR102194063B1 (en) User location determination based on augmented reality
US9558559B2 (en) Method and apparatus for determining camera location information and/or camera pose information according to a global coordinate system
US9699375B2 (en) Method and apparatus for determining camera location information and/or camera pose information according to a global coordinate system
US11557083B2 (en) Photography-based 3D modeling system and method, and automatic 3D modeling apparatus and method
Tanskanen et al. Live metric 3D reconstruction on mobile phones
US9224243B2 (en) Image enhancement using a multi-dimensional model
US10438412B2 (en) Techniques to facilitate accurate real and virtual object positioning in displayed scenes
TW201715476A (en) Navigation system based on augmented reality technique analyzes direction of users' moving by analyzing optical flow through the planar images captured by the image unit
TW201229962A (en) Augmenting image data based on related 3D point cloud data
US9036044B1 (en) Adjusting camera parameters associated with a plurality of images
US20240029384A1 (en) 3-D Reconstruction Using Augmented Reality Frameworks
WO2017133147A1 (en) Live-action map generation method, pushing method and device for same
WO2019109828A1 (en) Ar service processing method, device, server, mobile terminal, and storage medium
KR102277503B1 (en) Terminal devicem, service server and method for indoor positioning based on object detection
US20220067974A1 (en) Cloud-Based Camera Calibration
TWM560099U (en) Indoor precise navigation system using augmented reality technology
US20230251094A1 (en) Navigation video generation and acquisition methods and apparatuses, server, device, and medium
CN104501797A (en) Navigation method based on augmented reality IP map
WO2019089763A1 (en) Aligned augmented reality views
WO2023088127A1 (en) Indoor navigation method, server, apparatus and terminal

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20930976

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2022566506

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20930976

Country of ref document: EP

Kind code of ref document: A1