-
In the natural world, insects such as dragonflies, locusts, and bees exhibit exceptional visual capabilities because of their unique compound eyes. Unlike the single-aperture human eye, these biological systems are composed of numerous ommatidia, each functioning as an individual photoreceptive unit, that collectively offer a wide field-of-view (FoV), high temporal resolution, and excellent motion sensitivity1–3, as shown in Fig. 1a. These features allow insects to perceive and respond swiftly to dynamic changes in their surroundings, supporting effective navigation, predator avoidance, and prey capture. Such capabilities arise from cooperation between the compound-eye optical structure and the downstream neural processing, which together support wide-angle visual acquisition and sensitive motion perception. Inspired by these biological examples, researchers have sought to mimic compound-eye architectures in developing advanced imaging systems4–10. However, traditional bioinspired compound eyes typically require a curved arrangement of the lens array to increase the detection FoV, which introduces considerable fabrication challenges and large device volumes.
Recent advances in metalenses have opened feasible pathways towards planar bioinspired compound eyes with highly compact footprints. Metalenses are ultrathin imaging components composed of subwavelength nanostructures arranged to provide a specific focusing phase profile11–15. By implementing wavefront modulation within a single planar layer, they offer clear advantages in compactness and weight while maintaining high-efficiency, high-quality imaging with sub-micron structural thickness16–19. Compared with conventional miniaturised refractive imaging systems, which typically require multiple optical elements to balance the FoV and image quality, metalenses provide a flatter and lighter architecture that is well suited for highly integrated array-based imaging platforms. Recently, metalenses have been assembled into arrays to realise complex imaging functionalities in highly integrated systems. Notable implementations include meta-microscopy that surpasses the space-bandwidth product limit20,21, light-field cameras with greatly extended depth of field22, achromatic metalens-array-based light-field cameras23, and spectroscopic light-field cameras24. However, most of these metalens-array-based imaging devices fail to replicate the ultrawide-FoV characteristics of insect compound eyes. Moreover, they are predominantly designed for static scenes or stationary objects and exhibit limited capabilities for detecting dynamic or moving targets.
This study introduces a planar intelligent nanophotonic sensor (PINS) based on a metalens array capable of ultrawide-angle and precise motion perception. Although the metalenses are arranged on a flat substrate, their carefully engineered phase profiles enable each individual metalens to image a distinct angular direction25, thereby emulating the wide-FoV functionality of insect compound eyes. This design enables the PINS to achieve horizontal imaging coverage exceeding 135°. This study further integrated a deep neural network, called meta-motion sense (MMS), to accurately extract optical flow from sequential wide-FoV images captured by the PINS, in a manner analogous to the downstream neural processing that transforms visual inputs from insect compound eyes into motion-related information. This allows for a comprehensive characterisation of the motion dynamics in a scene, including the velocity and direction of multiple moving targets26–32. To efficiently train the MMS neural network to process the specialised metalens imaging data, a convolution imaging model was developed using point spread functions (PSFs). This imaging model enables the rapid and high-quality generation of large-scale training datasets. Benefiting from this targeted training strategy, the PINS achieved high-accuracy motion detection even under challenging conditions, including ultra-small or ultra-slow-moving objects and dynamic targets embedded in complex backgrounds, scenarios where conventional machine vision systems often fail. By thoroughly analysing the extracted optical flow information, the proposed method enables intelligent and accurate prediction of the future trajectories of moving objects. This predictive capability holds significant potential for future applications in autonomous driving, motion risk assessment, and other dynamic scene-understanding tasks. Furthermore, the proposed PINS is lightweight and readily deployable on mobile platforms such as miniature unmanned aerial vehicles (MAVs), wearable electronics, and other portable devices, thereby significantly increasing its adaptability and expanding its applicability across diverse scenarios.
-
The PINS is a highly compact, manually assembled imaging device. In this study, the PINS was integrated into a miniature unmanned aerial vehicle (MAV), as shown in Fig. 1b, enabling agile and flexible environmental sensing akin to insect behaviour. Details of the integration are provided in Supplementary Note S1. The architecture of the PINS is illustrated in Fig. 1c. Its core component is a wide-angle metalens array that mimics the compound eyes of insects, facilitating image acquisition over a broad FoV. The underlying operating principle is elaborated later in this section. The images captured by the metalens array are projected onto a CMOS sensor (OV2640) interfaced with a microcontroller unit (MCU, ESP32-CAM) via a high-speed data interface, a combination that functionally mimics the insect retina and nervous system. The OV2640 provides a maximum native resolution of 1,600 × 1,200 at 15 fps, whereas the final reconstructed wide-angle image in the present configuration has an effective resolution of approximately 480 × 1,440. This choice reflects a trade-off between spatial and temporal precision, balancing image detail preservation with a sufficient frame rate for reliable motion analysis.
Fig. 1 Architecture and working principle of the proposed planar intelligent nanophotonic sensor (PINS). a Conceptual illustration of a bee and its compound eye structure consisting of numerous ommatidia arranged on a curved surface to achieve a wide field-of-view (FoV). b Illustration of a miniature unmanned aerial vehicle (MAV) equipped with PINS, inspired by the compound-eye structure. c Schematic of PINS architecture. The core system integrates a metalens array, 550 nm optical filter, stray light-blocking mask, CMOS image sensor, and microcontroller unit, all enclosed within a compact 3D-printed modular housing. d Illustration of the imaging process using a wide-angle metalens array. e Phase profiles of the three metalenses in the metalens array. f Top and side views of the scanning electron microscopy (SEM) images of the fabricated nanostructures. g Schematic of wide-angle detection using the PINS-integrated MAV. h Process of motion detection and trajectory prediction using PINS.
To improve the imaging quality, a stray light-blocking mask and a bandpass optical filter centred at 550 nm were integrated into the PINS to suppress ambient broadband illumination and out-of-band environmental interference, thereby enhancing the image contrast and signal-to-noise ratio under practical operating conditions. The mechanical components, including the base mount and front cover, were fabricated by 3D printing using polylactic acid (PLA). The CMOS sensor was fixed at the centre of the base mount using an instant adhesive, and the mount was attached to the top front side of the MCU using the same adhesive. The optical components, including the metalens array, optical filter, and stray light mask, were bonded using an optically clear adhesive (OCA, Tesa 69042), and the entire assembly was embedded into the inner surface of the front cover. To precisely adjust the spacing between the metalens array and the CMOS sensor for optimal imaging performance, two M2 screws were used to fasten the front cover to the base mount. The sensor–metalens distance can be finely tuned by varying the insertion depth of the two screws. This adjustability is particularly important for metalens arrays, as it allows the image distance to be adapted to targets at varying object distances, thereby significantly enhancing the practical applicability of the integrated sensor.
The key component, a wide-angle metalens array, comprises three carefully phase-engineered metalenses, each with a diameter of 1 mm and a focal length of 1.6 mm. Each metalens captures a distinct 45° angular range, and together the three metalenses enable imaging across a total horizontal FoV exceeding 135°, as illustrated in Fig. 1d. The phase profile of the metalens responsible for the central FoV was optimised using a ray-tracing method (see Methods and Supplementary Note S2 for details). In contrast, the phase profiles of the metalenses for the two side FoVs were derived by superimposing an additional tilt phase of ±kd sin 45° on the central-FoV metalens phase profile, where k is the wave number and d is the diameter of the metalens. The phase profiles of the three metalenses are shown in Fig. 1e, clearly revealing the tilted phases. To achieve high-efficiency light modulation, the metalens array employed subwavelength-scale cylindrical Si3N4 nanostructures with a height of 1 μm, which manipulate the wavefront based on propagation phase principles33–36. The detailed relationship between the modulation phase and the nanostructure parameters, along with the fabrication process, is provided in the Methods section. Top and side views of scanning electron microscopy (SEM) images of the fabricated nanostructures are shown in Fig. 1f, demonstrating structural uniformity and fabrication precision. As shown in Fig. 1g, the fully assembled PINS system can capture wide-angle scenes and sequentially record each frame, encompassing both the static environment and dynamic moving objects. Based on the extracted optical flow, the motion states of the objects and their predicted trajectories are subsequently derived using a motion trajectory prediction (MTP) framework, as illustrated in Fig. 1h.
-
To enable the PINS to efficiently extract optical flow information from wide-angle scenes, a large and representative dataset is required to thoroughly train the MMS neural network. Each training sample must consist of a pair of consecutive image frames acquired by the metalens-based system along with the corresponding ground-truth optical flow maps. However, no existing public dataset satisfies these requirements, necessitating the creation of a custom dataset. A straightforward approach involves adapting established optical flow datasets, such as FlyingChairs37, Sintel38, KITTI39, and HD1K40, by replacing the original scenes with images captured using the PINS system. The experimental setup for data acquisition is shown in Fig. 2a. Specifically, images from the FlyingChairs dataset were displayed on a high-definition (HD) screen, and the PINS was positioned in front of the screen to capture the projected dataset. The captured image sequences were wirelessly transmitted to a computer for storage and processing.
First, the proposed setup was used to validate the wide-angle imaging capability of the PINS. A wide-field letter pattern (“SEU Radio & PML”) was projected onto the HD screen and imaged by both the PINS and a conventional hyperbolic-phase metalens. For the PINS, the subimages captured by the three metalenses were individually corrected for distortion and stitched into a composite image, achieving a combined FoV of 135°, as shown in the first row of Fig. 2b. The detailed wide-angle image reconstruction pipeline is described in Supplementary Note S3. This distortion correction and stitching pipeline is computationally efficient, with an average processing time of 32.19 ms per frame. In contrast, the image formed by the hyperbolic-phase metalens covered less than one-third of this FoV, as illustrated in the second row of Fig. 2b.
However, constructing an optical-flow training dataset directly from metalens-captured images poses several challenges. First, collecting a large volume of real-world image sequences and manually annotating the corresponding optical flow are both time-consuming and labour-intensive. Second, the inherent distortions introduced by the metalens system, although partially mitigated through calibration, cannot be entirely eliminated. Residual discrepancies between the captured images and their ideal references hinder the generation of accurate ground-truth optical flows. To address these issues, an imaging model that convolves the original images with an experimentally measured PSF was adopted to generate extensive training datasets. The PSF effectively characterises the imaging response of the PINS, capturing critical aspects such as intensity, quality degradation, and distortion41. The imaging model can be expressed as
$$ I_{\text{meta}}=\Gamma(O \otimes \mathrm{PSF}+N) $$ (1)
where $ I_{\text{meta}} $ denotes the simulated metalens image, O denotes the original input image, $ \otimes $ denotes the convolution operator, N denotes the sensor noise, and Γ(∙) denotes the gamma correction function used to simulate brightness attenuation. Additional details of the sensor noise model and gamma transformation are provided in Supplementary Notes S4 and S5, respectively. The experimental setup used to obtain the PSF was the same as that employed for direct imaging, as shown in Fig. 2c. A point with a diameter spanning 10 pixels was projected onto the HD screen and imaged using the PINS; the resulting image served as the PSF of the imaging model. Because of the narrow bandpass filter in the optical path, the captured images were effectively monochromatic, making wavelength-dependent PSF variations negligible. Similarly, because all experimental scenarios were located within the depth of field of the system, distance-dependent PSF variations could also be neglected under the present operating conditions. A detailed depth-of-field analysis of the PINS system is provided in Supplementary Note S6.
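As a concrete illustration of Eq. (1), the following Python sketch applies a measured PSF, additive sensor noise, and gamma-based brightness attenuation to an original scene image. The PSF normalisation, Gaussian noise level, and gamma exponent used here are illustrative assumptions; the calibrated noise model and gamma transformation are those described in Supplementary Notes S4 and S5.

```python
import numpy as np
from scipy.signal import fftconvolve

def simulate_metalens_image(original, psf, gamma=1.5, noise_std=0.01, rng=None):
    """Minimal sketch of the Eq. (1) imaging model: I = Gamma(O (*) PSF + N).

    `original` and `psf` are 2-D float arrays scaled to [0, 1]; the gamma
    exponent and noise level are placeholders, not the calibrated values.
    """
    rng = np.random.default_rng() if rng is None else rng
    psf = psf / psf.sum()                                        # normalise PSF energy
    blurred = fftconvolve(original, psf, mode="same")            # O convolved with PSF
    noisy = blurred + rng.normal(0.0, noise_std, blurred.shape)  # additive sensor noise N
    noisy = np.clip(noisy, 0.0, 1.0)
    return noisy ** gamma                                        # gamma-based brightness attenuation
```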
Fig. 2 Generation of extensive high-quality training data. a Experimental setup for direct image acquisition using the PINS, with Wi-Fi transmission to a PC for visualisation and storage. b Comparison of the imaging results from the wide-angle metalens array (first row) and a conventional hyperbolic-phase metalens (second row). c Experimental setup for acquiring the point spread function (PSF). d Flowchart illustrating the construction process of the Meta dataset. e Detailed comparison between images generated by the imaging model (first row) and those directly captured by the PINS (second row). Enlarged regions are highlighted for detailed comparison. Abbreviations: HD: high-definition; PC: personal computer; OF: optical flow; and PSF: point spread function.
Based on the collected PSF and the imaging model described in Eq. 1, a large amount of simulated metalens imaging data that closely approximated the real experimental captures was efficiently generated. The overall workflow for constructing this specialised optical-flow dataset for the PINS is illustrated in Fig. 2d. The original dataset consisted of paired scene images and their corresponding optical flow ground truths. During the dataset synthesis, the optical flow ground truths were retained, whereas the paired scene images were sequentially processed using the imaging model, which included PSF convolution, noise addition, and gamma-based brightness attenuation. Because the imaging model preserves the spatial consistency of the object positions, the original optical flow labels remain valid and can be reused directly without additional annotation. The resulting dataset not only satisfied the scale requirements for deep network training but also captured the distinct statistical features inherent to metalens imaging. This physics-consistent simulation process further helps to mitigate the risk of hallucinated features during network training.
To assess the fidelity of the imaging-model-generated dataset, three representative scenes captured directly by the PINS were compared with their imaging-model-generated counterparts, as shown in Fig. 2e. The simulated results exhibited strong visual agreement with the experimental images in terms of brightness, blur level, and overall appearance. Two representative regions from each image were selected and magnified for further inspection. The simulated images showed excellent consistency with the real captures, with no noticeable differences in the texture edges or contrast levels. Quantitative similarity metrics for the corresponding image pairs in Fig. 2e and a comparison between the experimentally captured and PSF-model-simulated wide-FoV images are provided in Supplementary Note S7.
-
By leveraging the datasets obtained from the PSF-convolution imaging model, a neural network for motion detection is proposed, referred to as MMS. As shown in Fig. 3a, the MMS computes interframe correlations to guide the iterative refinement of a zero-initialised optical flow, progressively updating it to yield an accurate final optical flow estimate. Before the iterative refinement begins, a four-dimensional (4D) correlation volume pyramid is precomputed to encode motion cues across multiple spatial scales. This pyramid is constructed by first extracting feature maps F1 and F2 from two consecutive input frames I1 and I2 using a dual-stage multi-scale composite encoder (DSMSCE). The correlation volume C is then calculated by computing the dot-product similarity between all spatial locations in the two feature maps. To enable multiscale motion estimation, average pooling operations with kernel sizes of one, two, four, and eight are applied to the last two dimensions of C, forming a 4D correlation volume pyramid denoted as {C1,C2,C3,C4}. Each pyramid level captures displacement information at a different spatial scale, enabling the network to robustly match features across a wide range of motion magnitudes. However, it is important to note that although the network possesses strong feature extraction capabilities, optical phase optimisation to reduce image distortion remains crucial. This is a prerequisite for network performance because the quality of the input image influences the network's ability to extract meaningful features and make accurate predictions. An additional analysis of the effects of imaging noise and distortion on optical flow estimation is provided in Supplementary Note S8.
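The construction of the correlation volume pyramid can be summarised by the following PyTorch sketch, which computes dot-product similarity between the two feature maps and then builds four pooled levels. The tensor shapes and the √C normalisation follow a common RAFT-style formulation and are assumptions; the exact DSMSCE feature dimensions are given in Supplementary Notes S9 and S10.

```python
import torch
import torch.nn.functional as F

def correlation_pyramid(fmap1, fmap2, num_levels=4):
    """Sketch of the 4D correlation volume pyramid from two feature maps of
    shape (B, C, H, W); repeated 2x pooling yields effective kernels 1/2/4/8."""
    B, C, H, W = fmap1.shape
    f1 = fmap1.view(B, C, H * W)                        # (B, C, HW)
    f2 = fmap2.view(B, C, H * W)
    corr = torch.einsum('bci,bcj->bij', f1, f2)         # dot-product similarity, (B, HW, HW)
    corr = corr.view(B * H * W, 1, H, W) / C ** 0.5     # one H x W map per source pixel
    pyramid = [corr]                                    # level C1
    for _ in range(num_levels - 1):                     # levels C2, C3, C4
        corr = F.avg_pool2d(corr, kernel_size=2, stride=2)
        pyramid.append(corr)
    return pyramid
```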
Next, the optical flow is initialised from the starting point f0 = 0 and then iteratively refined through a recurrent update mechanism. At each iteration, the current estimated flow $ {\boldsymbol f}_{k} $ is used to sample multiscale correlation features from the precomputed correlation volume pyramid. These features, together with the current flow and latent hidden state, are passed to an update operator based on a convolutional gated recurrent unit (ConvGRU). The module outputs a flow increment $ \Delta \boldsymbol{f} $ and an updated hidden state, which are used to refine the flow estimate. This iterative refinement process enables the network to progressively improve the accuracy of the optical flow estimation. Once the final low-resolution optical flow is obtained, a learnable convex upsampler is used to restore it to the full input resolution. See Supplementary Note S9 for further architectural details of the MMS.
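A minimal ConvGRU update cell of the kind described above can be sketched as follows; the channel sizes, kernel sizes, and the single-convolution flow head are illustrative assumptions rather than the exact MMS configuration (see Supplementary Note S9).

```python
import torch
import torch.nn as nn

class ConvGRUCell(nn.Module):
    """Sketch of the recurrent update operator: gated hidden-state update
    followed by a small head that predicts the flow increment Δf."""
    def __init__(self, hidden_dim=96, input_dim=128):
        super().__init__()
        self.convz = nn.Conv2d(hidden_dim + input_dim, hidden_dim, 3, padding=1)
        self.convr = nn.Conv2d(hidden_dim + input_dim, hidden_dim, 3, padding=1)
        self.convq = nn.Conv2d(hidden_dim + input_dim, hidden_dim, 3, padding=1)
        self.flow_head = nn.Conv2d(hidden_dim, 2, 3, padding=1)

    def forward(self, h, x):
        hx = torch.cat([h, x], dim=1)
        z = torch.sigmoid(self.convz(hx))                         # update gate
        r = torch.sigmoid(self.convr(hx))                         # reset gate
        q = torch.tanh(self.convq(torch.cat([r * h, x], dim=1)))  # candidate state
        h = (1 - z) * h + z * q                                   # new hidden state
        return h, self.flow_head(h)                               # hidden state and Δf
```

In use, the flow starts from zero and accumulates the increments over a fixed number of iterations (`flow = flow + delta_f`), after which the learnable convex upsampler restores the full input resolution.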
The detailed architecture of the DSMSCE is illustrated in Fig. 3b. Its dual-stage, multiscale parallel encoding and feature fusion design enables the network to capture fine-grained details from shallow layers while simultaneously aggregating global contextual information from deeper layers. The architecture integrates three key components: a shallow spatial pyramid pooling (SSPP) block, a basic encoder block, and a deep atrous spatial pyramid pooling (DASPP) module. First, the SSPP module operates on shallow features and captures spatial statistics at the global, mid-level, and local scales in parallel. Second, the basic encoder block, comprising six sequential residual blocks, is organised into three hierarchical stages. The first stage (Layer 1) maintains 64 channels and preserves the spatial resolution with a stride of 1, focusing on the fine-grained enhancement of the input features. In the second stage (Layer 2), the number of channels is increased from 64 to 96 through the first residual block, which is accompanied by spatial downsampling with a stride of 2; a residual block with 96 channels is then used to further refine the extracted features. The third stage (Layer 3) adopts a similar structure, increasing the channel depth to 128, which facilitates a deeper feature representation and a larger receptive field. Third, the DASPP module is introduced to further enhance the capacity of the encoder to capture large displacements and complex structures. This module extracts features with varying receptive fields in parallel from deep feature maps, while also incorporating global context information. Consequently, it significantly improves the joint modelling of local structural details and global semantic context. Additional design details of the DSMSCE module are provided in Supplementary Note S10.
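The three-stage basic encoder described above can be summarised by the following PyTorch sketch. The plain residual block used here (two 3×3 convolutions with an identity or 1×1 projection skip, and no normalisation layers) and the 64-channel stem output are assumptions for illustration; only the channel widths and strides follow the text.

```python
import torch.nn as nn

class ResBlock(nn.Module):
    """Plain residual block (normalisation layers omitted for brevity)."""
    def __init__(self, in_ch, out_ch, stride=1):
        super().__init__()
        self.conv1 = nn.Conv2d(in_ch, out_ch, 3, stride=stride, padding=1)
        self.conv2 = nn.Conv2d(out_ch, out_ch, 3, padding=1)
        self.relu = nn.ReLU(inplace=True)
        self.skip = (nn.Identity() if stride == 1 and in_ch == out_ch
                     else nn.Conv2d(in_ch, out_ch, 1, stride=stride))

    def forward(self, x):
        y = self.relu(self.conv1(x))
        y = self.conv2(y)
        return self.relu(y + self.skip(x))

# Six residual blocks in three hierarchical stages, assuming 64-channel input
# features from the stem: Layer 1 keeps 64 channels at stride 1, Layer 2
# expands 64 -> 96 with stride-2 downsampling, Layer 3 expands to 128.
basic_encoder = nn.Sequential(
    ResBlock(64, 64, stride=1),  ResBlock(64, 64, stride=1),    # Layer 1
    ResBlock(64, 96, stride=2),  ResBlock(96, 96, stride=1),    # Layer 2
    ResBlock(96, 128, stride=2), ResBlock(128, 128, stride=1),  # Layer 3
)
```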
Fig. 3 Architecture of a meta-motion sense (MMS) neural network. a Overall framework of MMS. The process indicated by the dashed lines is executed only in the initial iteration. b Detailed architecture of a dual-stage multi-scale composite encoder (DSMSCE).
A dataset comprising approximately 25,200 image pairs was constructed using a PSF-convolution-based imaging model. Among these, 24,000 pairs were allocated for training, while the remaining 1,200 pairs were reserved for validation and testing. The optical-flow estimation was supervised using a full-sequence L1 loss function (see Supplementary Note S11 for details). The model was implemented using PyTorch and trained on an NVIDIA RTX 4090 GPU for approximately 60 h. Furthermore, the runtime of the motion detection was evaluated using GPU-synchronised timing. On the same GPU, the network achieved an average inference time of 110.15 ms per image pair, corresponding to approximately nine image pairs per second.
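The full-sequence L1 supervision can be written compactly as below; the exponentially increasing weights over iterations are a common choice for recurrent flow networks and are assumed here, with the exact formulation given in Supplementary Note S11.

```python
import torch

def sequence_l1_loss(flow_preds, flow_gt, gamma=0.8):
    """Full-sequence L1 loss over all intermediate flow estimates; later
    iterations receive larger weights (the gamma decay is an assumed choice)."""
    n = len(flow_preds)
    loss = flow_gt.new_zeros(())
    for i, pred in enumerate(flow_preds):
        weight = gamma ** (n - i - 1)              # weight grows towards the final iteration
        loss = loss + weight * (pred - flow_gt).abs().mean()
    return loss
```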
-
Following extensive training, the MMS neural network empowered the PINS with high-precision motion perception in ultrawide-FoV scenes. To evaluate the sensing performance systematically, a series of wide-field scenes containing diverse moving targets was constructed and captured using the setup shown in Fig. 2a. Two approaches were then compared for dynamic object detection: an established machine-vision-based object recognition algorithm (the YOLO neural network42–44) and the proposed MMS-based motion perception framework. Although YOLO was primarily designed for static object classification, it was augmented with localisation capabilities, enabling it to identify the spatial coordinates of the detected targets. By comparing these positions across successive frames, the motion vectors of the targets could be inferred. Fig. 4a-c illustrate the target detection results obtained using the enhanced YOLO network.
Fig. 4 Comparison of motion detection performance between a conventional computer vision approach and the proposed meta-motion sense (MMS) neural network. a Detection results for two cars of different sizes using the computer vision method, in which the large car is successfully identified while the small car is missed. b Detection results for the cars after a significant forward displacement using the computer vision method, in which the motion of the large car can be detected. c Detection results for vehicles with small forward displacements, showing that the computer vision method fails to reliably detect subtle motion. d Colour wheel used for optical flow visualisation. The angle represents flow direction (encoded by hue), and the radius represents normalised flow magnitude (encoded by saturation). e, f Optical flow results obtained by applying the MMS neural network to the images in b and c, respectively, using Fig. 4a as the reference frame. g Multi-object detection results in a wide-angle scene with a complex background using the computer vision method, showing misidentifications (highlighted in red) and missed detections (highlighted in yellow). h Optical flow results for the same wide-angle scene with a complex background, in which all moving objects are effectively detected, demonstrating the robustness of the MMS neural network in motion detection.
Fig. 4a presents the detection results for two moving targets of different sizes in a wide-angle scene with a simple background. The category and position of the large vehicle were accurately identified by the YOLO algorithm, whereas the small vehicle was not recognised owing to imaging blur and noise. Fig. 4b shows the detection results for the second frame using the YOLO network, in which both the large and small vehicles underwent obvious displacements. Although the large vehicle continued to be reliably detected and localised, the small vehicle was still missed. The motion of the large vehicle could be inferred from the change in its position across the frames. However, when the large vehicle underwent only a small displacement, as shown in Fig. 4c, the YOLO network failed to capture the subtle motion, with the detected position remaining unchanged from that in Fig. 4a. These results indicate that the established YOLO algorithm struggles to detect small or slow-moving objects.
In contrast, the proposed MMS neural network, after comprehensive training, enabled reliable identification of targets even under challenging conditions such as low visibility, small object scale, or minimal movement. Fig. 4e displays the optical flow output generated by feeding the two frames shown in Fig. 4a, b into the MMS neural network. Notably, both the large and small moving vehicles were successfully detected. The motion characteristics of each target can be interpreted by referencing the optical flow colour encoding, as indicated by the colour wheel in Fig. 4d. Each hue (angular position) corresponds to a specific motion direction, whereas the saturation (radial position) represents the motion magnitude, enabling an intuitive visualisation of both the direction and speed of object movement. Fig. 4f presents the optical flow results for the subtle-motion scenario depicted in Fig. 4c. Despite the minimal displacement, the proposed method accurately captured the movement of both small and slow-moving targets where the YOLO method failed.
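The colour-wheel encoding in Fig. 4d corresponds to a standard HSV mapping of the flow field, sketched below; the exact normalisation used for the figures may differ, so the scaling here is an assumption.

```python
import numpy as np
from matplotlib.colors import hsv_to_rgb

def flow_to_colour(flow):
    """Render a flow field of shape (H, W, 2) with the colour-wheel
    convention of Fig. 4d: hue encodes direction, saturation encodes
    normalised magnitude (a generic visualisation sketch)."""
    u, v = flow[..., 0], flow[..., 1]
    magnitude = np.hypot(u, v)
    angle = np.arctan2(v, u)                          # direction in radians
    hue = (angle + np.pi) / (2 * np.pi)               # map [-pi, pi] -> [0, 1]
    sat = magnitude / (magnitude.max() + 1e-8)        # normalised magnitude
    val = np.ones_like(hue)
    return hsv_to_rgb(np.stack([hue, sat, val], axis=-1))
```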
Furthermore, a comparative evaluation of the traditional YOLO algorithm and the proposed MMS neural network was performed in a wide-angle scene featuring multiple moving objects and a complex background, with the aim of comprehensively assessing the motion detection performance. As shown in Fig. 4g, the presence of intricate background textures significantly degraded the performance of YOLO-based object detection, which relied heavily on the boundary shape features for classification. Several representative recognition errors including misidentifications and missed detections are shown in the figure. Misidentified targets are labelled with red indices (1 and 2) and are further illustrated in enlarged views at the bottom, highlighted by red dashed boxes. Missed detections are marked with yellow indices (3, 4, and 5) and are shown in the corresponding magnified regions outlined in the yellow dashed boxes. In contrast, the proposed MMS-based approach robustly and accurately captured the motion of all moving objects, even in visually cluttered environments, as illustrated in Fig. 4h. Overall, the proposed optical flow estimation method demonstrated highly accurate and robust motion detection capabilities, particularly in scenarios involving small-scale, slow-moving objects and complex background conditions, effectively compensating for the shortcomings of conventional intelligent object recognition algorithms and providing a more stable motion detection performance. To further probe the motion detection limits of the PINS, test scenarios involving extremely small displacements and miniature moving targets were designed, and the resulting optical flow outputs were systematically analysed. Details are provided in Supplementary Note S12.
-
Motion trajectory prediction plays a crucial role in enabling intelligent systems to understand and anticipate future movements of dynamic objects. By forecasting how targets move over time, proactive decision-making can be enabled in a variety of applications such as collision avoidance in autonomous driving, motion planning in robotics, and situational awareness in surveillance. Accurate trajectory prediction enhances system responsiveness, safety, and efficiency, particularly in complex and rapidly changing environments. An MTP framework that leverages optical flow information was developed to forecast future positions of moving objects. Given a sequence of optical flow maps as the input, the MTP framework produces a predicted object trajectory over a future horizon.
Fig. 5a illustrates the MTP framework for estimating the future state of a single object from a sequence of optical flow maps, where N denotes the current frame. Specifically, the framework integrates four key components: region of interest (ROI) extraction, current centroid position estimation, velocity preprocessing and fitting, and future position estimation using trajectory extrapolation, thereby enabling robust and accurate motion forecasting. First, ROI extraction was performed by identifying the dominant moving region in the optical flow map using magnitude thresholding and connected-component analysis. Second, the centroid of the extracted region was estimated as the position of the current object. Third, the velocity within the ROI was computed in terms of magnitude and direction, followed by Kalman filtering to suppress noise45, and then modelled over a recent temporal window using weighted polynomial regression. Finally, the fitted velocity model was extrapolated from frame N+1 to N+K, and the predicted trajectory was obtained by integrating the extrapolated velocity vectors, yielding the forecasted object position (Output I of Fig. 5a) and motion path (Output II in Fig. 5a). The detailed processing procedure and mathematical formulations are provided in Supplementary Note S13. Supplementary Note S14 presents an analysis of the impact of the polynomial fitting order and length L of the recent window on trajectory prediction performance. Because of its lightweight computational architecture, the proposed MTP framework enables rapid motion prediction with an average prediction time of 3.21 ms per frame for a single target.
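The single-object MTP pipeline can be illustrated with the following sketch, which performs magnitude thresholding, connected-component ROI extraction, centroid and mean-velocity estimation, polynomial fitting of recent velocities, and trajectory extrapolation. The threshold value, the unweighted polynomial fit, and the omission of the Kalman-filtering step are simplifying assumptions; the full formulation is given in Supplementary Note S13.

```python
import numpy as np
from scipy import ndimage

def extract_roi_state(flow, mag_thresh=1.0):
    """ROI extraction from one optical-flow map (H, W, 2): threshold the
    magnitude, keep the largest connected region, and return its centroid
    (x, y) and mean velocity (u, v)."""
    mag = np.hypot(flow[..., 0], flow[..., 1])
    moving = mag > mag_thresh
    labels, n = ndimage.label(moving)                      # connected-component analysis
    if n == 0:
        return None, None
    sizes = ndimage.sum(moving, labels, range(1, n + 1))
    mask = labels == (1 + int(np.argmax(sizes)))           # dominant moving region
    cy, cx = ndimage.center_of_mass(mask)
    centroid = np.array([cx, cy])                          # (x, y) in pixels
    velocity = flow[mask].mean(axis=0)                     # mean (u, v) inside the ROI
    return centroid, velocity

def predict_positions(centroid, recent_velocities, k_future=15, order=2):
    """Fit the recent per-frame velocities with a polynomial (a stand-in for
    the weighted regression after Kalman filtering) and integrate the
    extrapolated velocity to forecast the next k_future positions."""
    vels = np.asarray(recent_velocities)                   # shape (L, 2)
    t = np.arange(len(vels))
    coeff_u = np.polyfit(t, vels[:, 0], order)
    coeff_v = np.polyfit(t, vels[:, 1], order)
    pos, preds = np.asarray(centroid, dtype=float), []
    for step in range(1, k_future + 1):
        tf = t[-1] + step
        pos = pos + np.array([np.polyval(coeff_u, tf), np.polyval(coeff_v, tf)])
        preds.append(pos.copy())
    return preds
```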
Fig. 5 Single-object trajectory prediction using the motion trajectory prediction (MTP) framework. a Processing flowchart of single-object trajectory prediction using the MTP framework. b-d Predicted trajectories for different motion patterns: sinusoidal, planar spiral, and electron trajectory. Red circles indicate predicted positions at previous frames. The five-pointed star denotes the predicted position inferred from the current frame. The frame numbers corresponding to selected representative predicted positions are labelled nearby. Abbreviations: KF: Kalman filtering; and PF: polynomial fitting.
To evaluate the trajectory prediction performance of the MTP framework, results for three representative motion patterns (sinusoidal, planar spiral, and electron trajectories) are shown in Fig. 5b-d. For each case, predictions are shown at 1, 15, and 30 frames into the future, corresponding to time horizons of 0.03, 0.5, and 1 s, respectively (at a frame rate of 30 fps). The red circles indicate the predicted positions at previous frames. The five-pointed star denotes the position predicted from the current frame. The solid black line represents the centroid trajectory. When the prediction horizon was as short as one frame, the predicted trajectory remained highly consistent with the actual centroid trajectory across all motion types. When the horizon was extended to 30 frames, the deviations between the predicted and actual positions became more pronounced. Additionally, in the early stages of tracking, the prediction accuracy may be affected by the limited number of historical frames available for polynomial fitting. Nevertheless, as more data accumulate over time, the stability and reliability of the predictions gradually improve. The complete results of the single-object trajectory prediction are provided in Supplementary Movie S1.
Fig. 6 presents the processing flowchart and results of multi-object tracking and trajectory prediction. Building on the single-object scenario, multi-object trajectory prediction requires a more comprehensive tracking strategy, as shown in Fig. 6a. The initial steps, including thresholding the optical flow magnitude to suppress background noise and applying connected-component segmentation, are consistent with the single-object case shown in Fig. 5a. However, a key difference is that in the multi-object setting, all sufficiently significant connected regions are retained instead of selecting only the largest one. In addition, to address challenges specific to multi-object scenarios, such as the spatial overlap of objects exhibiting different motion directions (as shown in Region 1 in Fig. 6a), each connected region was further subdivided through K-means clustering in the angular domain. This enabled the separation of overlapping motion patterns and ensured the reliable identification of individual targets. Each detected object was then assigned a unique identifier (ID), starting from 0 and incremented sequentially in order of appearance. Each ID was subsequently maintained across frames by leveraging motion continuity. As shown in Fig. 6b, the ID annotations precisely matched the manually labelled ground truth presented in Fig. 6c, which was obtained from an actual captured image. For each identified object, trajectory prediction was performed 15 frames ahead, following the approach introduced in Fig. 5a. The final tracking and prediction results are shown in Fig. 6d-g, corresponding to frames 60, 120, 180, and 240 of the 300-frame sequence. The solid black lines represent the centroid trajectories of the objects. The hollow blue circles indicate the historically predicted positions. The solid black stars indicate the current predicted locations. These results confirm the effectiveness of the proposed approach in continuously tracking five labelled objects (ID:0 to ID:4) and accurately predicting their motion trajectories. The complete results of multi-object tracking and trajectory prediction can be found in Supplementary Movie S2. To further quantify the prediction performance, the RMSEs between the predicted future positions and the corresponding reference positions in the target frames were computed. The results for the representative cases in Figs. 5 and 6 are summarised in Supplementary Note S15.
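The angular-domain K-means step used to separate spatially overlapping objects can be sketched as follows. Clustering on (cos θ, sin θ) to handle angular wrap-around and fixing the cluster count are implementation assumptions; the actual selection of the number of clusters is part of the MTP tracking strategy.

```python
import numpy as np
from sklearn.cluster import KMeans

def split_by_direction(flow, region_mask, n_clusters=2):
    """Split one connected moving region into direction-consistent sub-regions
    via K-means in the angular domain (cf. Region 1 in Fig. 6a)."""
    ys, xs = np.nonzero(region_mask)
    angles = np.arctan2(flow[ys, xs, 1], flow[ys, xs, 0])       # per-pixel flow direction
    feats = np.stack([np.cos(angles), np.sin(angles)], axis=1)  # wrap-around-safe features
    labels = KMeans(n_clusters=n_clusters, n_init=10).fit_predict(feats)
    masks = []
    for c in range(n_clusters):
        m = np.zeros_like(region_mask, dtype=bool)
        m[ys[labels == c], xs[labels == c]] = True               # one mask per candidate object
        masks.append(m)
    return masks
```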
Fig. 6 Multi-object tracking and trajectory prediction using the MTP framework. a Processing flowchart of the multi-object tracking strategy using the MTP framework. b Multi-object tracking results with unique IDs using the MTP framework. c Manually annotated ground truth from the scene captured by the PINS, showing a one-to-one correspondence with the IDs in (b). d-g Tracking and trajectory prediction results for multiple moving objects at frames 60, 120, 180, and 240 of the 300-frame sequence. The solid black lines represent the centroid trajectories of the objects. The hollow blue circles indicate historical predicted positions. The solid black stars mark the current predicted locations. The frame numbers corresponding to selected representative predicted positions are labelled nearby.
-
The functional advantages of the compound eyes of insects were emulated by integrating a well-designed metalens-array-based optical system with an MMS neural network. Building on this foundation, an MTP framework was introduced to enable intelligent trajectory forecasting. This biomimetic strategy enabled both ultrawide FoV imaging and accurate motion sensing and prediction within a compact platform. This wide-angle imaging capability is primarily determined by the hardware design of the metalens array, whereas the motion detection performance is influenced by a combination of factors including the imaging quality, temporal resolution, and accuracy of the optical flow estimation enabled by the MMS neural network. Several hardware-level improvements can be pursued to further enhance the motion sensing capabilities of the system. Employing high-performance CMOS sensors with smaller pixel sizes and faster frame rates can significantly improve spatial and temporal resolutions. Smaller pixels enable the detection of finer spatial displacements, whereas higher acquisition speeds allow the capture of subtle temporal variations in object motion. In addition, integrating faster CPUs or GPUs for downstream image processing can substantially accelerate the extraction of motion information from captured signals, enabling near-real-time motion analysis for dynamic scenes. This is particularly beneficial for scenarios involving sudden stops or sharp turns, where the prediction accuracy may otherwise temporarily decrease owing to abrupt deviations from the preceding motion trend. A higher frame rate can alleviate this limitation by reducing the interframe motion gap and improving the robustness of trajectory prediction in dynamic scenes. Meanwhile, real-time inertial data from the MAV can be used to estimate and remove the MAV-induced global flow component from the MMS-derived optical flow such that the residual flow mainly reflects the motion of independently moving objects in the scene. Furthermore, the robustness of the MTP framework can be improved in future studies for scenarios involving object crossing and partial occlusion46,47.
One of the key advantages of the proposed wide-angle imaging approach is the modular design of the metalens array, in which each individual metalens is responsible for a specific angular FoV. This architecture enables not only flexible expansion of the overall imaging coverage, by adding metalenses to the array to capture side-FoV scenes, but also flexible control of the overlap between adjacent angular sectors through the design of the added linear tilt phases. Consequently, the system can be configured to provide either richer motion information redundancy or broader spatial coverage, depending on the application requirements, as illustrated in Supplementary Note S16. The challenge of reduced light intensity at large off-axis angles can also be addressed within this metalens array design: the proposed method permits an increase in the aperture size of the metalenses dedicated to wide-angle detection, thereby enhancing light throughput and mitigating intensity degradation. Another fundamental advantage of the proposed system is its ability to reliably detect and predict the motion of multiple dynamic objects across wide scenes. This is enabled by the MMS neural network for optical flow estimation and by the MTP framework for future trajectory prediction. This architecture allows frame-by-frame tracking and future trajectory estimation of multiple targets simultaneously, even under challenging conditions such as small size, slow motion, or visual blending with the background. Although the current horizontal FoV extension already meets the demands of most ground-based applications, such as autonomous driving and landscape surveillance, the viewing range can be further extended to two-dimensional coverage using the metalens array architecture, enabling near-omnidirectional angular observation. A binocular configuration of the system introduces additional functionality for depth-aware motion analysis. The detailed implementation and underlying principles of the stereo-depth extension are presented in Supplementary Note S17.
In conclusion, a planar intelligent nanophotonic sensor inspired by the compound eyes of insects was realised by combining hardware and algorithmic innovations to achieve precise motion-state detection and intelligent trajectory prediction across ultrawide-angle scenes. Through the optical flow generated by the neural network, the device enables precise motion-state detection and subsequent trajectory prediction of moving objects across an ultrawide horizontal FoV of 135°. To ensure effective training of the neural network, a PSF-based convolutional imaging model that enables the rapid generation of large-scale, high-quality wide-angle scene and optical flow data pairs was proposed. The well-trained neural network demonstrates exceptional accuracy and robustness in motion detection, enabling the precise tracking of minute displacements, small-scale targets, and motion under complex background conditions, which remains a challenge for conventional computer-vision-based recognition approaches. The integration of the MTP framework enables the system to observe and track moving targets and accurately forecast their future positions. Owing to its compact size and lightweight design, the PINS is particularly well suited for integration into MAVs, enabling agile and flexible deployment in dynamic environments. This significantly broadens the range of potential applications in autonomous driving and intelligent transportation systems. Equipped with a PINS, MAVs can perform real-time motion detection and trajectory prediction for vehicles and pedestrians, thereby facilitating informed decision-making and effective risk avoidance.
-
The phase profile of the central-FoV metalens was determined by ray-tracing optimisation using Zemax. Six incident angles (0°, 4.5°, 9°, 13.5°, 18°, and 22.5°) were selected for the analysis to cover the designed central field of view. The metalens was modelled as a “Binary2” surface, with the phase profile described by an even-order polynomial $ \phi (r)=\sum\nolimits_{i=1}^{5}{a}_{i}\cdot {(r/R)}^{2i} $, where R is the metalens radius, r is the radial coordinate, and ai are the coefficients to be optimised. The optimisation objective was to minimise the focal spot size for the selected incident angles. Global optimisation in Zemax yielded optimal coefficients a1 to a5 of −888.05, 4.73, 1.84, −1.35, and 0.0076, respectively. The phase profiles of the two side-FoV metalenses were subsequently generated by introducing tilt phases of ±kd sin 45° into the central-FoV metalens phase, enabling off-axis focusing. The corresponding ray-tracing verification of the focusing performance of the metalenses with different FoVs is provided in Supplementary Note S2.
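The designed phase profiles can be reproduced numerically from the optimised coefficients, as in the following sketch at λ = 550 nm. Treating the coefficients as radian-valued and the ±kd sin 45° term as a linear ramp along one lateral coordinate (so that the total phase change across the 1 mm aperture equals kd sin 45°) are implementation assumptions consistent with the linear tilt phases described in the text.

```python
import numpy as np

wavelength = 550e-9                               # design wavelength (m)
k = 2 * np.pi / wavelength                        # wave number
R = 0.5e-3                                        # metalens radius (diameter d = 1 mm)
a = [-888.05, 4.73, 1.84, -1.35, 0.0076]          # optimised Binary2 coefficients a1..a5

x = np.linspace(-R, R, 1001)
X, Y = np.meshgrid(x, x)
r = np.hypot(X, Y)

# Central-FoV phase: phi(r) = sum_i a_i * (r/R)^(2i)
phi_centre = sum(a_i * (r / R) ** (2 * (i + 1)) for i, a_i in enumerate(a))

# Side-FoV phases: add a linear tilt whose total phase change across the
# aperture is +/- k*d*sin(45 deg), steering each lens towards +/-45 deg.
tilt = k * np.sin(np.deg2rad(45)) * X
phi_left, phi_right = phi_centre - tilt, phi_centre + tilt
```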
-
Metalens structures were fabricated using silicon nitride nano-posts with a height of 1,000 nm and varying cross-sectional sizes. Through finite-difference time-domain (FDTD) simulations, eight types of nano-posts were identified that cover the 2π phase range with high transmittance (>90%); their diameters were 81, 110, 134, 153, 169, 184, 199, and 212 nm, respectively. The complete structural pattern was constructed by assigning appropriately sized nano-posts on a pixel-by-pixel basis, in accordance with the designed phase profile. For metalens fabrication, a 1,000-nm-thick silicon nitride layer was first deposited on a SiO2 substrate via plasma-enhanced chemical vapour deposition (PECVD). Subsequently, a 225-nm PMMA A4 resist and a 50-nm AR-PC 5090 conductive layer were spin-coated and exposed using an electron-beam lithography system (Elionix ELS-F125). After development (MIBK:IPA = 1:3, 120 s; IPA rinse, 60 s), a 40-nm Cr layer was deposited and patterned via lift-off to serve as a hard mask for the dry etching of silicon nitride. Finally, the sample was etched, and the Cr mask was removed using ceric ammonium nitrate, yielding the final metalens structure on the substrate.
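The pixel-by-pixel assignment of nano-posts can be illustrated with the lookup sketch below. Assuming the eight posts sample the 2π range uniformly is a simplification; in practice the phase response of each diameter comes from the FDTD parameter sweep.

```python
import numpy as np

# Eight Si3N4 nano-post diameters listed above; uniform phase sampling is assumed.
post_diameters_nm = np.array([81, 110, 134, 153, 169, 184, 199, 212])
post_phases = np.linspace(0, 2 * np.pi, len(post_diameters_nm), endpoint=False)

def select_post(target_phase):
    """Pick the nano-post whose (assumed) phase is closest to the target
    phase, wrapped to [0, 2*pi), using circular distance."""
    wrapped = np.mod(target_phase, 2 * np.pi)
    diff = np.angle(np.exp(1j * (post_phases - wrapped)))   # circular phase difference
    return post_diameters_nm[np.argmin(np.abs(diff))]
```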
-
This study was supported by the National High-Level Personnel of Special Support Program, the Basic Research Program of Jiangsu (No. BK20252002), the Young Elite Scientists Sponsorship Program by CAST (No. 2022QNRC001), the National Natural Science Foundation of China (Nos. 61960206005 and 61871111), the Jiangsu Key R&D Program (No. BE2023011-2), the Fundamental Research Funds for the Central Universities (No. 2242022k60001), and the Project of the National Mobile Communications Research Laboratory (No. 2026A03).
Bioinspired planar intelligent nanophotonic sensor for wide-angle accurate motion perception and prediction
- Light: Advanced Manufacturing, Article number: 64 (2026)
- Received: 14 January 2026
- Revised: 02 April 2026
- Accepted: 13 April 2026
- Published online: 15 May 2026
doi: https://doi.org/10.37188/lam.2026.064
Abstract: In nature, certain insects possess specialised compound eye structures that provide an ultra-wide field-of-view (FoV) and rapid response capabilities, enabling them to capture prey and avoid obstacles. Herein, inspired by compound eyes, a planar intelligent nanophotonic sensor (PINS) based on a metalens array, which possesses an ultrawide horizontal FoV exceeding 135°, is demonstrated. By leveraging a deep neural network, meta-motion sense (MMS), accurate optical flow can be extracted from PINS-captured wide-FoV scenes, enabling a comprehensive characterisation of the motion velocities and directions of all dynamic objects. Compared to traditional machine-vision-based object recognition algorithms, the proposed approach exhibits significantly higher accuracy and robustness, particularly in detecting small, slow, or background-blended moving targets, and offers an intelligent predictive capability for forecasting the motion trajectories of objects. The proposed device combines the advantages of high compactness, superior motion-detection performance, and intelligent functionality, offering a promising foundation for next-generation applications in autonomous navigation, situational awareness, and military surveillance.
Research Summary
Bioinspired Sensor: Wide-View Motion Sense
A sensor mimicking insect compound eyes, which uses a metalens array and an AI neural network to detect motion over an ultra-wide field of view, shows great promise for autonomous navigation, situational awareness, and military surveillance. Insects like dragonflies have compound eyes that provide wide viewing angles and high motion sensitivity, but mimicking these structures in man-made devices has long been challenging due to fabrication difficulties and bulky designs. Ji Chen from China's Southeast University and colleagues developed a planar sensor (PINS) with a horizontal view exceeding 135°; its AI system extracts precise motion details, including speed and direction, and outperforms standard machine-vision methods, even for small, slow-moving targets hidden in complex backgrounds. Lightweight and compact, it is easily deployable on mobile platforms like miniature drones.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.