Multi-task large-scale integrated optical vision processor using ultra-fast parallel nanofabrication

Wenqi Ouyang; Wen Lyu; Jianming Xiong; Jiayong Peng; Mingcheng Luo; Kaifei Tang; Shih-Chi Chen; Chaoran Huang

doi:10.37188/lam.2026.096

Optical neural networks (ONNs) promise ultra-fast, low-power machine vision; however, visible-wavelength implementations are constrained by limited neuron density and accuracy. Building on random projections, which provide efficient untrained feature encoding, we advance ONN performance using a high-throughput randomised multi-focus two-photon lithography (TPL) platform that fabricates millions of 500 nm neurons at the millimetre scale within 15 min. The resulting platform achieves ≥97% accuracy in multiple image classification and keypoint detection tasks using minimal digital parameters, outperforming other devices of comparable neuron densities, while enabling compact integration with camera systems through its transparent design. Our results indicate that ONNs can serve as scalable and practical solutions for high-performance multi-task machine vision.


Introduction

Deep learning has achieved remarkable advancements in recent years ¹, providing effective solutions to various challenges in artificial intelligence, particularly in machine vision applications such as image and video recognition ², object detection ³, and image segmentation ⁴. In traditional machine vision architectures, imaging, perception, and processing are treated as separate and sequential tasks because of their distinct functional requirements ⁵. Images captured by high-resolution sensors in bulky imaging systems are processed by graphics processing units (GPUs) to expedite computational tasks. However, von Neumann hardware struggles to meet the computational demands of deep learning models, which presents challenges related to speed, power consumption, and data storage ⁶. Recent developments have leveraged free-space optical processing to address these limitations, offering exceptional speed, low power consumption, and the capacity to manage multiple data streams simultaneously ^{7–14}. These attributes, coupled with the broad frequency range of light, enable ultra-high bandwidth and data throughput, thereby making optical processors highly suitable for machine vision tasks that demand performance, scalability, and energy efficiency ^{6,7,10,13–16}. Optical processors present new opportunities for enhanced functionality and efficiency when integrated with imaging systems.

The emergence of all-optical diffractive neural networks (DNNs) represents a significant advancement in optical machine vision, employing three-dimensional (3D)-printed diffractive surfaces for image classification and recognition at terahertz frequencies ^{8,9,17,18}. Early demonstrations of diffractive optical neural networks were often implemented in the terahertz regime, where the long wavelength relaxes fabrication precision requirements and facilitates rapid proof-of-concept validation. However, operation in the visible/near-visible spectrum is more relevant for practical machine vision and camera-integrated platforms. Meanwhile, visible-to-near-infrared (NIR) wavelength implementation demands substantially finer feature sizes and higher neuron densities to achieve wavelength-scale phase modulation and sufficient spatial bandwidth ^{11,12,16,19,20}. These requirements impose stringent fabrication precision and scalability challenges for large-area devices. Motivated by these constraints, we developed a high-throughput randomised multi-focus two-photon lithography (TPL) platform that enables the rapid fabrication of high-density diffractive layers at the millimetre scale. Qu et al. demonstrated this potential by developing a high-accuracy optoelectronic hybrid neural network using a single metasurface, which achieved a classification accuracy of 98.05% on MNIST while overcoming misalignment challenges ²⁰. Recently, Chen et al. introduced LightGen, a dimensionality-manipulation-based all-optical computing framework that enables large-scale photonic integration and optical latent-space transformation ²¹, highlighting the emerging role of structured optical dimensionality conversion in scalable photonic information processing. However, the challenges introduced by free-space visible-wavelength applications are yet to be addressed, including the need for smaller neuron sizes and higher fabrication precision at large scale, which is constrained by traditional manufacturing methods and design limitations ^{6,11,12,16}.

Existing fabrication techniques such as wet chemical etching and reactive ion etching (RIE) encounter significant obstacles in producing the high-density, high-precision diffractive layers necessary for visible-wavelength DNNs ²². The reliance on costly, time-consuming processes such as electron beam lithography (EBL) limits scalability and practical implementation ^{22,23}. In addition, fabrication errors can accumulate across the diffractive layers, restricting the complexity of the model and experimental performance ^{6,16,24}. These issues result in DNNs with limited neuron densities, small model sizes (typically fewer than one million neurons), and suboptimal generalisation capabilities, which makes them inadequate for large-scale machine-learning tasks ^{6,16}.

To address these limitations, random projection-based optoelectronic computing systems have been developed; random projections are powerful feature-encoding methods that enhance classification performance in machine learning ^{25–32}. In our previous work ³¹, we experimentally demonstrated that a diffractive neural network based on an optical metasurface composed of 41 million photonic neurons performed random projections. Further, we demonstrated that, at this scale of neuron count, a single-layer diffractive neural network can outperform multi-layered diffractive neural networks and even rival large-scale AI models such as ResNet and Vision Transformers. However, conventional implementations using static scatterers or EBL-fabricated metasurfaces are limited by flexibility, fabrication cost, and environmental constraints.

In this study, we combine this concept with a custom-built randomised multi-focus TPL system ³³, which offers significant advantages in terms of fabrication speed, precision, cost, and structural versatility. The system enables rapid prototyping of complex millimetre-scale 3D nanostructures within 15 min without cleanroom requirements, which makes the approach highly scalable for large-area optical systems and significantly reduces fabrication time. Using this system, we demonstrate a multi-task DNN processor for visible-wavelength image processing. The diffractive encoding device is printed cost-effectively at a density of 4 million neurons/mm² and supports up to four million neurons. This scalability and high neuron density are enabled by the TPL system, which achieves 500 nm wavelength-scale pixels at a processing speed of 0.267 million neurons per minute. Our processor integrates a diffractive layer with a lens into a compact millimetre-scale design (1 mm), followed by a simple single-layer digital network that processes the Fourier-region information projected onto a camera. The proposed system achieves high recognition accuracy (≥97%), reduced hardware requirements, and minimal electronic post-processing for various machine vision tasks such as hand-drawn figure classification, object recognition, human action recognition, flow cytometry image classification, and keypoint detection in human faces. The training costs are significantly reduced because only the digital readout layer requires training, which eliminates the need for optical network training.
Further, our method is compatible with a wide range of imaging systems, which opens up applications of optical neural network (ONN)-based devices in areas such as light detection and ranging (LiDAR) ³⁴, optical coherence tomography ³⁵, biomedical diagnostics ³⁶, and human-computer interaction ³⁷. This integrated approach enhances the scalability, performance, and practicality of DNNs at visible wavelengths, paving the way for their broader application in optical machine vision. Furthermore, this method facilitates cost-effective high-throughput mass production and rapid product development. Overall, this study marks a critical step towards scalable, integrated, and high-performance optical processing solutions that can shift the paradigm in machine vision technology.

High-Throughput 3D Nanofabrication of DOEs via Multi-Focus TPL

Our previous work has shown that the size and statistical distribution of the projection matrix play a critical role in the performance of diffractive optical computing systems ³¹. Traditional methods for implementing such projection matrices often rely on static physical scatterers or metasurfaces fabricated by EBL, which are limited by design flexibility, fabrication cost, and environmental constraints. In contrast, our diffractive optical elements (DOEs) are directly fabricated via multi-focus TPL ³³, which enables the construction of a large-scale, optically encoded transmission matrix with precisely defined spatial modulation via hundreds of 3D programmable laser foci. Compared to EBL-based metasurfaces, our method offers significant advantages in terms of fabrication speed, cost, and structural versatility. It supports true 3D structuring without requiring vacuum or cleanroom conditions and operates as a single-step direct-write process with no postprocessing requirements. This enables the rapid prototyping of complex DOEs within 15 min, which makes it highly scalable for large-area optical systems.

We implement a randomised multi-focus TPL nanofabrication platform using binary digital micromirror device (DMD) holography (Supplementary Fig. S1) driven by a low-repetition-rate femtosecond laser amplifier (800 nm wavelength, 1 kHz repetition rate, and 100 fs pulse width) to provide high peak power for parallel nanofabrication. The DMD employs a binary-hologram-based weighted Gerchberg-Saxton algorithm to generate a >99.9% uniform 3D multi-focus array in the Fourier plane. Subsequently, this array is focused under a dip-in configuration into a droplet of custom photoresist using a 100× oil-immersion objective (NA = 1.3); the photoresist has a post-polymerisation refractive index of 1.520 at 520 nm ³³.
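To make the weighted Gerchberg-Saxton step more concrete, the following is a minimal sketch of the generic algorithm for a phase-only hologram that produces a 5 × 5 grid of foci in the Fourier plane. It is not the authors' implementation: the continuous-phase hologram, the 64 × 64 grid, the focus positions, and the names `weighted_gs` and `uniformity` are assumptions made for illustration, and the binary DMD encoding and the axial (3D) placement of foci are not modelled.

```python
import numpy as np

def uniformity(intensity):
    """Common uniformity metric: 1 - (Imax - Imin) / (Imax + Imin)."""
    return 1.0 - (intensity.max() - intensity.min()) / (intensity.max() + intensity.min())

def weighted_gs(n=64, offsets=(-6, -3, 0, 3, 6), iterations=50, seed=0):
    """Phase-only weighted Gerchberg-Saxton for a 2D grid of Fourier-plane foci.

    n, offsets, and iterations are illustrative values, not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    y, x = np.mgrid[0:n, 0:n]
    # One linear phase ramp per target focus (integer spatial frequencies).
    freqs = [(kx, ky) for ky in offsets for kx in offsets]
    ramps = np.stack([2 * np.pi * (kx * x + ky * y) / n for kx, ky in freqs])
    basis = np.exp(1j * ramps)                      # shape (M, n, n)

    weights = np.ones(len(freqs))
    spot_phase = rng.uniform(0, 2 * np.pi, len(freqs))
    history = []
    for _ in range(iterations):
        # Hologram phase: argument of the weighted superposition of ramps.
        target = weights * np.exp(1j * spot_phase)
        phase = np.angle(np.tensordot(target, basis, axes=1))
        # Complex amplitude that the phase-only hologram sends to each focus.
        field = np.exp(1j * phase)
        spots = np.tensordot(np.conj(basis), field, axes=([1, 2], [0, 1])) / n**2
        amp = np.abs(spots)
        history.append(uniformity(amp**2))
        # Weighted update: boost the weights of foci that came out too dim.
        weights = weights * amp.mean() / amp
        spot_phase = np.angle(spots)
    return phase, history

phase, history = weighted_gs()
print(f"uniformity after first and last iteration: {history[0]:.3f}, {history[-1]:.3f}")
```

The weight update is what distinguishes this from the plain Gerchberg-Saxton iteration: each focus that receives less than the mean amplitude is given a larger target weight in the next pass, which drives the foci towards equal intensity.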

Unlike conventional TPL systems that print DOE pixels sequentially, our randomised scanning strategy combines temporal randomisation with spatial parallelisation. Within each 50-μm-wide DOE unit, pixels with a 500 nm lateral pitch and height levels ranging from 100 to 500 nm are realised, with the z-positions of the laser foci defining the final height, thereby enabling direct 3D phase-gradient structuring. Each pixel comprises four adjacent voxels arranged in a quadrilateral (200 nm spacing); each voxel is exposed to a single femtosecond pulse, and the four voxels are exposed sequentially by four consecutive pulses to prevent capillary-force-induced collapse and ensure structural robustness ^{33,46}. Traditional sequential exposure normally causes polymerisation-diffusion-induced pixel merging and blurred boundaries. To suppress this issue, a temporal interval of at least 20 ms is enforced between neighbouring pixel exposures. We employ a randomised sub-block scanning scheme across 25 parallel foci to isolate pixel exposures and suppress crosstalk and stitching artefacts, while enabling a fast fabrication time of only 1.6 s per 50 μm unit. The results of both random and sequential scanning methods are shown in Supplementary Fig. S2 and Movie S1. This randomised multi-focus scanning strategy is the first demonstration that combines voxel-level randomisation with parallel multi-beam exposure to ensure high pixel fidelity and rapid production (Fig. 1b–e). During large-area writing, a piezoelectric hexapod compensates for stitching alignment errors with sub-100-nm precision, thereby enabling the high-quality assembly of microscale-thickness structures across millimetre scales and rapid prototyping of complex diffractive optical elements (Supplementary Fig. S6). Accounting for the stage-stitching time, the entire 1 mm² area is fabricated in only 15 min.
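The quoted throughput figures can be cross-checked with a short calculation. The sketch below assumes that every laser pulse exposes one voxel at each of the 25 foci and that no pulses are skipped; this is one reading that reproduces the stated 1.6 s per unit, not a description of the actual controller. The class and function names are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class WritePlan:
    area_side_um: float = 1000.0   # 1 mm x 1 mm device
    unit_side_um: float = 50.0     # one DOE unit
    pixel_pitch_um: float = 0.5    # 500 nm lateral pitch
    voxels_per_pixel: int = 4      # four single-pulse voxels per pixel
    parallel_foci: int = 25
    rep_rate_hz: float = 1000.0    # 1 kHz amplifier
    total_time_min: float = 15.0   # quoted end-to-end time, including stitching

def budget(plan: WritePlan) -> dict:
    pixels_per_unit = round(plan.unit_side_um / plan.pixel_pitch_um) ** 2
    pixels_per_focus = pixels_per_unit // plan.parallel_foci
    # Assumption: one pulse per voxel, all foci firing on every pulse.
    unit_time_s = pixels_per_focus * plan.voxels_per_pixel / plan.rep_rate_hz
    units = round(plan.area_side_um / plan.unit_side_um) ** 2
    exposure_min = units * unit_time_s / 60.0
    total_neurons = units * pixels_per_unit
    return {
        "pixels_per_unit": pixels_per_unit,
        "unit_time_s": unit_time_s,
        "units": units,
        "exposure_min": exposure_min,
        "overhead_min": plan.total_time_min - exposure_min,
        "total_neurons": total_neurons,
        "neurons_per_min": total_neurons / plan.total_time_min,
    }

result = budget(WritePlan())
for key, value in result.items():
    print(f"{key}: {value}")
```

Under these assumptions, each unit holds 10,000 pixels, each focus writes 400 of them with 1,600 pulses, and the 400 units take about 10.7 min of pure exposure. The difference from the quoted 15 min would then be the stage-stitching overhead, and 4 million neurons in 15 min gives the stated rate of roughly 0.267 million neurons per minute.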

Random projection-based image classification

Experimental results for various complex vision tasks

Discussion

We simulate the effect of random matrix dimension on classification accuracy. We set random matrices to dimensions of up to 2,000 on the diffractive layer based on the TPL nanofabrication system outlined in Fig. 1b. We evaluate the effect of neuron density on classification accuracy using the Fashion-MNIST dataset, as shown in Fig. 5. The performance of the ONN processors is highly dependent on their physical parameters, particularly neuron density, which directly affects diffraction behaviour ¹², as evidenced by the camera-captured images shown in Fig. 5a. Consequently, optimising neuron density is crucial for maximising performance. Our results show that the highest classification accuracy is achieved with a unit-cell period of 500 nm. As shown in Fig. 5, samples with different neuron densities are fabricated with unit periods of 500, 600, 700, 800, and 1,200 nm, all printed within a uniform area of 1 × 1 mm² for the Fashion-MNIST dataset. The SEM images of these samples in Fig. 5c clearly illustrate this trend. Using our custom-built randomised multi-focus parallel TPL system, large quantities of samples can be produced rapidly and cost-effectively. The transmission efficiency of each sample exceeds 70%, as shown in Supplementary Table S3. The optical layer performs random-projection-based feature encoding in the Fourier domain, whereas explicit dimensionality reduction is achieved through digital spatial downsampling. These two stages jointly determine the overall computational performance of the system. When the number of electronic network parameters in the single fully connected layer is reduced from 100,000 to 1,000, the effect of neuron density on the classification accuracy becomes more pronounced. As shown in Fig. 5b, the performance gap between different digital parameter counts decreases with increasing neuron density and becomes negligible at 4 million neurons/mm².
This indicates that a higher optical neuron density progressively offloads the computational burden from the digital backend. Conversely, multiclass separability becomes limited when the number of digital weights falls significantly below 1,000, thereby reducing the classification robustness. These results highlight the flexibility of optoelectronic co-design, where the optical neuron density and digital complexity can be jointly optimised. For comparison, we replace our printed DOE with an optical diffuser (Daheng Optics, GCL-201103, size: 25.4 mm, 1,500 lines/inch) while retaining all other experimental settings, and we test its performance. The optical diffuser exhibits the lowest classification accuracy because of its uncontrolled surface morphology and limited spatial-frequency engineering capability. This result underscores the importance of neuron-density controllability and engineered phase distribution for achieving stable and high-performance optical random projection. The high-precision nanofabrication platform ensures repeatability and scalability that cannot be achieved using natural scattering media. This highlights the crucial role of neuron density in the computational functionality of ONNs, particularly when electronic network parameters are constrained. This observation aligns well with those of previous studies, emphasising the importance of higher neuron densities in the visible spectrum for achieving superior classification performance ^{11,12}. These insights will guide future designs that aim to optimise the interplay between neuron density and computational capacity in optical neural networks.
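The optical encoding stage discussed above can be illustrated with an idealised forward model: a random height map imposes a phase delay on the input field, an ideal lens performs a 2D Fourier transform, and the camera records intensity. This is a sketch under stated assumptions rather than the simulation used in the paper. It treats the DOE as a thin element in air, assumes five discrete height levels in 100 nm steps, and uses a hypothetical 256 × 256 grid; only the 100–500 nm height range, the refractive index of 1.520, and the 520 nm wavelength are taken from the text.

```python
import numpy as np

WAVELENGTH_NM = 520.0   # green laser used in the experiments
N_RESIST = 1.520        # post-polymerisation refractive index at 520 nm

def height_to_phase(height_nm):
    """Thin-element phase delay of a resist pixel relative to air (assumption)."""
    return 2 * np.pi * (N_RESIST - 1.0) * height_nm / WAVELENGTH_NM

def encode(image, height_nm):
    """Random phase mask followed by an ideal lens; returns Fourier-plane intensity."""
    field = image * np.exp(1j * height_to_phase(height_nm))
    spectrum = np.fft.fftshift(np.fft.fft2(field, norm="ortho"))
    return np.abs(spectrum) ** 2

rng = np.random.default_rng(0)
grid = 256                                                   # hypothetical simulation grid
levels_nm = np.array([100.0, 200.0, 300.0, 400.0, 500.0])    # assumed discrete levels
heights = rng.choice(levels_nm, size=(grid, grid))

image = np.zeros((grid, grid))
image[96:160, 112:144] = 1.0                                 # toy binary input
speckle = encode(image, heights)
print(speckle.shape, float(speckle.sum()))
```

Within this thin-element approximation, the 100–500 nm height range corresponds to phase delays between 0.2π and π at 520 nm. The orthonormal FFT conserves energy, so the total detected intensity equals the total input intensity; the digital downsampling stage then reduces this speckle pattern to a small feature vector.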

Finally, our unique TPL platform is compatible with the visible to near-infrared operating wavelengths subject to material transparency and refractive index stability of the photoresist. With broader material selection and nanoimprint replication strategies, the operational range could potentially extend from the near-UV to infrared regimes. In addition, centimetre-scale devices are feasible through tiled writing and imprint replication, which enables larger optical apertures for practical imaging systems.

In summary, we introduced a multi-task integrated ONN processor fabricated using a custom-built randomised multi-focus TPL system. The processor features DOEs that perform random-projection-based image classification at the speed of light, which enables efficient pre-sensor feature extraction from optical inputs and facilitates deployment across a wide range of machine-vision tasks. The processor achieves millimetre-scale integration of millions of photonic neurons, rapid fabrication within 15 min, low training costs, and a compact digital readout layer with only 1,000 parameters, while maintaining high computational performance and compatibility with standard imaging systems for fast and energy-efficient operation. Our free-space optoelectronic computing system shifts computation from electronics to optics, offering distinctive advantages in terms of compactness, practicality, and low power consumption. Further, it can be extended to incoherent illumination for real-world imaging applications. These improvements represent a major advancement towards scalable and high-performance optical processing solutions for machine vision. Furthermore, integrating the front end of an ONN processor with a sensor containing computing units at the back end offers a promising solution for data readout and transport without the need for analogue-to-digital conversion ^{5,62–65}. This results in low-latency, low-power processing that significantly enhances overall efficiency ^{5,62–66}.

Materials and methods

Materials and preparation of photoresist

Pentaerythritol tetraacrylate (PETA, technical grade), Bisphenol A bis(phthalic anhydride) (BPADA, 97%), 4-hydroxyanisole (MEHQ, 99%), and isopropanol (IPA, 99%) were purchased from Sigma Aldrich. All chemicals and photoresists were used as received, without further purification. The photoresist was prepared using previously reported methods ^{38,67}. A 0.2 wt% concentration of the initiator was dissolved in a monomer mixture of PETA (32 wt%) and BPADA (68 wt%) under vigorous sonication.

Characterisations

Scanning electron microscopy (SEM) images were acquired using a JEOL JSM-7800F field-emission scanning electron microscope operating at an accelerating voltage of 5 kV with a tilt stage. Prior to imaging, the samples were coated with a platinum layer using an Edwards sputter coater to enhance conductivity. Optical images were captured using a COSSIM CMY-310 optical microscope.

3D parallel nanofabrication system

We use binary DMD holography for randomised multi-focus TPL nanofabrication. A schematic of the experimental setup is shown in Supplementary Fig. S1. The setup uses a low-repetition-rate femtosecond laser (Spitfire Pro), whose output is diffracted by a 600-lines/mm grating and relayed through 4f telescopes for dispersion pre-compensation before illuminating the DMD. The DMD projects a binary Lee hologram encoding a custom multi-focus array that is Fourier transformed and spatially filtered. A 4f relay system (a lens and a 100× oil-immersion objective lens with NA = 1.3) demagnifies and focuses the pulses into the photoresist in a dip-in configuration. After development, cured DOE structures remain on the substrate. Back-illumination with yellow light enables real-time imaging via the same objective lens, while a six-axis nano-positioning stage ensures precise alignment for large-area stitching of ultra-thin structures.

Training readout neural network

A single-layer fully connected network (FCN) is employed as the readout neural network for the image classification task. Supplementary Fig. S4 illustrates the sequential training procedure for neural networks implemented in the digital backend. Using the Fashion-MNIST dataset as an example, the captured light-field output image is first preprocessed by cropping its central region (e.g. 100 × 100 pixels), followed by downsampling to a resolution of 10 × 10 pixels. The resulting image is then flattened into a feature vector $ \boldsymbol{x} $ and input into the single-layer FCN. A total of 1,000 weight parameters $ {\boldsymbol{W}}_{\boldsymbol{t}} $ are trained to generate the final classification result $ \boldsymbol{y} $, which can be mathematically formulated as $ \boldsymbol{y}={\rm{softmax}}\left(\boldsymbol{b}+{\boldsymbol{W}}_{\boldsymbol{t}}\boldsymbol{x}\right) $, where $ \boldsymbol{b} $ is the bias vector. Network training is implemented using Python 3.12 and PyTorch 2.3.1. The optimisation framework employs the Adam algorithm over 40 training epochs, with parameter updates driven by minimisation of the negative log-likelihood loss between predicted probabilities and ground-truth labels. Computational workflows are accelerated using an Intel Core i7-13620 K/NVIDIA RTX 3080 Ti hardware configuration. For the facial point detection task, Supplementary Fig. S7a illustrates the architecture of the proposed optoelectronic neural network. The system processes 96 × 96-pixel facial images using an optical encoder, followed by a lightweight digital backend that functions as a feature-extraction decoder.
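The preprocessing and readout steps described above can be sketched in a few lines. The example uses NumPy as a stand-in for the PyTorch model and covers only the forward pass; training with Adam and the negative log-likelihood loss is omitted. Block averaging is assumed as the downsampling method because the text does not specify one, and the camera frame size, the random placeholder weights, and the function names are hypothetical.

```python
import numpy as np

def preprocess(frame, crop=100, out=10):
    """Centre-crop to crop x crop, block-average to out x out, then flatten."""
    r0 = (frame.shape[0] - crop) // 2
    c0 = (frame.shape[1] - crop) // 2
    roi = frame[r0:r0 + crop, c0:c0 + crop]
    block = crop // out
    # Assumption: downsampling by averaging non-overlapping block x block tiles.
    pooled = roi.reshape(out, block, out, block).mean(axis=(1, 3))
    return pooled.ravel()

def softmax(z):
    z = z - z.max()              # subtract the maximum for numerical stability
    e = np.exp(z)
    return e / e.sum()

def readout(x, W, b):
    """Single fully connected layer: y = softmax(b + W x)."""
    return softmax(b + W @ x)

rng = np.random.default_rng(0)
frame = rng.random((480, 640))                   # stand-in for a captured camera frame
x = preprocess(frame)                            # 100 features
W = rng.normal(scale=0.01, size=(10, x.size))    # untrained placeholder weights
b = np.zeros(10)
y = readout(x, W, b)
print(x.shape, W.size, int(y.argmax()))
```

With 10 classes and a 10 × 10 downsampled image, the weight matrix has 10 × 100 = 1,000 entries, which matches the parameter count quoted in the text.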

Training dataset processing

For the MNIST, Fashion-MNIST, and CIFAR-10 datasets, 1,000 images per class are selected to form a dataset of 10,000 images, followed by a random 80:20 training–test split. The Weizmann and fluorescent image datasets, which contain 5,687 and 2,000 images, respectively, are randomly split into training and testing sets using the same 80:20 ratio.
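The subset selection and split can be expressed as index operations. The sketch below assumes that the 1,000 images per class are drawn at random and that the 80:20 split is a plain (non-stratified) random split, since the text does not state either detail. The function name and the toy label array are hypothetical.

```python
import numpy as np

def balanced_subset_split(labels, per_class=1000, train_fraction=0.8, seed=0):
    """Pick per_class samples of each class, then split the pool at random."""
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    chosen = []
    for cls in np.unique(labels):
        idx = np.flatnonzero(labels == cls)
        # Assumption: random selection without replacement within each class.
        chosen.append(rng.choice(idx, size=per_class, replace=False))
    pool = rng.permutation(np.concatenate(chosen))
    n_train = int(round(train_fraction * pool.size))
    return pool[:n_train], pool[n_train:]

# Toy labels standing in for a 10-class dataset with 6,000 images per class.
labels = np.repeat(np.arange(10), 6000)
train_idx, test_idx = balanced_subset_split(labels)
print(train_idx.size, test_idx.size)
```

The returned arrays index into the original dataset, so the same split can be applied to both the displayed input images and the corresponding captured camera frames.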

Experimental setup for image classification

As shown in Supplementary Fig. S3, we construct an experimental setup for image classification. The input images sourced from datasets such as MNIST and Fashion-MNIST, each measuring 28 × 28 pixels, are binarised and displayed on a spatial light modulator (SLM; UPOLabs HDSLM80R Plus) positioned in front of the ONN image sensor. The output from the optical layer is detected by a CCD camera, although it could also be captured by a 10 × 10 photodetector array. The captured data are processed using a single-layer FCN to extract relevant information such as image classification results. The optical layer employs a random phase design for feature encoding and dimensionality reduction during image pre-processing.

The light source is a green continuous-wave laser (Oeabt OM-12A520-3-G) with a wavelength of 520 nm and an output power of 3 mW. A 4f optical system consisting of two lenses with focal lengths of 25.4 mm and 150 mm is used to expand the spot diameter. A linear polariser sets the beam polarisation state, orienting it at a 45° angle to the horizontal direction. Subsequently, the light field is reflected by the SLM after phase modulation. A second linear polariser with a 135° polarisation angle enables intensity modulation through polarisation interference. A second 4f system is used to reduce the spot diameter to match the size of the DOE. Finally, after passing through the DOE and focusing lens (focal length: 25 mm), the light field is relayed to a CCD camera (ThorLabs Kiralux CS235CU) for image recording. For compact integration, the camera modules shown in Figs. 1, 3, and 4 are a different model (HIKROBOT MV-CS060-10UC-PRO), selected because it is smaller than the camera described above.
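The polariser-SLM-polariser arrangement can be checked with a small Jones-calculus model. The sketch below is an idealisation: it treats the SLM as a variable retarder that delays only the horizontal field component and ignores the reflection geometry, neither of which is stated in the text. It also computes the nominal magnification of the first 4f system, assuming the 25.4 mm lens comes first. The function names are hypothetical.

```python
import numpy as np

def polariser(theta_deg):
    """Jones matrix of an ideal linear polariser at angle theta to the horizontal."""
    t = np.deg2rad(theta_deg)
    axis = np.array([np.cos(t), np.sin(t)])
    return np.outer(axis, axis)

def slm_retarder(delta):
    """Assumed SLM model: phase delay delta applied to the horizontal component."""
    return np.diag([np.exp(1j * delta), 1.0])

def transmitted_intensity(delta):
    """Intensity after the 45° polariser, the SLM, and the 135° analyser."""
    e_in = polariser(45.0) @ (np.array([1.0, 1.0]) / np.sqrt(2.0))
    e_out = polariser(135.0) @ slm_retarder(delta) @ e_in
    return float(np.vdot(e_out, e_out).real)

# Nominal beam expansion of the first 4f system (assumes f1 = 25.4 mm, f2 = 150 mm).
magnification = 150.0 / 25.4

for delta in (0.0, np.pi / 2, np.pi):
    print(f"delta = {delta:.3f} rad -> I = {transmitted_intensity(delta):.3f}")
print(f"4f magnification = {magnification:.2f}")
```

In this model the transmitted intensity follows sin²(δ/2), so a phase delay of 0 gives a dark pixel and a delay of π gives full transmission, which is how phase modulation on the SLM is converted into the intensity pattern of the binarised input image.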

Acknowledgements

We acknowledge funding support from the HKSAR Research Grants Council, Research Grant Council YCRG C4004-24Y, C1002-22Y, ECS 24203724, 14211224, C4074-22GF, T46-705/23-R, SRFS2526-4S01; Innovation and Technology Commission ITS/237/22; InnoHK Centre projects funded by the Innovation and Technology Commission A-CUHK-16-5-14; NSFC 62405258; Basic Research Program of Jiangsu (No. BK20253062); Fundamental Research Funds for the Central Universities (No. 30925010603); and National Key Laboratory of Integrated Circuits and Microsystems (No. NICL2025KF2001).

Supplementary information

SI for 10.37188-lam.2026.096_Video_1.mp4
SI for 10.37188-lam.2026.096.pdf

References (67)

[1]	LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436-444 (2015).
[2]	Xiao, X., Xu, D. & Wan, W. G. Overview: Video recognition from handcrafted method to deep learning method. 2016 International Conference on Audio, Language and Image Processing (ICALIP). Shanghai, China: IEEE, 2016, 646-651.
[3]	Zhao, Z. Q. et al. Object detection with deep learning: A review. IEEE Transactions on Neural Networks and Learning Systems 30, 3212-3232 (2019).
[4]	Minaee, S. et al. Image segmentation using deep learning: a survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 44, 3523-3542 (2022).
[5]	Huang, Z. et al. Pre-sensor computing with compact multilayer optical neural network. Science Advances 10, eado8516 (2024).
[6]	Hu, J. T. et al. Diffractive optical computing in free space. Nature Communications 15, 1525 (2024).
[7]	Prucnal, P. R. & Shastri, B. J. Neuromorphic Photonics. (Boca Raton: CRC Press, 2017).
[8]	Lin, X. et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004-1008 (2018).
[9]	Luo, Y. et al. Design of task-specific optical systems using broadband diffractive neural networks. Light: Science & Applications 8, 112 (2019).
[10]	Wetzstein, G. et al. Inference in artificial intelligence with deep optics and photonics. Nature 588, 39-47 (2020).
[11]	Chen, H. et al. Diffractive deep neural networks at visible wavelengths. Engineering 7, 1483-1491 (2021).
[12]	Goi, E. et al. Nanoprinted high-neuron-density optical linear perceptrons performing near-infrared inference on a CMOS chip. Light: Science & Applications 10, 40 (2021).
[13]	Shastri, B. J. et al. Photonics for artificial intelligence and neuromorphic computing. Nature Photonics 15, 102-114 (2021).
[14]	Huang, C. R. et al. Prospects and applications of photonic neural networks. Advances in Physics: X 7, 1981155 (2022).
[15]	McMahon, P. L. The physics of optical computing. Nature Reviews Physics 5, 717-734 (2023).
[16]	Fu, T. Z. et al. Optical neural networks: progress and challenges. Light: Science & Applications 13, 263 (2024).
[17]	Mengu, D. et al. Analysis of diffractive optical neural networks and their integration with electronic neural networks. IEEE Journal of Selected Topics in Quantum Electronics 26, 3700114 (2020).
[18]	Bai, B. J. et al. All-optical image classification through unknown random diffusers using a single-pixel diffractive network. Light: Science & Applications 12, 69 (2023).
[19]	Goi, E., Schoenhardt, S. & Gu, M. Direct retrieval of Zernike-based pupil functions using integrated diffractive deep neural networks. Nature Communications 13, 7531 (2022).
[20]	Qu, G. Y. et al. All-dielectric metasurface empowered optical-electronic hybrid neural networks. Laser & Photonics Reviews 16, 2100732 (2022).
[21]	Chen, Y. T. et al. All-optical synthesis chip for large-scale intelligent semantic vision generation. Science 390, 1259-1265 (2025).
[22]	Wang, H. et al. Toward near-perfect diffractive optical elements via nanoscale 3D printing. ACS Nano 14, 10452-10461 (2020).
[23]	Ngo, T. D. et al. Additive manufacturing (3D printing): A review of materials, methods, applications and challenges. Composites Part B: Engineering 143, 172-196 (2018).
[24]	Mengu, D. et al. Misalignment resilient diffractive optical networks. Nanophotonics 9, 4207-4219 (2020).
[25]	Rahimi, A. & Recht, B. Random features for large-scale kernel machines. Proceedings of the 21st International Conference on Neural Information Processing Systems. Vancouver, Canada: Curran Associates Inc., 2007, 1177-1184.
[26]	Bull, G., Gao, J. B. & Antolovich, M. Image segmentation using random features. Proceedings of SPIE 9069, Fifth International Conference on Graphic and Image Processing. Hong Kong, China: SPIE, 2014, 90691Z.
[27]	Saade, A. et al. Random projections through multiple optical scattering: Approximating kernels at the speed of light. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Shanghai, China: IEEE, 2016, 6215-6219.
[28]	Pierangeli, D., Marcucci, G. & Conti, C. Photonic extreme learning machine by free-space optical propagation. Photonics Research 9, 1446-1454 (2021).
[29]	Gigan, S. Imaging and computing with disorder. Nature Physics 18, 980-985 (2022).
[30]	Wang, H. et al. Large-scale photonic computing with nonlinear disordered media. Nature Computational Science 4, 429-439 (2024).
[31]	Luo, M. C. et al. Large-scale artificial intelligence with 41 million nanophotonic neurons on a metasurface. Preprint at https://arxiv.org/abs/2504.20416 (2025).
[32]	Xu, Z. H. et al. Design and analysis of optical extreme learning machine based on free space propagation. Acta Optica Sinica 45, 0320001 (2025).
[33]	Ouyang, W. Q. et al. Ultrafast 3D nanofabrication via digital holography. Nature Communications 14, 1716 (2023).
[34]	Li, N. X. et al. A progress review on solid-state LiDAR and nanophotonics-based LiDAR sensors. Laser & Photonics Reviews 16, 2100511 (2022).
[35]	Culemann, D., Knuettel, A. & Voges, E. Integrated optical sensor in glass for optical coherence tomography (OCT). IEEE Journal of Selected Topics in Quantum Electronics 6, 730-734 (2000).
[36]	Pirzada, M. & Altintas, Z. Recent progress in optical sensors for biomedical diagnostics. Micromachines 11, 356 (2020).
[37]	Xia, F. et al. Nonlinear optical encoding enabled by recurrent linear scattering. Nature Photonics 18, 1067-1075 (2024).
[38]	Saha, S. K. et al. Scalable submicrometer additive manufacturing. Science 366, 105-109 (2019).
[39]	Yang, D. et al. Rapid two-photon polymerization of an arbitrary 3D microstructure with 3D focal field engineering. Macromolecular Rapid Communications 40, 1900041 (2019).
[40]	Bunea, A. I. et al. Micro 3D printing by two-photon polymerization: Configurations and parameters for the nanoscribe system. Micro 1, 164-180 (2021).
[41]	Kiefer, P. et al. A multi-photon (7 × 7)-focus 3D laser printer based on a 3D-printed diffractive optical element and a 3D-printed multi-lens array. Light: Advanced Manufacturing 4, 3 (2024).
[42]	Wang, X. E. et al. 3D nanolithography via holographic multi-focus metalens. Laser & Photonics Reviews 18, 2400181 (2024).
[43]	Chang, J. L. et al. Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification. Scientific Reports 8, 12324 (2018).
[44]	Luo, X. H. et al. Metasurface-enabled on-chip multiplexed diffractive neural networks in the visible. Light: Science & Applications 11, 158 (2022).
[45]	Zhang, H. Y. et al. Multichannel meta-imagers for accelerating machine vision. Nature Nanotechnology 19, 471-478 (2024).
[46]	Sun, M. M. et al. Modeling of two-photon polymerization in the strong-pulse regime. Additive Manufacturing 60, 103241 (2022).
[47]	Baraniuk, R. et al. A simple proof of the restricted isometry property for random matrices. Constructive Approximation 28, 253-263 (2008).
[48]	Liu, J. M. et al. Directional conversion of a THz propagating wave into surface waves in deformable metagratings. Optics Express 29, 21749-21762 (2021).
[49]	Lyu, W. et al. Deep-subwavelength gap modes in all-dielectric metasurfaces for high-efficiency and large-angle wavefront bending. Optics Express 30, 12080-12091 (2022).
[50]	Li, J. X. et al. Class-specific differential detection in diffractive optical neural networks improves inference accuracy. Advanced Photonics 1, 046001 (2019).
[51]	Duan, Z. Y., Chen, H. & Lin, X. Optical multi-task learning using multi-wavelength diffractive deep neural networks. Nanophotonics 12, 893-903 (2023).
[52]	Zhang, J. J. et al. Advanced image classification using a differential diffractive network with “learned” structured illumination. ACS Photonics 11, 5289-5298 (2024).
[53]	Zheng, M. J. et al. Diffractive neural networks with improved expressive power for gray-scale image classification. Photonics Research 12, 1159-1166 (2024).
[54]	Blank, M. et al. Actions as space-time shapes. Tenth IEEE International Conference on Computer Vision (ICCV'05). Beijing, China: IEEE, 2005, 1395-1402.
[55]	Gorelick, L. et al. Actions as space-time shapes. IEEE Transactions on Pattern Analysis and Machine Intelligence 29, 2247-2253 (2007).
[56]	He, K. M. et al. Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas, NV, USA: IEEE, 2016, 770-778.
[57]	Schraivogel, D. et al. High-speed fluorescence image-enabled cell sorting. Science 375, 315-320 (2022).
[58]	LeCun, Y. et al. Gradient-based learning applied to document recognition. Proceedings of the IEEE 86, 2278-2324 (1998).
[59]	Kaggle. Facial keypoints detection (2013). Available at https://www.kaggle.com/c/facial-keypoints-detection.
[60]	Lee, K. C. M. et al. Toward deep biophysical cytometry: prospects and challenges. Trends in Biotechnology 39, 1249-1262 (2021).
[61]	Wang, T. Y. et al. Image sensing with multilayer nonlinear optical neural networks. Nature Photonics 17, 408-415 (2023).
[62]	Chen, Y. T. et al. All-analog photoelectronic chip for high-speed vision tasks. Nature 623, 48-57 (2023).
[63]	Jang, H. et al. In-sensor optoelectronic computing using electrostatically doped silicon. Nature Electronics 5, 519-525 (2022).
[64]	Wang, T. Y. et al. Reconfigurable optoelectronic memristor for in-sensor computing applications. Nano Energy 89, 106291 (2021).
[65]	Bong, K. et al. 14.6 A 0.62mW ultra-low-power convolutional-neural-network face-recognition processor and a CIS integrated with always-on Haar-like face detector. 2017 IEEE International Solid-State Circuits Conference (ISSCC). San Francisco, CA, USA: IEEE, 2017, 248-249.
[66]	Wu, N. F. et al. Intelligent nanophotonics: When machine learning sheds light. eLight 5, 5 (2025).
[67]	Rumi, M. et al. Structure–property relationships for two-photon absorbing chromophores: Bis-donor diphenylpolyene and bis(styryl)benzene derivatives. Journal of the American Chemical Society 122, 9500-9510 (2000).
