Thin-film neural networks for optical inverse problem

Lingjie Fan; Ang Chen; Tongyu Li; Jiao Chu; Yang Tang; Jiajun Wang; Maoxiong Zhao; Tangyao Shen; Minjia Zheng; Fang Guan; Haiwei Yin; Lei Shi; Jian Zi

doi:10.37188/lam.2021.027

Volume 2 Issue 4


Light: Advanced Manufacturing, 2021, 2(4): 395-402


Lingjie Fan ^{1, 2} ,
Ang Chen ² ,
Tongyu Li ^{1, 2} ,
Jiao Chu ¹ ,
Yang Tang ¹ ,
Jiajun Wang ¹ ,
Maoxiong Zhao ^{1, 2} ,
Tangyao Shen ^{1, 2} ,
Minjia Zheng ^{1, 2} ,
Fang Guan ³ ,
Haiwei Yin ² ,
Lei Shi ^{1, 2, 3, 4, *} ,
Jian Zi ^{1, 2, 3, 4, *}

Light: Advanced Manufacturing 2, Article number: (2021)


1. State Key Laboratory of Surface Physics, Key Laboratory of Micro- and Nano-Photonic Structures (Ministry of Education) and Department of Physics, Fudan University, Shanghai 200433, China
2. Shanghai Engineering Research Center of Optical Metrology for Nano-fabrication (SERCOM), Shanghai 200433, China
3. Institute for Nanoelectronic devices and Quantum computing, Fudan University, Shanghai 200438, China
4. Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China

Corresponding author:
Lei Shi (lshi@fudan.edu.cn); Jian Zi (jzi@fudan.edu.cn)
These authors contributed equally: Lingjie Fan, Ang Chen, Tongyu Li
Received: 15 September 2021
Revised: 09 October 2021
Accepted: 27 October 2021
Accepted article preview online: 29 October 2021
Published online: 22 November 2021

doi: https://doi.org/10.37188/lam.2021.027

Abstract

The thin-film optical inverse problem has attracted a great deal of attention in science and industry, and is widely applied to optical coatings. However, as the number of layers increases, the time needed to extract the parameters of thin films increases drastically. Here, we exploit the structural similarity between all-optical neural networks and multilayer thin films and apply it to the optical inverse problem. We propose thin-film neural networks (TFNNs) to efficiently adjust all the parameters of multilayer thin films. To test the performance of TFNNs, we implemented a TFNN algorithm and built a reflectometer operating at normal incidence. For multilayer thin films with 232 layers, TFNNs reduce the time consumed by parameter extraction, and this time grows far more slowly with the number of layers than in the conventional method. TFNNs were also used to design multilayer thin films that mimic the optical response of the three types of cone cells in the human retina. The light passing through these multilayer thin films was then recorded as a colored photo.


References

[1]	Manifacier, J. C., Gasiot, J. & Fillard, J. P. A simple method for the determination of the optical constants n, k and the thickness of a weakly absorbing thin film. Journal of Physics E:Scientific Instruments 9, 1002-1004 (1976). doi: 10.1088/0022-3735/9/11/032
[2]	Ylilammi, M. & Ranta-Aho, T. Optical determination of the film thicknesses in multilayer thin film structures. Thin Solid Films 232, 56-62 (1993). doi: 10.1016/0040-6090(93)90762-E
[3]	Tang, H. et al. Electrical and optical properties of TiO₂ anatase thin films. Journal of Applied Physics 75, 2042-2047 (1994). doi: 10.1063/1.356306
[4]	Kwak, H. et al. Non-destructive thickness characterisation of 3D multilayer semiconductor devices using optical spectral measurements and machine learning. Light:Advanced Manufacturing 2, 9-19 (2021).
[5]	Lin, X. et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004-1008 (2018). doi: 10.1126/science.aat8084
[6]	Zhou, T. K. et al. In situ optical backpropagation training of diffractive optical neural networks: publisher’s note. Photonics Research 8, 1323 (2020). doi: 10.1364/PRJ.401673
[7]	Zhou, T. K. et al. Large-scale neuromorphic optoelectronic computing with a reconfigurable diffractive processing unit. Nature Photonics 15, 367-373 (2021). doi: 10.1038/s41566-021-00796-w
[8]	Yan, T. et al. Fourier-space diffractive deep neural network. Physical Review Letters 123, 023901 (2019). doi: 10.1103/PhysRevLett.123.023901
[9]	Hughes, T. W. et al. Training of photonic neural networks through in situ backpropagation and gradient measurement. Optica 5, 864-871 (2018). doi: 10.1364/OPTICA.5.000864
[10]	Wu, J. M. et al. Analog optical computing for artificial intelligence. Engineering (in the press). doi: 10.1016/j.eng.2021.06.021
[11]	Li, L. F. Formulation and comparison of two recursive matrix algorithms for modeling layered diffraction gratings. Journal of the Optical Society of America A 13, 1024-1035 (1996). doi: 10.1364/JOSAA.13.001024
[12]	Katsidis, C. C. & Siapkas, D. I. General transfer-matrix method for optical multilayer systems with coherent, partially coherent, and incoherent interference. Applied Optics 41, 3978-3987 (2002). doi: 10.1364/AO.41.003978
[13]	Forouhi, A. R. & Bloomer, I. Optical dispersion relations for amorphous semiconductors and amorphous dielectrics. Physical Review B 34, 7018-7026 (1986). doi: 10.1103/PhysRevB.34.7018
[14]	Forouhi, A. R. & Bloomer, I. Optical properties of crystalline semiconductors and dielectrics. Physical Review B 38, 1865-1874 (1988). doi: 10.1103/PhysRevB.38.1865
[15]	Jiang, J. et al. What is the space of spectral sensitivity functions for digital color cameras? 2013 IEEE Workshop on Applications of Computer Vision (WACV). Clearwater Beach, FL, USA: IEEE, 2013. doi: 10.1109/WACV.2013.6475015
[16]	Peurifoy, J. et al. Nanophotonic particle simulation and inverse design using artificial neural networks. Science Advances 4, eaar4206 (2018). doi: 10.1126/sciadv.aar4206
[17]	Liu, D. J. et al. Training deep neural networks for the inverse design of nanophotonic structures. ACS Photonics 5, 1365-1369 (2018). doi: 10.1021/acsphotonics.7b01377
[18]	So, S. et al. Deep learning enabled inverse design in nanophotonics. Nanophotonics 9, 1041-1057 (2020). doi: 10.1515/nanoph-2019-0474
[19]	Molesky, S. et al. Inverse design in nanophotonics. Nature Photonics 12, 659-670 (2018).
[20]	Gao, L. et al. A bidirectional deep neural network for accurate silicon color design. Advanced Materials 31, 1905467 (2019). doi: 10.1002/adma.201905467
[21]	Wu, B. et al. Machine prediction of topological transitions in photonic crystals. Physical Review Applied 14, 044032 (2020). doi: 10.1103/PhysRevApplied.14.044032
[22]	Hu, B. Q. et al. Robust inverse-design of scattering spectrum in core-shell structure using modified denoising autoencoder neural network. Optics Express 27, 36276-36285 (2019). doi: 10.1364/OE.27.036276
[23]	Ma, W. et al. Deep learning for the design of photonic structures. Nature Photonics 15, 77-90 (2021). doi: 10.1038/s41566-020-0685-y
[24]	Ma, W. et al. Probabilistic representation and inverse design of metamaterials based on a deep generative model with semi-supervised learning strategy. Advanced Materials 31, 1901111 (2019). doi: 10.1002/adma.201901111
[25]	Minkov, M. et al. Inverse design of photonic crystals through automatic differentiation. ACS Photonics 7, 1729-1741 (2020). doi: 10.1021/acsphotonics.0c00327
[26]	Liu, V. & Fan, S. H. S⁴ : a free electromagnetic solver for layered periodic structures. Computer Physics Communications 183, 2233-2244 (2012). doi: 10.1016/j.cpc.2012.04.026
[27]	Anderson, E. et al. LAPACK Users’ Guide. 3rd edn. (Philadelphia: Society for Industrial and Applied Mathematics, 1999).
[28]	Madsen, K., Nielsen, H. B. & Tingleff, O. Methods for Non-Linear Least Squares Problems. (IMM, 2004).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.


Figures(6) / Tables(1)


Research Summary

Multilayer thin films as neural networks: From metrology to inverse design

Recently, with the rise of deep learning, neural networks have been widely used to design various photonic structures. However, the previously reported neural networks for inverse design need a large dataset to approximate Maxwell's equations. Such networks do reduce the design time, but the time spent preparing the dataset increases dramatically. Here, the authors propose an alternative way to combine neural networks with photonics. In this approach, a neural-network-like structure is constructed directly from a particular physical process, without any dataset. The parameters of the light-propagation process can then be updated according to the gradient obtained from backpropagation. This alternative method can be readily applied to metrology and inverse design.


Introduction

Optical inverse problems, such as optical metrology and inverse optical design, have long been an active research topic because of their wide applications in science and industry^1–4. In conventional frameworks for the thin-film optical inverse problem, two spaces exist: the parameter space and the data space. All the possible parameters of thin films (e.g., thickness and refractive index as a function of wavelength) form the parameter space, while the optical responses of thin films with different parameters (e.g., reflectance spectra) form the data space. To solve a typical thin-film optical inverse problem, initial parameters are selected in the parameter space as the starting point, and the corresponding optical response in the data space is computed by electromagnetic simulation. The simulated response is then compared with the target response, i.e., the measured spectrum for optical metrology or the desired spectrum for inverse optical design. To determine how the parameters should be updated, several electromagnetic simulations are conducted along each direction of the parameter space, and the differences between the resulting spectra give the parameter changes required to move the simulated response toward the target. This process is repeated iteratively until the simulated response matches the target response. For simple thin films with a few layers, conventional approaches are effective because the parameter space is low-dimensional and few calculations are needed. However, as the number of layers and parameters grows, accurate thickness characterization and the design of multilayer thin films become more difficult owing to the high-dimensional parameter space and lengthy calculations. To satisfy the demand for fast and convenient solutions to the optical inverse problem, a new framework is required.
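The cost of the conventional update described above can be made concrete with a small sketch. It assumes a forward-difference scheme and uses a hypothetical stand-in loss (`toy_loss`) in place of a real electromagnetic simulation; the point is only that each iteration needs one extra simulation per parameter.

```python
from typing import Callable, List, Tuple


def finite_difference_gradient(
    loss: Callable[[List[float]], float],
    params: List[float],
    step: float = 1e-6,
) -> Tuple[List[float], int]:
    """Forward-difference gradient; returns (gradient, number of loss evaluations)."""
    base = loss(params)          # one simulation at the current parameters
    n_evals = 1
    grad = []
    for i in range(len(params)):
        shifted = list(params)
        shifted[i] += step       # perturb one parameter at a time
        grad.append((loss(shifted) - base) / step)
        n_evals += 1             # one additional simulation per parameter
    return grad, n_evals


def toy_loss(p: List[float]) -> float:
    """Hypothetical stand-in for 'simulate the spectrum and compare with the target'."""
    return sum((x - 1.0) ** 2 for x in p)


# 232 parameters, e.g. one thickness per layer of a 232-layer film.
grad, n_evals = finite_difference_gradient(toy_loss, [0.0] * 232)
print(n_evals)
```

With N parameters the loop performs N + 1 loss evaluations, so the number of simulations per iteration grows linearly with the number of layers, and each simulation itself becomes more expensive as layers are added.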

Fundamentally, the optical inverse problem is a parameter optimization process of nanophotonic structures. Recent studies on all-optical neural networks (NNs) have established a correlation between multilayer nanophotonic structures and multilayer NNs by exploiting their structural similarity^5–10, making it possible to optimize the parameters of nanophotonic structures during the learning process based on backpropagation. Therefore, common NN training tasks, such as handwritten digit recognition with the MNIST dataset⁵ and human pose estimation⁷, may be achieved with all-optical NNs at a high precision. In this paper, we introduce the idea of exploiting the structural similarity of all-optical NNs for application to the optical inverse problem. Thin-film NNs (TFNNs) are proposed to optimize and extract all the multilayer thin film parameters during the backpropagation process. As the input of TFNNs, incident light fields with normalized source spectra propagate through every film following the calculation steps in the transfer matrix method. Similar to the weights and activation functions when NNs connect two neural layers, transfer matrices characterize the propagation process of the light fields in TFNNs between two thin-film layers. The outputs of TFNNs are the reflectance and transmittance spectra. The thickness and refractive index of each layer in the thin films become the TFNN parameters. Then, in the new framework, the thickness and refractive index in each layer in the thin films can be optimized through the training process based on backpropagation in TFNNs, which is very similar to the process in NNs.
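The forward pass described above can be sketched with the standard transfer matrix method at normal incidence. This is a minimal illustration, not the authors' implementation: the function names, the default ambient and substrate indices, and the use of a single wavelength with fixed refractive indices are all assumptions made for the example.

```python
import cmath
import math


def matmul2(a, b):
    """Product of two 2 x 2 matrices stored as nested tuples."""
    return (
        (a[0][0] * b[0][0] + a[0][1] * b[1][0], a[0][0] * b[0][1] + a[0][1] * b[1][1]),
        (a[1][0] * b[0][0] + a[1][1] * b[1][0], a[1][0] * b[0][1] + a[1][1] * b[1][1]),
    )


def interface_matrix(n_a, n_b):
    """Interface between media n_a and n_b (Fresnel coefficients, normal incidence)."""
    r = (n_a - n_b) / (n_a + n_b)
    t = 2 * n_a / (n_a + n_b)
    return ((1 / t, r / t), (r / t, 1 / t))


def propagation_matrix(n, d, wavelength):
    """Phase accumulated across a uniform layer of index n and thickness d."""
    delta = 2 * math.pi * n * d / wavelength
    return ((cmath.exp(-1j * delta), 0), (0, cmath.exp(1j * delta)))


def reflectance(indices, thicknesses, wavelength, n_in=1.0, n_sub=1.5):
    """Reflectance of a layer stack on a semi-infinite substrate."""
    m = ((1, 0), (0, 1))
    n_prev = n_in
    for n, d in zip(indices, thicknesses):
        m = matmul2(m, interface_matrix(n_prev, n))        # plays the role of a weight
        m = matmul2(m, propagation_matrix(n, d, wavelength))
        n_prev = n
    m = matmul2(m, interface_matrix(n_prev, n_sub))
    r = m[1][0] / m[0][0]                                   # amplitude reflection coefficient
    return abs(r) ** 2


n_ar = 1.5 ** 0.5
print(reflectance([], [], 550.0))                           # bare substrate
print(reflectance([n_ar], [550.0 / (4 * n_ar)], 550.0))     # quarter-wave coating
```

For the bare substrate the result is the familiar 4% reflectance of an air-glass interface, and the quarter-wave layer with index equal to the square root of 1.5 suppresses the reflection at the design wavelength up to rounding error.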

In this paper, we first explain the principle of the optical inverse problem within the NN-like framework. The mapping from the data space to the parameter space is implicit in the training process of TFNNs. Then, the mathematical details of TFNNs are presented by exploiting their structural similarity with multilayer NNs. In the section on optical metrology, the reflectance of thin films at normal incidence is measured as the target for training TFNNs. For monolayer thin films, both the thickness and refractive index of the layer are optimized. Multilayer thin films are treated as TFNNs to optimize all the thicknesses in hundreds of layers. The time required for optimization is significantly shortened compared with conventional methods (e.g., for thin films with 232 layers, conventional approaches take 67.498 s per iteration, whereas our method takes 0.924 s per iteration). In the section on inverse optical design, the design of multilayer thin films based on TFNNs is introduced. We then design and fabricate three types of multilayer thin films that mimic the three types of cone cells in the human retina. An image-forming system is built, which records the light passing through these multilayer thin films as a colored photo.

Discussion

In summary, we exploited the structural similarity between all-optical NNs and thin films and applied it to the optical inverse problem, proposing TFNNs as a new framework for the thin-film optical inverse problem. A connection between multilayer thin films and multilayer NNs was constructed on the basis of this similarity. In optical metrology, training TFNNs to extract thin-film parameters allows all the parameters in monolayer and multilayer thin films to be optimized effectively. In inverse optical design, we introduced the design idea and process of TFNNs, and then used TFNNs to design multilayer thin films that mimic the optical response of the three types of cone cells in the human retina.

To give a more in-depth understanding of TFNNs, we distinguish them from previously reported artificial neural networks (ANNs) for inverse optical design^16–24. In ANNs, the structural parameters and thickness of each layer are used as the input, while the optical responses and spectra of the thin films are used as the output. The ANNs then learn to approximate Maxwell’s equations. Training ANNs requires a dataset (50,000 samples for thin films with 8 layers¹⁶, and 500,000 samples for thin films with 20 layers¹⁷). The details of using ANNs to solve the optical inverse problem of thin films with hundreds of layers are provided in the Supplementary Information (S7). In contrast, the input of TFNNs is a normalized source spectrum, and the output is the reflectance spectrum of the thin films. Because TFNNs are constructed directly from Maxwell’s equations, the spectra they generate are accurate. During training, the parameters are updated according to the gradient obtained from the backpropagation process of TFNNs, so no dataset is needed even for thin films with hundreds of layers. The details of the comparison between TFNNs and ANNs are presented in the Supplementary Information (S8).
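One way to see how a gradient can be obtained without a dataset is to differentiate the transfer-matrix product analytically. The sketch below is an illustration under stated assumptions, not the authors' C implementation: it works at a single wavelength with fixed refractive indices, the name `reflectance_and_gradient` is hypothetical, and the derivative with respect to every layer thickness is assembled from one forward sweep (stored partial products) and one backward sweep, in the spirit of backpropagation.

```python
import cmath
import math


def matmul2(a, b):
    """Product of two 2 x 2 matrices stored as nested tuples."""
    return (
        (a[0][0] * b[0][0] + a[0][1] * b[1][0], a[0][0] * b[0][1] + a[0][1] * b[1][1]),
        (a[1][0] * b[0][0] + a[1][1] * b[1][0], a[1][0] * b[0][1] + a[1][1] * b[1][1]),
    )


def interface_matrix(n_a, n_b):
    """Interface between media n_a and n_b at normal incidence."""
    r = (n_a - n_b) / (n_a + n_b)
    t = 2 * n_a / (n_a + n_b)
    return ((1 / t, r / t), (r / t, 1 / t))


def reflectance_and_gradient(indices, thicknesses, wavelength, n_in=1.0, n_sub=1.5):
    """Return R and dR/dd_j for every layer thickness d_j."""
    ns = [n_in] + list(indices) + [n_sub]
    # Forward sweep: left[j] holds the product of all factors in front of layer j.
    left, props, m = [], [], ((1, 0), (0, 1))
    for j, d in enumerate(thicknesses, start=1):
        m = matmul2(m, interface_matrix(ns[j - 1], ns[j]))
        left.append(m)
        k = 2 * math.pi * ns[j] / wavelength
        p = ((cmath.exp(-1j * k * d), 0), (0, cmath.exp(1j * k * d)))
        props.append((k, p))
        m = matmul2(m, p)
    last = interface_matrix(ns[-2], ns[-1])
    m = matmul2(m, last)
    r = m[1][0] / m[0][0]
    # Backward sweep: right holds the product of all factors behind layer j.
    grads = [0.0] * len(thicknesses)
    right = last
    for j in range(len(thicknesses) - 1, -1, -1):
        k, p = props[j]
        dp = ((-1j * k * p[0][0], 0), (0, 1j * k * p[1][1]))   # dP/dd for this layer
        dm = matmul2(matmul2(left[j], dp), right)
        dr = (dm[1][0] * m[0][0] - m[1][0] * dm[0][0]) / m[0][0] ** 2
        grads[j] = 2 * (r.conjugate() * dr).real                # d|r|^2 / dd
        right = matmul2(interface_matrix(ns[j], ns[j + 1]), matmul2(p, right))
    return abs(r) ** 2, grads


R, grads = reflectance_and_gradient([2.3, 1.46, 2.3], [60.0, 95.0, 60.0], 550.0)
print(R, grads)
```

The number of 2 × 2 matrix products here grows linearly with the number of layers, whereas perturbing each thickness separately would repeat the whole forward product once per layer.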

For the further development of TFNNs, we note that the interface matrix and propagation matrix are both 2 × 2 complex matrices. This indicates that there are only two complex neurons in each layer. To add more neurons to each layer, uniform layers can be replaced by textured layers²⁵. In a periodic textured medium, the electromagnetic fields and permittivity function are expanded into Fourier series to determine the eigensolutions of Maxwell’s equations. The eigenmodes interact with each other at the interfaces and propagate independently in the bulk of each layer. The size of the interface matrix and propagation matrix then depends on the order of the Fourier expansion²⁶. The extension of this method to other nanophotonic structures is discussed in the Supplementary Information.

Materials and methods

Implementation of TFNNs.

TFNNs were implemented in a custom program written in C, chosen for its execution speed. The framework is divided into three parts: LinearC, Model, and LMAlgo (details in the Supplementary Information, S15). LinearC provides basic mathematical functions based on BLAS and LAPACK²⁷. Model constructs the forward propagation and backpropagation processes of TFNNs. LMAlgo is the optimization algorithm for training TFNNs, based on the Levenberg–Marquardt algorithm²⁸.
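As a rough illustration of the kind of update a Levenberg–Marquardt optimizer performs, the sketch below fits the thickness of a single film to a synthetic reflectance spectrum. It is written in Python rather than C and is not the LMAlgo module: the function names, the fixed indices (film 2.0, substrate 1.5), the 400 to 800 nm wavelength grid, the multiplicative damping rule, and the finite-difference Jacobian (used only for brevity) are all assumptions of this example.

```python
import cmath
import math


def film_reflectance(d, wavelength, n_film=2.0, n_in=1.0, n_sub=1.5):
    """Airy formula for one film on a substrate at normal incidence."""
    r01 = (n_in - n_film) / (n_in + n_film)
    r12 = (n_film - n_sub) / (n_film + n_sub)
    phase = cmath.exp(2j * 2 * math.pi * n_film * d / wavelength)
    r = (r01 + r12 * phase) / (1 + r01 * r12 * phase)
    return abs(r) ** 2


def fit_thickness(target, wavelengths, d0, n_iter=30, damping=1e-3, h=1e-3):
    """One-parameter Levenberg-Marquardt fit with accept/reject damping control."""
    def residuals(d):
        return [film_reflectance(d, wl) - t for wl, t in zip(wavelengths, target)]

    d = d0
    res = residuals(d)
    loss = sum(x * x for x in res)
    for _ in range(n_iter):
        # Central-difference Jacobian of the residuals with respect to d.
        jac = [
            (film_reflectance(d + h, wl) - film_reflectance(d - h, wl)) / (2 * h)
            for wl in wavelengths
        ]
        g = sum(j * x for j, x in zip(jac, res))    # J^T r
        hess = sum(j * j for j in jac)              # J^T J (a scalar here)
        if hess == 0.0:
            break
        step = -g / (hess * (1.0 + damping))        # damped Gauss-Newton step
        trial = residuals(d + step)
        trial_loss = sum(x * x for x in trial)
        if trial_loss < loss:                       # accept and trust the model more
            d, res, loss = d + step, trial, trial_loss
            damping /= 10.0
        else:                                       # reject and damp more strongly
            damping *= 10.0
    return d


wavelengths = [400.0 + 10.0 * i for i in range(41)]
target = [film_reflectance(300.0, wl) for wl in wavelengths]   # synthetic "measurement"
d_fit = fit_thickness(target, wavelengths, d0=290.0)
print(d_fit)
```

In a multi-parameter setting the scalar division becomes a linear solve of the damped normal equations, which is where routines from BLAS and LAPACK would typically be used.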

Platforms for testing TFNNs.

To provide a fair performance comparison, we tested the conventional framework and TFNNs on the same platform: a personal computer with an Intel(R) Core(TM) i5-4210H CPU (2.90 GHz). Both the conventional framework and TFNNs used C as the backend and Python as the frontend, connected through C extensions. Python was chosen only for its convenience and widespread use; other frontends, such as C#, could also be built for other purposes.

Acknowledgements

This work was supported by the China National Key Basic Research Program (2018YFA0306201) and the National Science Foundation of China (11774063, 11727811, and 91963212). A.C. was supported by the Shanghai Rising-Star Program (20QR1402200). L.S. was further supported by the Science and Technology Commission of Shanghai Municipality (19XD143600, 2019SHZDZX01, 19DZ2253000, 20501110500).

Supplementary information

Supplementary Information for Thin-film neural networks for optical inverse problem.pdf


The number of layers in multilayer thin films	2	60	232
Time required per iteration in conventional framework	0.034 s	3.196 s	67.498 s
Time required per iteration in TFNNs	0.042 s	0.166 s	0.924 s
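The timings in the table can be turned into per-iteration speedup factors with a few lines of arithmetic. The snippet below only restates the tabulated values; the variable names are chosen for this example.

```python
# Values copied from the table above (seconds per iteration).
layers = [2, 60, 232]
conventional_s = [0.034, 3.196, 67.498]
tfnn_s = [0.042, 0.166, 0.924]

# Speedup of TFNNs over the conventional framework for each layer count.
speedup = {n: c / t for n, c, t in zip(layers, conventional_s, tfnn_s)}

# Growth of the per-iteration time from 2 to 232 layers in each framework.
conventional_growth = conventional_s[-1] / conventional_s[0]
tfnn_growth = tfnn_s[-1] / tfnn_s[0]

for n in layers:
    print(f"{n:>3} layers: {speedup[n]:.2f}x")
print(f"growth, conventional: {conventional_growth:.0f}x; TFNNs: {tfnn_growth:.0f}x")
```

For 2 layers the ratio is about 0.81, so TFNNs are slightly slower there; for 60 and 232 layers the ratios are about 19 and 73. From 2 to 232 layers the conventional time per iteration grows by a factor of roughly 2000, while the TFNN time grows by a factor of 22.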
