Ultracompact meta-imagers for arbitrary all-optical convolution

Fu, Weiwei; Zhao, Dong; Li, Ziqin; Liu, Songde; Tian, Chao; Huang, Kun

doi:10.1038/s41377-022-00752-5

Download PDF

Article
Open access
Published: 18 March 2022

Ultracompact meta-imagers for arbitrary all-optical convolution

Weiwei Fu¹^na1,
Dong Zhao¹^na1,
Ziqin Li¹^na1,
Songde Liu^2,3,
Chao Tian^2,3 &
…
Kun Huang¹

Light: Science & Applications volume 11, Article number: 62 (2022) Cite this article

8909 Accesses
54 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Electronic digital convolutions could extract key features of objects for data processing and information identification in artificial intelligence, but they are time-cost and energy consumption due to the low response of electrons. Although massless photons enable high-speed and low-loss analog convolutions, two existing all-optical approaches including Fourier filtering and Green’s function have either limited functionality or bulky volume, thus restricting their applications in smart systems. Here, we report all-optical convolutional computing with a metasurface-singlet or -doublet imager, considered as the third approach, where its point spread function is modified arbitrarily via a complex-amplitude meta-modulator that enables functionality-unlimited kernels. Beyond one- and two-dimensional spatial differentiation, we demonstrate real-time, parallel, and analog convolutional processing of optical and biological specimens with challenging pepper-salt denoising and edge enhancement, which significantly enrich the toolkit of all-optical computing. Such meta-imager approach bridges multi-functionality and high-integration in all-optical convolutions, meanwhile possessing good architecture compatibility with digital convolutional neural networks.

Single-layer spatial analog meta-processor for imaging processing

Article Open access 21 April 2022

Multichannel meta-imagers for accelerating machine vision

Article 04 January 2024

Metasurface-enabled on-chip multiplexed diffractive neural networks in the visible

Article Open access 27 May 2022

Introduction

Artificial intelligence (AI) has recently gained rapid development in academies and industry due to intense research of deep convolutional neural networks (CNN) with a multilayer architecture¹. In each layer, numerous convolutional operators with functionality-assigned kernels are implemented to extract important features of objects for identification, but they are extremely time-consuming with the increment of AI tasks². Although advanced electronic devices such as graphics processing units³, field-programmable gate arrays⁴, and tensor processing unit⁵ have been proposed to accelerate the computation, the speed and energy consumption are still limited by the low response of electrons, such as charging and discharging in capacitance, electromagnetic radiation and create heat by the movement of the electrons in materials⁶.

In comparison, photons as massless bosons allow lossless propagation and manipulation of light through large-bandgap transparent materials for optical parallel analog computing without analog-to-digital and digital-to-analog convertors^7,8,9, hereby enabling high-speed and low-consumption computation. Currently, all-optical convolutional computing has two main approaches¹⁰: Fourier spatial filtering^{11,12,13,14,15,16,17} and Green’s function (GF)^{18,19,20,21,22,23,24,25,26,27}. The Fourier method employs a couple of lenses to realize spatial-frequency transform of original data for spatial-spectrum filtering via a modulation mask that is traditionally pure-phase or pure-amplitude, and inverse transforms for reconstructing processed data. Such a configuration with multiple elements is not preferred in integrated photonic systems, meanwhile, the complex filter with both amplitude and phase modulation is mandatorily needed for an arbitrary convolutional operation but unachievable for most traditional optical elements. The GF approaches implement optical analog computing by modulating angle-dependent transmittance (or reflection) with surface plasmon polaritons¹⁸, artificial nano-resonators¹⁹, photonic crystals²⁰, metasurfaces^{21,22,23,24,25}, topological photonics²⁶, and spin Hall effect of light²⁷. Although some GF approaches operating in a single device enable high integration, they have limited angular responses (usually valid for only one special operation), which are insufficient for arbitrary convolutional computing in AI and image processing. Since both all-optical approaches have the drawbacks of either low integration or limited functionality, wavelength-multiplexing technologies combining optical and electronic operations have recently been demonstrated to accelerate convolutional computing in neural networks^28,29, exhibiting a significant enhancement in speed compared with electronic computing. However, the electronic parts in such optoelectronic systems still constrain the ultimate speed.

Here, we propose a compact meta-imager to realize all-optical convolutional computing with arbitrary kernels in a parallel and real-time way. This meta-imager contains two parts: a metalens for image formation and the other complex-amplitude meta-modulator for reshaping its point spread function (PSF), which can be highly integrated into a single meta-device. By correlating the convolutional operator of an arbitrary 3 × 3 matrix, we realize the corresponding complex-amplitude meta-modulator via geometric dielectric metasurfaces. We have successfully demonstrated multiple convolutional operations, such as spatial differentiation, denoising, edge detection, and enhancement, to improve the imaging quality of optical (phase- and amplitude-type) and biological samples (e.g., chromosome molecules, onion, and oral epidermal cells) with different magnifications and spatial resolutions. Such a combination of both multiple functionalities and compact volume is not possible in other approaches.

Working principles of meta-imager

Wave theory of light predicts image formation in a lens system as a convolutional operation between the optical field of an object and the PSF of the lens³⁰ (Fig. 1a). With the PSF working as a kernel, the imaging system offers a natural choice for convolutional operation in a parallel, analog, and low-consumption way. However, the PSF in the imaging system usually has a fixed pattern such as Airy spot³¹, which cannot support arbitrary operation required in imaging processing and CNN.

**Fig. 1: Working principle of meta-imager for arbitrary convolution operation.**

To realize an arbitrary convolutional operation, we propose a meta-imager, composed of a metalens and a complex-amplitude meta-modulator (Fig. 1b), with a modified PSF. The metalens (with a focal length of f) and the meta-modulator (its complex amplitude denoted by h) are spaced by one-layer (e.g., substrate) or multilayer (e.g., air and substrate) transparent medium with an optical thickness of d = n_airt_air + n_subt_sub, where n_air/sub and t_air/sub are the refractive indices and the thicknesses of the air/substrate layer, respectively. To unveil the PSF of this meta-imager, a point source with its position (x₀,y₀) located at the object plane is assumed to illuminate the meta-imager. After a rigorous mathematical derivation (see Section 1 in Supplementary Materials), we have its pulse response at the image plane

$$\begin{array}{l}{\rm{PSF}}_{\mathrm{meta}}({x}_{3},{y}_{3};{x}_{0},{y}_{0})\propto {e}^{ik\frac{M}{2f}\cdot \frac{f-d}{{l}_{2}-d}\cdot {r}_{0}^{2}}\cdot\\ \left[\tilde{P}\left({x}_{3}+M{x}_{0},{y}_{3}\,+\,M{y}_{0}\right)\right]\otimes \left[F{(h)}_{{f}_{x}=\frac{{x}_{3}\,+\,M{x}_{0}}{\lambda \left({l}_{2}-d\right)},{f}_{y}\,=\,\frac{{y}_{3}+M{y}_{0}}{\lambda ({l}_{2}-d)}}\right]\end {array}$$

(1)

where the wavenumber k = 2π/λ, λ is the operating wavelength, r₀² = x₀² + y₀², l₁ and l₂ are the objects and image distances, the magnification M = l₂/l₁, $\tilde{P}$ is the PSF of the metalens without any modulator, F() stands for Fourier transform, the sign $\otimes$ is the convolution operation, x₃ and y₃ are the spatial coordinates at the image plane. In Eq. (1), the parabolic phase with the r₀-position dependence introduces optical off-axis aberration such as coma and distortion, which, however, can be eliminated if d = f.

If an object has the electric field U(x₀,y₀), its coherent image can be expressed as³²

$$\begin{array}{l}U({x}_{3},{y}_{3})=U({x}_{0},{y}_{0})\otimes {\rm{PSF}_{meta}}=U({x}_{0},{y}_{0})\\ \quad\,\qquad\qquad\otimes \tilde{P}\otimes F(h)=U{^{\prime}} ({x}_{3},{y}_{3})\otimes {\mathcal H}\end {array}$$

(2)

where $U{^{\prime}} ({x}_{3},{y}_{3})=U({x}_{0},{y}_{0})\otimes \tilde{P}$ is the image by the metalens without the modulator, and the item ${\mathcal H} =F(h)$ works as a convolutional operator. Equation (2) indicates that our meta-imager yields a convolutional operation between the magnified image and a spatial spectrum of the meta-modulator. Note that, only one lens is needed in this meta-imager, which therefore allows for lower cost and higher integration than the Fourier filtering system^{11,12,13,14,15,16,17}. The spatial spectrum of the meta-modulator offers designable and customized kernels for various convolutional operations. h can be obtained by inverse Fourier transform of the expected operator, i.e., h = F⁻¹(ℋ), where F⁻¹() denotes the inverse Fourier transform and the convolutional operator ℋ of a 3 × 3 matrix is employed here for high compatibility with traditional image processing³³ and CNN². It, therefore, bridges the gap between the convolutional operator and the meta-modulator. Detailed instruction about getting the complex amplitude of the meta-modulator from a given matrix-type operator is provided in “Methods”.

To realize the complex-amplitude modulation, we utilize transmissive dielectric geometric metasurfaces^{34,35,36,37,38,39,40,41,42} composed of orientation-rotated nanobricks that could transfer circularly polarized incident light into its cross-polarized light with an additional phase of twice the rotation angle θ^41,42. The conversion efficiency, related to the amplitude modulation, is determined by the dimension of the nanobricks. Since the rotation and the dimension of nanobricks can be manipulated separately (see Section 2 in Supplementary Materials), the phase and amplitude of the cross-polarized transmitted light are customized independently with a high spatial precision of subwavelength scale, hereby superior to traditional diffractive optical elements⁴³, spatial light modulators⁴⁴, and digital micromirror devices⁴⁵. Experimentally, the dielectric geometric metasurfaces are demonstrated in a 300-nm thick (i.e., H = 300 nm) crystalline silicon (c-Si) film on a sapphire substrate. To facilitate the fabrication, the nanobricks have the fixed widths W = 110 nm, leaving the only variable (the length L) to modulate the amplitude.

Figure 1c presents the simulated and experimental amplitude profiles with a peak located around L = 165 nm, suggesting that both increasing and falling edges of this peak can be used to modulate the amplitude. At both edges, the amplitude changes quickly, which implies strong sensitivity to L and therefore needs high-quality fabrication. To release it, we utilize the discrete amplitude with three levels for doublet meta-imager (d ≠ 0) and five levels for singlet meta-imager (d = 0). Despite the error caused by this discretization of amplitude, simulated convolutions (see Section 3 in Supplementary Materials) are seldom influenced, since 128-level phase modulation (Fig. 1d) is employed here to compensate for the accuracy.

All-optical convolution via a doublet meta-imager

For doublet meta-imager, the metalens and meta-modulator are fabricated separately in different specimens to facilitate their alignment, so that its realistic spacer contains both sapphire substrates of the metalens and meta-modulator and the in-between air. Its corresponding optical thickness d = f is used in this doublet meta-imager for reducing optical aberration. Figure 1d shows the fabricated metalens with good imaging and focusing functionalities, see the measurement details in Methods and Section 6 in Supplementary Materials.

Edge detection

Since Eq. (1) allows arbitrary convolution operator ℋ, we realize edge detection of an object by using a complex-valued operator composed of two orthogonal (x and y direction) differentiation (see Fig. 2a)

$${ {\mathcal H} }_{ED}={ {\mathcal H} }_{x}+i{ {\mathcal H} }_{y}=\left[\begin{array}{ccc}-1 & 0 & 1\\ -2 & 0 & 2\\ -1 & 0 & 1\end{array}\right]+i\,\left[\begin{array}{ccc}-1 & -2 & -1\\ 0 & 0 & 0\\ 1 & 2 & 1\end{array}\right]$$

(3)

where the real part ℋ_x and the imagery part ℋ_y denote the x- and y-direction operators³³, respectively. In Eq. (3), every matrix element has the spatial pitch of w₀ × w₀, which defines the detection accuracy of the convolutional operator. After substituting Eq. (3) into Eq. (2), we have I = U′_x² + U′_y² (where U′_x = U′⨂ℋ_x and U′_y = U′⨂ℋ_y denote the x- and y-direction differentiations, respectively), which therefore reveals the edge details of the original image U′. By implementing the inverse Fourier transform of ℋ_ED, we obtain the complex amplitude of the meta-modulator, which has a helical phase and a doughnut-shape amplitude (Fig. 2a). For the edge-detection meta-imager, these detailed analyses result naturally in the azimuthal phase of exp(iφ), which is also required in spiral phase contrast microscopy (see Section 14 in Supplementary Materials)^46,47,48. However, our meta-imager needs an additional amplitude modulation that can be used to control the detection accuracy (as shown later). The fabricated meta-modulator has been characterized (Fig. 2b–g) and discussed in more detail in Methods.

Firstly, we detect the edge of an amplitude object (“USTC” etched through a chromium film, see Fig. 2h) in a self-made optical system (see Section 8 in Supplementary Materials). Our meta-imager yields the clear edge of “USTC” (lower panel), with the magnification M = 1. Other larger-magnification edges by using the same meta-imager are also obtained without the loss of detection quality (see Section 9 in Supplementary Materials). The uniform edges with high contrast to the background are superior to others reported results^7,9, implying an efficient edge detection. The realistic detection accuracy, evaluated by the width of the outputted edge, is determined by the mutual interplay between the PSF of the metalens and the spatial pitch w₀ of the convolutional operator (as indicated in Eqs. (1) and (2)). Figure 2i shows the achieved accuracy of R = 3.9 μm at M = 1 and w₀ = 1.5 μm, with high consistency between simulation and experiment. To unveil its dependence on w₀, we have fabricated the meta-modulators with different w₀ and measured the accuracy (see Fig. 2j) at M = 1. When w₀ is larger than the spot size (denoted by r₀ = 3.26 μm at M = 1) of the metalens’ PSF, the realistic accuracy exhibits a quasi-linear dependence on w₀. But, when w₀ < r₀, the PSF of the metalens dominates the achieved accuracy, which is nearly constant for a given metalens. Therefore, the detection accuracy in our meta-imager is determined by the larger one between w₀ and r₀. A higher-NA metalens can enhance the detection accuracy but at the cost of the decreased efficiency (caused by angle-dependent polarization conversion⁴¹) and shrunken field-of-view (induced by the increment of optical aberration³¹). Our current meta-imager enables edge detection with a field-of-view better than 710 μm × 710 μm at M = 1 (see Section 10 in Supplementary Materials).

This meta-imager can also extract the edge of a pure-phase object. Figure 2k shows an optical microscope image of a transparent dolphin with an etched depth of 400 nm on a quartz substrate, leaving a phase jump of 0.577π at λ = 633 nm. Under the illumination, this binary-phase dolphin has a uniform transmission (Fig. 2l) over the entire field of view, except at the dark boundary caused by the phase discontinuity. When this dolphin works as the input object, our meta-imager highlights only the edge but suppresses the background (Fig. 2m), implying the better identification of the object. The contrast of the identified edge to the background is maximum (the best identification) for the phase jump of (2n + 1)π but minimum (no identification) for the phase jump of 2nπ, where n is an integer. Experimentally, we have checked two-phase jumps of 0.433π (300-nm depth) and 0.577π (400-nm depth), which yield well-consistent contrasts with the simulations (see Fig. 2h).

In addition, the meta-imager is employed further to detect the edges of chromosome molecules. The microscopic reflective (Fig. 2o) and magnified transmissive (Fig. 2p) images of weakly dyed chromosome molecules with both amplitude and phase show either low contrast or blurred edges, which is insufficient for distinguishing these chromosomes molecules. In comparison, our meta-imager outlines these chromosome molecules by highlighting the edges in both simulation (Fig. 2q) and experiment (Fig. 2r). The inhomogeneity of the experimental edges originates mainly from optical misalignment between the metalens and the meta-modulator, meanwhile, the unresolved small gaps between two neighboring molecules also weaken the edges.

Note that, all the demonstrations for edge detection are achieved in parallel at the speed of light, leaving the processing time of ~10⁻¹¹ s (evaluated by the ratio of the optical path to the speed of light), enhanced by 9 orders of magnitude compared with electronic digital convolutions (at the level of ~10⁻² s, estimated by implementing a 2-dimensional convolution between two 100 × 100 matrices with MATLAB software in a personal computer (CPU: Intel Core I7-7500U)). It, therefore, enables real-time edge detection of a running “horse” in a movie (Movies 1 and 2). To simulate high-speed moving objects in a laboratory environment, we have loaded the “horse-running” video onto an amplitude-type spatial light modulator (SLM, Holoeye) with a refresh frequency of 60 Hz. The SLM is located at the input plane of our doublet meta-imager with the parameters M = 1 and w₀ = 1.5 μm. Thus, the “horse-running” video is taken as the object. Correspondingly, the detected results at the output plane are captured in real-time by our camera (Thorlabs). The real-time recording of the detected “running-horse” is provided in Supplementary Video 2, while Supplementary Video 1, as a control case, shows the real-time “running-horse” without the meta-modulator.

Moreover, edge enhancement has also been demonstrated by using a real-value operator

$${ {\mathcal H} }_{EE}=\left[\begin{array}{ccc}-1 & -1 & -1\\ -1 & 12 & -1\\ -1 & -1 & -1\end{array}\right]$$

(4)

which leads to a complex meta-modulator with saddle-shaped amplitude and linear phase (Fig. S10). Since edge enhancement resembles closely edge detection, its detailed discussions are provided in Section 11 in Supplementary Materials, where the good agreement between simulated and experimental results confirms its validity.

Spatial differentiation

One-dimensional spatial differentiation is important to extract or remove directional details in imaging and data processing³³. As an example, a meta-modulator realizing 135°-direction differentiation is proposed by linking a convolutional operator

$${ {\mathcal H} }_{SD}=\left[\begin{array}{ccc}-2 & -1 & 0\\ -1 & 0 & 1\\ 0 & 1 & 2\end{array}\right]$$

(5)

which works as a directional derivative filter. Its corresponding meta-modulator (see the microscopic image in Fig. 3b and SEM image in Fig. 3c) has the expected amplitude and phase modulation (see Fig. 3d–g and Methods).

To validate its performance, a binary-amplitude object with 135°-direction line-shape defects (Fig. 3h) is used as the input of this differentiation meta-imager. Figure 3i shows the outputted results, where the defects are removed completely for better identification of “USTC”. Additionally, this differentiator also yields the edges of objects along the designed direction (135°–315°) but eliminates the edges along the orthogonal direction (45°–225°). The azimuthal intensity profiles of the processed “C” are illustrated in Fig. 3j, showing a cancellation range of 30° near the 45°–225° direction. It implies that the 5/6 contour of an object can be detected by using this differentiator.

Furthermore, this differentiator is employed to filter out undesired directional defects in onion epidermal cells (Fig. 3k). The transmitted patterns (Fig. 3l) of the epidermal cells show line-shape darkness (where the opaque defect lies) with the blurred and unresolved boundaries between the two cells. In Fig. 3m, our meta-imager creates the defect-free long-axis edges of the cells, meanwhile eliminating short-axis edges that are nearly parallel to the differentiation direction. Experimentally, the short-axis edges can be extracted via rotating the cells or the meta-modulator by 90°.

Denoising

Pepper-salt noise refers to randomly distributed defects existing in various imaging systems and is removed usually by the median filtering method in digital data processing³³. However, to the best of our knowledge, its all-optical solution to pepper-salt denoising has not to be reported yet due to the limited functionalities in the previous approaches^7,9. By linking the electronic convolutional operators and all-optical meta-modulators straightforwardly, our meta-imager suggests one solution to all-optical pepper-salt denoising. Here, we propose a convolutional operator

$${ {\mathcal H} }_{\rm{PSDN}}=\left[\begin{array}{ccc}1 & 1 & 1\\ 1 & 0 & 1\\ 1 & 1 & 1\end{array}\right]$$

(6)

which removes the random noise by the averaged intensity at the surrounding eight positions of the defect. The resulting meta-modulator has been fabricated (see the microscopic image in Fig. 4b and SEM image in Fig. 4c) in high quality, exhibiting well-performed phase and amplitude (see Fig. 4d–g and more details in Methods).

Figure 4h shows the image of a butterfly pattern with random defects having the size of 4 μm × 4 μm. By employing the denoising meta-imager (with a well-matched accuracy w₀ = 4 μm), we obtain the processed image with much-enhanced uniformity (Fig. 4i), where the dark defects have been removed efficiently with nearly equal intensity to their surroundings. Figure 4j illustrates the line (GG′ in Fig. 4j)-scanning intensity profiles across the exemplified defect. In contrast to the non-denoised (without modulator) case, the denoising meta-imager enhances the valley (minimum) intensity within the defect by ~6 times. The denoised intensity around the defects has a slight variation (see the blue dots in Fig. 4j) caused by the interference with the co-polarized background, which, however, can be suppressed by using a higher-extinction polarization analyzer. Nevertheless, the ratios (Fig. 4k) of the averaged intensity within the defect to the ambient intensity approach are 0.9, which is enhanced by 2.25 times compared to ~0.4 for the non-denoised case, indicating the improved homogeneity. To quantify the uniformity, the root-mean-square error (RMSE) between the normalized intensity around the defect and the ideal flat intensity is employed and shown in Fig. 4l. The RMSE of ~0.5 for the non-denoised case is suppressed down to ~0.18 after denoising, implying a ~2.78-fold improvement in uniformity. Both ratios and RMSEs have suggested an enhancement factor of >2, hereby confirming the denoising ability.

We have also applied this denoising meta-imager to remove random defects in oral epithelial cells (Fig. 4m). These cells have the transmitted pattern (Fig. 4n) with dark dots caused by these opaque defects. After being magnified by an objective lens, these cells are processed by the denoising meta-imager. Despite the existence of the co-polarized background, the intensity within the defect is observed with improved homogeneity (Fig. 4o). The un-eliminated darkness at the defect region originates from the magnified defects, which leads to the mismatch between the size (i.e., 4.8 μm in Fig. 4n) of magnified defects and the accuracy (w₀ = 4 μm) of the operator. Our simulation shows that the meta-imager behaves well when the defect has a size below w₀ (see Section 12 in Supplementary Materials).

All-optical convolution via a singlet meta-imager

After setting d = 0, the metalens and the meta-modulator are combined into a single complex-amplitude meta-device (Fig. 5a) that possesses both functionalities of imaging and modulator simultaneously, thus enabling high integration. However, the cost is the shrunken field-of-view, which is determined by the d-dependent phase item in Eq. (1). In the current configuration (i.e., M = 1, f = 2.5 mm), our simulation predicts that the convolution works well when the parabolic phase $k\frac{M}{2f}\cdot \frac{f-d}{{l}_{2}-d}{r}_{0}^{2} \,<\, 0.8\pi$ (see Section 13 in Supplementary Materials), suggesting a field-of-view of r₀ = 50 μm for this singlet meta-imager.

**Fig. 5: Singlet meta-imager for edge detection.**

We fabricate the singlet meta-imager (see the amplitude and phase profiles in Fig. 5b) that exhibits excellent performance (see Fig. 5c–e and more details in Methods). Then, we implement edge detection of a binary-amplitude object “U” (Fig. 5f) with this singlet meta-imager. Both experimental (Fig. 5g) and simulated (Fig. 5h) images reveal the clear edges with good agreement, as doubly confirmed by the line-scanning intensity with well-coincided widths and locations of the edges (Fig. 5i). The distance of ~94 μm between the top and bottom edges presents the experimentally achieved field-of-view, which approaches the simulated 100 μm.

Discussions

The changeable distance d between the metalens and the meta-modulator enables our meta-imager to realize complex-amplitude manipulation at the coordinate space, which is fundamentally distinguished from the Fourier filtering approach operating at the frequency space^{11,12,13,14,15,16,17}. The resulting advantage is the more compact volume of the entire device via singlet or doublet meta-surfaces. Particularly, the metalens and the meta-modulator can be made on the front and back sides of a substrate⁴⁹, leading to the nearly identical volume as singlet meta-imager and maintaining the field-of-view simultaneously. For a given meta-imager, the outputted convolutional results can be magnified or shrunken on demand, which is more flexible to match the subsequent detection systems than previous approaches. A detailed comparison among them is provided in Table 1 in Supplementary Materials, suggesting that our meta-imager has superior performances, such as arbitrary convolutional operation, high integration, tunable magnification, and high accuracy. For light with other states of polarization, the complex amplitude in our meta-imager might be realized by using a pure-amplitude or pure-phase spatial light modulator with carefully designed encoding technique^50,51. However, it will lead to increased volume, decreased efficiency, low spatial resolution, and low detection quality.

In summary, we have reported a meta-imager approach to realize all-optical convolutional operation with unlimited kernels. By modifying the PSF via the meta-modulator, the convolution between an object and the improved PSF can be simplified into straightforward imaging formation, which yields the expected processing pattern at the imaging plane. Our meta-imager allows at most two elements (i.e., imaging and modulating parts), both of which can be integrated into a single device for a compact volume. Some frequently used convolutional operations have been demonstrated with good performance to enhance the quality of images in optics and biology, which can be extended to artificial intelligence and high-performance computing.

Materials and methods

Obtaining the meta-modulator from a given convolutional operator

Equation (1) has shown the straightforward Fourier relationship between the meta-modulator (with the complex amplitude h) and the convolutional operator (ℋ). In practical applications, the operator ℋ in different formats (e.g., a discrete N × N matrix or an analytical formula) is usually known with specialized functionality. To obtain its corresponding meta-modulator numerically, we put the operator ℋ at the front focal plane of a Fourier lens with a focal length of l₂–d. Thus, the Fourier transform of ℋ can be obtained at the rear focal plane, where the electric field is expressed as ${F}( {\mathcal H} )={F}({F}[h(x,y)])=h(-x,-y)$. Note that, h(−x,−y) and h(x,y) are centrosymmetric so that we can get h(x,y) via the symmetric transformation of h(−x,−y). During the numerical simulation, the sampling intervals (i.e., the pixel pitches) at both front and rear focal planes must be identical. For example, our meta-modulator has the pixel pitch of p_x × p_y (p_x = p_y = p = 250 nm), which should also be adopted in the convolutional operator ℋ. Because w₀ in the operator ℋ is usually larger than the pixel pitch of the meta-modulator, one detection unit (i.e., w₀ × w₀) in ℋ contains the pixels of P × P (P = w₀/p must be an integer), indicating an upsampling process. Then, the zeros are padded symmetrically around the upsampling ℋ to keep the same matrix size with the meta-modulator, as required in the numerical calculation. Note that, if the operator ℋ has a larger-size matrix of N × N (N > 3), the same process with the case of N = 3 is needed to obtain its corresponding meta-modulator. But, due to the non-zero matrix elements at the volumes or rows of N > 3, the larger-size matrix usually increases the efficient w₀, hereby decreasing the detection accuracy. Therefore, such a larger-size matrix ℋ is usually not recommended in practical applications unless the 3 × 3 matrix fails to realize the expected functionalities. If the operator ℋ has an analytical form, it should also be digitalized with the sampling pixel pitch of p_x × p_y.

Once the upsampling ℋ with the symmetrically padded zeros is well-prepared, we put it at the front focal plane of the Fourier lens, without any deviation (i.e., x₀ = 0, y₀ = 0). Thus, according to Fourier optics⁵², the ℋ has the diffraction field at the rear focal plane, which is taken as the complex amplitude h(−x,−y) of the meta-modulator. All the simulations about this diffraction process are implemented by using the Rayleigh–Sommerfeld integral³¹.

The optical performance of fabricated metalens

To maximize the efficiency, we utilize a 128-level phase-type metalens with f = 2.5 mm and a diameter of 1 mm, which are chosen after a careful balance between the thickness of the substrate, the imaging resolution of the metalens³¹, and the angle-dependent conversion efficiency⁴¹ of the nanobricks. Figure 1e shows optical and SEM images of our fabricated metalens (see the fabrication details in Section 4 in Supplementary Materials) with a measured efficiency of 71.4% (see Section 5 in Supplementary Materials) at λ = 633 nm, which is highly consistent with the simulated amplitude (a square root of the efficiency) in Fig. 1c. The imaging and focusing functionalities of these metalens have also been verified experimentally with good performance (see Section 6 in Supplementary Materials), thus guaranteeing convolutional operations.

Characterizing phase and amplitude from meta-modulators in doublet meta-imagers

For easy fabrication, all the meta-modulators of doublet meta-imagers in this work have discretized amplitude with three levels: 1, 0.5, and 0, which are realized by using the nanobricks with the dimensions (W = 110 nm, L = 170 nm), (W = 110 nm, L = 190 nm) and (W = 150 nm, L = 150 nm), respectively. The nano-bricks with the dimension of W = 150 nm and L = 150 nm are used here to facilitate the fabrication.

Edge-detection meta-modulator

After the discretization, the edge-detection meta-modulator (derived from the operator ℋ_ED with w₀ = 1.5 μm) with the 128-level phase is fabricated in high quality, as confirmed from both optical (Fig. 2b) and SEM (Fig. 2c) images. Its performance is tested experimentally under the illumination of a circularly polarized beam, yielding the expected doughnut-contour transmission (Fig. 2d) with crossed polarization. Figure 2e shows the good agreement between experimental and simulated line-scanning amplitude profiles, implying the valid amplitude modulation. To characterize the phase modulation, a self-built Mach–Zehnder setup (see Section 7 in Supplementary Materials) interfering with the cross-polarized part of transmitted light with a slightly tilted co-propagating plane wave is used to generate an interference pattern, from which we can retrieve the experimental phase by using fast Fourier transform^53,54. Figure 2f presents the retrieved vortex-like phase, which has a linear dependence on the azimuthal coordinate as observed in Fig. 2g and therefore verifies the creation of the required helical phase. Note that, a small phase jump at the boundaries between two size-different nanobricks occurs due to the propagation phase^55,56, which has little influence on the entire performance of the meta-modulator and can be eliminated by adding the geometric phase with an equal and sign-opposite initial value.

Differentiation meta-modulator

For the differentiation operator, the element pitch of ℋ_SD is taken as w₀ = 1.5 μm for the purpose of demonstration. Experimentally, the corresponding meta-modulator (see its microscopy image in Fig. 3b and SEM image in Fig. 3c) with 3-level amplitude and 128-level phase has a two-lobe-like transmission profile for the cross-polarization part (Fig. 3d). Along with the 135°-direction CC′ in Fig. 3d, the line-scanning amplitude profiles (Fig. 3e) have two peaks, which are sandwiched by three zeros at the center and both outmost terminals. Between both peaks, the amplitude exhibits the required quasi-linear dependence on the spatial frequency (referring to k_x = x/fλ due to d = f, where x is the spatial coordinate of meta-modulator and f is the focal length of the metalens) for spatial differentiation. Thus, from the position (x_p ≈ ±0.25 mm) of both peaks, one can evaluate the best differentiation accuracy of this meta-imager by using 0.5/k_x ≈ 3.17 μm, which is highly consistent with the PSF size (r₀ = 3.26 μm) of the metalens. It implies that the larger-size meta-modulator (e.g., the outmost part beyond both peaks) is not necessary to enhance the accuracy (inherently determined by the PSF of the entire imaging system), which coincides with the predicted accuracy in Fig. 2j.

The retrieved phase from the interference pattern reveals a phase shift of π between two different lobes (see Fig. 3f), where the unstable phase at the zero-transmission region is caused by the oscillation of experimental noise. The good agreement between the experimental and simulated line-scanning phase profiles in Fig. 3g suggests valid phase modulation in this meta-modulator.

Denoising meta-modulator

The operator in Eq. (6) leads to a meta-modulator with a Gaussian-like amplitude and an additional linear phase (Fig. 4a), which is fundamentally distinguished from conventional low-pass filters with only the amplitude modulation⁵². The experimental meta-modulator (see its microscopic reflective image in Fig. 4b) has the ring-shaped pattern after the discretization of the amplitude in terms of three different nanobricks (Fig. 4c), thereby leading to the expected transmission (equivalent to amplitude modulation) as depicted in Fig. 4d. The quantitative comparison of the experimental and simulated amplitude profiles in Fig. 4e reveals their good consistency and confirms the validity of the fabricated meta-modulator. In addition, the experimental phase-encoded into the meta-modulator is retrieved in Fig. 4f, demonstrating a linearly increasing phase except the small jump caused by the propagation phase. Despite the existence of the propagation phase, the deviation between the retrieved and simulated phase is still smaller than 0.3π (Fig. 4g), which has no significant influence on convolutional results (see Fig. S3).

The optical performance of singlet meta-imager

A singlet meta-imager (see its phase and amplitude profiles in Fig. 5b) realizing the edge-detection operator (Eq. 3) with w₀ = 1.75 μm is exemplified here for the purpose of demonstration. After the discretization with five-level amplitude and 128-level phase, our fabricated singlet meta-imager has a reflective microscopic pattern with five colored rings (Fig. 3c). When illuminated by a circularly polarized laser, the singlet meta-imager has the increasing transmission (cross-polarized) from the center to the outmost ring (Fig. 3d). Figure 3e provides a quantitative comparison between the simulated and experimental amplitude profiles, where their high consistency implies an efficient amplitude modulation. In contrast, the experimental measurement of the phase is difficult because the lensing phase makes the light beam focused tightly so the interference approach is not suitable.

References

Krizhevsky, A. Sutskever, I., & Hinton, G. E. Imagenet classification with deep convolutional neural networks. in Proc. 25th International Conference on Neural Information Processing Systems. (NIPS, Lake Tahoe, 2012) 1097–1105.
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Article ADS Google Scholar
Wang, P. S. et al. O-Cnn: Octree-based convolutional neural networks for 3d shape analysis. ACM Trans. Graph. 36, 72 (2017).
Article Google Scholar
Zhang, C. et al. Optimizing FPGA-based accelerator design for deep convolutional neural networks. in Proc. 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays. (ACMs, Monterey, 2015) 161–170.
Jouppi, N. P. et al. In-datacenter performance analysis of a tensor processing unit. in Proc. 44th Annual International Symposium on Computer Architecture. (ACMs, Toronto, 2017) 1–12.
Miller, D. A. B. Attojoule optoelectronics for low-energy information processing and communications. J. Lightwave Technol. 35, 346–396 (2017).
Article ADS Google Scholar
Zangeneh-Nejad, F. et al. Analogue computing with metamaterials. Nat. Rev. Mater. 6, 207–225 (2020).
Article ADS Google Scholar
Solli, D. R. & Jalali, B. Analog optical computing. Nat. Photonics 9, 704–706 (2015).
Article ADS Google Scholar
Abdollahramezani, S., Hemmatyar, O. & Adibi, A. Meta-optics for spatial optical analog computing. Nanophotonics 9, 4075–4095 (2020).
Article Google Scholar
Silva, A. et al. Performing mathematical operations with metamaterials. Science 343, 160–163 (2014).
Article ADS MathSciNet MATH Google Scholar
Wang, Z. et al. On-chip wavefront shaping with dielectric metasurface. Nat. Commun. 10, 3547 (2019).
Article ADS Google Scholar
Zhou, J. et al. Optical edge detection based on high-efficiency dielectric metasurface. Proc. Natl Acad. Sci. USA 116, 11137–11140 (2019).
Article ADS Google Scholar
Zhou, J. et al. Two-dimensional optical spatial differentiation and high-contrast imaging. Natl Sci. Rev. 8, nwaa176 (2021).
Article Google Scholar
Zhou, J. et al. Metasurface enabled quantum edge detection. Sci. Adv. 6, eabc4385 (2020).
Article ADS Google Scholar
Pors, A., Nielsen, M. G. & Bozhevolnyi, S. I. Analog computing using reflective plasmonic metasurfaces. Nano Lett. 15, 791–797 (2015).
Article ADS Google Scholar
Qiu, X. et al. Spiral phase contrast imaging in nonlinear optics: seeing phase objects using invisible illumination. Optica 5, 208–212 (2018).
Article ADS Google Scholar
Huo, P. et al. Photonic spin-multiplexing metasurface for switchable spiral phase contrast imaging. Nano Lett. 20, 2791–2798 (2020).
Article ADS Google Scholar
Zhu, T. et al. Plasmonic computing of spatial differentiation. Nat. Commun. 8, 15391 (2017).
Article ADS Google Scholar
Cordaro, A. et al. High-index dielectric metasurfaces performing mathematical operations. Nano Lett. 19, 8418–8423 (2019).
Article ADS Google Scholar
Guo, C. et al. Photonic crystal slab Laplace operator for image differentiation. Optica 5, 251–256 (2018).
Article ADS Google Scholar
Kwon, H. et al. Nonlocal metasurfaces for optical signal processing. Phys. Rev. Lett. 121, 173004 (2018).
Article ADS Google Scholar
Zhou, Y. et al. Flat optics for image differentiation. Nat. Photonics 14, 316–323 (2020).
Article ADS Google Scholar
Davis, T. J. et al. Metasurfaces with asymmetric optical transfer functions for optical signal processing. Phys. Rev. Lett. 123, 013901 (2019).
Article ADS Google Scholar
Kwon, H. et al. Single-shot quantitative phase gradient microscopy using a system of multifunctional metasurfaces. Nat. Photonics 14, 109–114 (2020).
Article ADS Google Scholar
Komar, A. et al. Edge detection with MIE-resonant dielectric metasurfaces. ACS Photonics 8, 864–871 (2021).
Article Google Scholar
Zhu, T. et al. Topological optical differentiator. Nat. Commun. 12, 680 (2021).
Article ADS Google Scholar
Zhu, T. et al. Generalized spatial differentiation from the spin hall effect of light and its application in image processing of edge detection. Phys. Rev. Appl. 11, 034043 (2019).
Article ADS Google Scholar
Feldmann, J. et al. Parallel convolutional processing using an integrated photonic tensor core. Nature 589, 52–58 (2021).
Article ADS Google Scholar
Xu, X. et al. 11 Tops photonic convolutional accelerator for optical neural networks. Nature 589, 44–51 (2021).
Article ADS Google Scholar
Born, M. & Wolf, E. Principles of Optics: Electromagnetic Theory of Propagation, Interference and Diffraction of Light, (Cambridge University Press, 1999).
Huang, K. et al. Planar diffractive lenses: fundamentals, functionalities, and applications. Adv. Mater. 30, 1704556 (2018).
Article Google Scholar
Wilson, T. & Sheppard, C. Theory and Practice of Scanning Optical Microscopy, (Academic Press London, 1984).
Russ, J. C. The Image Processing Handbook, (CRC Press, 2016).
Luo, X. Subwavelength optical engineering with metasurface waves. Adv. Optical Mater. 6, 1701201 (2018).
Article Google Scholar
Astilean, S. et al. High-efficiency subwavelength diffractive element patterned in a high-refractive-index material for 633nm. Opt. Lett. 23, 552–554 (1998).
Article ADS Google Scholar
Yu, N. et al. Light propagation with phase discontinuities: generalized laws of reflection and refraction. Science 334, 333–337 (2011).
Article ADS Google Scholar
Bomzon, Z., Kleiner, V. & Hasman, F. Computer-generated space-variant polarization elements with subwavelength metal stripes. Opt. Lett. 26, 33–35 (2001).
Article ADS Google Scholar
Zhang, L. et al. Advances in full control of electromagnetic waves with metasurfaces. Adv. Optical Mater. 4, 818–833 (2016).
Article Google Scholar
Kuznetsov, A. I. et al. Optically resonant dielectric nanostructures. Science 354, aag2472 (2016).
Article Google Scholar
Luo, X. Principles of electromagnetic waves in metasurfaces. Sci. China Phys. Mech. Astron. 58, 594201 (2015).
Article Google Scholar
Huang, K. et al. Ultraviolet metasurfaces of ≈80% efficiency with antiferromagnetic resonances for optical vectorial anti-counterfeiting. Laser Photonics Rev. 13, 1800289 (2019).
Article ADS Google Scholar
Huang, K. et al. Silicon multi-meta-holograms for the broadband visible light. Laser Photonics Rev. 10, 500–509 (2016).
Article ADS Google Scholar
Brown, B. R. & Lohmann, A. W. Complex spatial filtering with binary masks. Appl. Opt. 5, 967–969 (1966).
Article ADS Google Scholar
Huang, K. et al. Design of Dpe for modulating the electric field at the out-of-focus plane in a lens system. Appl. Opt. 51, 5149–5153 (2012).
Article ADS Google Scholar
Ren, Y. X. et al. Experimental generation of Laguerre-Gaussian beam using digital micromirror device. Appl. Opt. 49, 1838–1844 (2010).
Article ADS Google Scholar
Ritsch-Marte, M. Orbital angular momentum light in microscopy. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 375, 20150437 (2017).
Article ADS Google Scholar
Davis, J. A. et al. Image processing with the Radial Hilbert transform: theory and experiments. Opt. Lett. 25, 99–101 (2000).
Article ADS Google Scholar
Fürhapter, S. et al. Spiral phase contrast imaging in microscopy. Opt. Express 13, 689–694 (2005).
Article ADS Google Scholar
Arbabi, A. et al. Miniature optical planar camera based on a wide-angle metasurface doublet corrected for monochromatic aberrations. Nat. Commun. 7, 13682 (2016).
Article ADS Google Scholar
Chen, Y. et al. Generalized perfect optical vortices along arbitrary trajectories. J. Phys. D Appl. Phys. 54, 214001 (2021).
Article ADS Google Scholar
Davis, J. A. et al. Encoding amplitude information onto phase-only filters. Appl. Opt. 38, 5004–5013 (1999).
Article ADS Google Scholar
Goodman, J. W. Introduction to Fourier Optics (Roberts and Company Publishers, 2005).
Huang, K. et al. Spiniform phase-encoded metagratings entangling arbitrary rational-order orbital angular momentum. Light 7, 17156 (2018).
Article Google Scholar
Huang, K. et al. Bio-inspired photonic masquerade with perturbative metasurfaces. ACS Nano 14, 7529–7537 (2020).
Article Google Scholar
Zhang, J. et al. Polarization-enabled tunable focusing by visible-light metalenses with geometric and propagation phase. J. Opt. 21, 115102 (2019).
Article ADS Google Scholar
Balthasar Mueller, J. P. et al. Metasurface polarization optics: independent phase control of arbitrary orthogonal states of polarization. Phys. Rev. Lett. 118, 113901 (2017).
Article ADS Google Scholar

Download references

Acknowledgements

K.H. thanks the National Natural Science Foundation of China (Grant Nos. 12134013 and 61875181), CAS Pioneer Hundred Talents Program, “the Fundamental Research Funds for the Central Universities” in China, USTC Research Funds of the Double First-Class Initiative (Grant No. YD2030002003), and the support from the University of Science and Technology of China’s Centre for Micro and Nanoscale Research and Fabrication. C.T. thanks the National Natural Science Foundation of China (Grant Nos. 62122072 and 12174368), the Institute of Artificial Intelligence at Hefei Comprehensive National Science Center (Grant No. 21KT016), and the Anhui Science and Technology Department (Grant No. 18030801138).

Author information

These authors contributed equally: Weiwei Fu, Dong Zhao, Ziqin Li

Authors and Affiliations

Department of Optics and Optical Engineering, University of Science and Technology of China, Hefei, Anhui, 230026, China
Weiwei Fu, Dong Zhao, Ziqin Li & Kun Huang
Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, Anhui, 230088, China
Songde Liu & Chao Tian
Department of Precision Machinery and Precision Instrumentation, University of Science and Technology of China, Hefei, Anhui, 230026, China
Songde Liu & Chao Tian

Authors

Weiwei Fu
View author publications
You can also search for this author in PubMed Google Scholar
Dong Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Ziqin Li
View author publications
You can also search for this author in PubMed Google Scholar
Songde Liu
View author publications
You can also search for this author in PubMed Google Scholar
Chao Tian
View author publications
You can also search for this author in PubMed Google Scholar
Kun Huang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.H. conceived the idea. W.F. and K.H. developed the theory and performed the simulations. D.Z. and Z.L. prepared and fabricated optical samples. W.F. and D.Z. built up the experimental setup and did the characterization. S.L., C.T., W.F., and K.H. prepared the biological samples. W.F., K.H., D.Z., and C.T. wrote the paper. K. H. and C. T. supervised the overall project. All authors discussed the results, did the data analysis, and commented on the paper.

Corresponding authors

Correspondence to Chao Tian or Kun Huang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Supplementary information

Supplementary Materials

Supplementary Video 1

Supplementary Video 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fu, W., Zhao, D., Li, Z. et al. Ultracompact meta-imagers for arbitrary all-optical convolution. Light Sci Appl 11, 62 (2022). https://doi.org/10.1038/s41377-022-00752-5

Download citation

Received: 09 September 2021
Revised: 17 February 2022
Accepted: 23 February 2022
Published: 18 March 2022
DOI: https://doi.org/10.1038/s41377-022-00752-5

This article is cited by

Compact meta-differentiator for achieving isotropically high-contrast ultrasonic imaging
- Yurou Jia
- Suying Zhang
- Xiaojun Liu
Nature Communications (2024)
Multichannel meta-imagers for accelerating machine vision
- Hanyu Zheng
- Quan Liu
- Jason G. Valentine
Nature Nanotechnology (2024)
Inverse-designed low-index-contrast structures on a silicon photonics platform for vector–matrix multiplication
- Vahid Nikkhah
- Ali Pirmoradi
- Nader Engheta
Nature Photonics (2024)
Correlated optical convolutional neural network with “quantum speedup”
- Yifan Sun
- Qian Li
- Xiangdong Zhang
Light: Science & Applications (2024)
Broadband angular spectrum differentiation using dielectric metasurfaces
- Ming Deng
- Michele Cotrufo
- Lin Chen
Nature Communications (2024)