Abstract
Probabilistic machine learning utilizes controllable sources of randomness to encode uncertainty and enable statistical modeling. Harnessing the pure randomness of quantum vacuum noise, which stems from fluctuating electromagnetic fields, has shown promise for high-speed, energy-efficient stochastic photonic elements. Nevertheless, photonic computing hardware that can control these stochastic elements to program probabilistic machine learning algorithms has been limited. Here, we implement a photonic probabilistic computer built around a controllable stochastic photonic element: a photonic probabilistic neuron (PPN). Our PPN is implemented in a bistable optical parametric oscillator (OPO) with vacuum-level injected bias fields. We then program a measurement-and-feedback loop for time-multiplexed PPNs with electronic processors (FPGA or GPU) to solve probabilistic machine learning tasks. We showcase probabilistic inference and image generation of MNIST-handwritten digits, representative examples of discriminative and generative models. In both implementations, quantum vacuum noise is used as a random seed to encode classification uncertainty or to probabilistically generate samples. In addition, we propose a path towards an all-optical probabilistic computing platform, with an estimated sampling rate of ~1 Gbps and energy consumption of ~5 fJ/MAC. Our work paves the way for scalable, ultrafast, and energy-efficient probabilistic machine learning hardware.
Introduction
Probabilistic machine learning can accelerate image generation1,2, heuristic optimization3,4, and probabilistic inference5,6 by leveraging stochasticity to encode uncertainty and enable statistical modeling7,8. These approaches are well suited for real-life applications which must account for uncertainty and variability, including autonomous driving9, medical diagnosis10, and drug discovery11. However, digital complementary metal-oxide-semiconductor (CMOS) technology requires extensive resource overhead to simulate randomness and control probabilities, which leads to significantly increased power consumption and decreased operational speed12. These challenges have sparked recent proposals for beyond-CMOS hardware such as low-barrier magnetic tunnel junctions13 and diffusive memristors14—both of which leverage intrinsic noise as a source of randomness.
Concurrently, optical neural networks (ONNs)15,16 have shown remarkable progress in energy efficiency17,18, speed19, and bandwidth20 for solving deterministic tasks such as image classification21 and speech recognition22. An important feature of ONNs is the inherent presence of noise in their operation; photonic computing hardware therefore typically implements computational tasks that are robust to optical noise16. ONNs have also been explored in regimes where deterministic tasks are performed with high accuracy despite high levels of inherent noise18. Conversely, ONNs in which optoelectronic noise is intentionally added have been proposed for optimization23 and generative networks24. Interestingly, quantum optics offers a natural source of randomness in the ground state of the electromagnetic field, known as quantum vacuum noise25,26,27. This intrinsic noise source is ubiquitous in optics and has been used for high-data-rate random number generation28,29. In addition, optical systems influenced by quantum vacuum noise have shown a natural ability to generate probability distributions30,31,32, which is of strong interest for computing applications13,14. However, the experimental demonstration of a photonic probabilistic machine learning system has so far remained elusive, mostly due to the lack of programmable stochastic photonic elements.
Here, we experimentally demonstrate a probabilistic computing platform utilizing photonic probabilistic neurons (PPNs). Our PPN is implemented as a biased degenerate optical parametric oscillator (OPO), which leverages quantum vacuum noise to generate a probability distribution encoded by a bias field. We realized a hybrid optoelectronic probabilistic machine learning system which combines time-multiplexed PPNs and electronic processors with algorithm-specific measurement-and-feedback strategies. We demonstrate probabilistic inference of MNIST-handwritten digits with a stochastic binary neural network (SBNN), highlighting how quantum vacuum noise can encode classification uncertainty in discriminative models. Additionally, we showcase the generation of MNIST-handwritten digits with a pixel convolutional neural network (pixelCNN), demonstrating how statistical sampling in generative models can be facilitated by quantum vacuum noise. Furthermore, we provide a thorough discussion of the potential of an all-optical probabilistic machine learning system, offering a possible performance enhancement by a factor of 100 in both speed and energy over traditional CMOS implementations, thereby opening new avenues in high-speed, energy-efficient computing applications.
Results
Probabilistic computing with time-multiplexed PPNs
We first provide a brief overview of two probabilistic machine learning models and their optical implementation with PPNs (Fig. 1).
Discriminative models learn decision lines that encode classification boundaries between different images (Fig. 1a, left)33. Probabilistic neural networks (Fig. 1a, middle) then impart statistical properties onto network parameters (e.g., weight uncertainty5 or layer nodes34). Therefore, the network can provide a statistical ensemble of classification results, which are shown as different probabilities of the image classified to certain labels (Fig. 1a, right). Probabilistic inference can quantify classification uncertainty, which becomes critical for ambiguous images located near the decision boundary35,36.
On the other hand, generative models learn the underlying probability distribution of the training dataset (e.g., images) in order to create new ones (Fig. 1b, left)33. When generating new images, generative models use random sources to seed stochastic image sampling based on the probability distribution learned by the network (Fig. 1b, middle). As a result, images with different labels can be generated (Fig. 1b, right).
In both of these computational tasks, probabilistic machine learning requires stochastic photonic elements whose probability distribution can be tuned, and that can perform statistically independent sampling. We refer to the optical implementation of this capability as PPNs (purple circles in Fig. 1a, b).
The proposed PPN is depicted in Fig. 1c. The building block consists of a synchronously pumped degenerate OPO30. An OPO consists of a nonlinear medium (e.g., a second-order nonlinear crystal that down-converts the pump frequency) surrounded by an optical cavity. The phase of the initial optical field is random due to electromagnetic field fluctuations inside the cavity (quantum vacuum noise). When the pump laser power exceeds a threshold, the phase-sensitive gain of the OPO drives the initial state into one of two bistable output states with phase 0 or π rad28. In other words, quantum vacuum noise acts as a perfect random source that manifests itself in the output phase; indeed, this intrinsic noise source is ubiquitous in quantum optics25,26,27. When a vacuum-level external bias field b is introduced in the OPO cavity, the probability distribution of the output steady states can be coherently controlled30. Specifically, our OPO-based PPN encodes a Bernoulli trial B(p) with binary outcomes of probability p and 1 − p. Independent random sampling and processing can be realized by time-multiplexing the bias signal, resulting in N independent outcomes with encoded probabilities, depicted as different heights in Fig. 1c.
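As a minimal illustration of this Bernoulli picture, the sketch below (a toy model of our own, not the authors' physical simulation) treats the initial intracavity field as the vacuum-level bias plus a Gaussian vacuum fluctuation whose sign is amplified into one of the two bistable phase states; the Gaussian noise model, the function name ppn_sample, and the noise_std parameter are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng()

def ppn_sample(bias, noise_std=1.0, n=1):
    # Toy model: the OPO's initial in-phase field is the vacuum-level bias
    # plus a Gaussian vacuum fluctuation; phase-sensitive gain then amplifies
    # its sign into one of the two bistable states (0 rad -> 0, pi rad -> 1).
    field = bias + noise_std * rng.standard_normal(n)
    return (field < 0).astype(int)

# Bernoulli trial B(p): p is set by the bias relative to the vacuum noise.
# A zero bias gives a fair coin, p = 0.5.
bits = ppn_sample(bias=0.0, n=10_000)
print(bits.mean())  # ~0.5
```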
The experimental system realizing the PPN, and its implementation into a probabilistic computing system, is shown in Fig. 2. The system consists of three modules: biased OPO (purple area), detection (green area), and processing unit (blue area). We time-multiplex OPO signals with an amplitude modulator along the pump path to sample multiple binary outputs from a single optical cavity at a rate of 10 kHz. This bit rate is chosen to ensure the statistical independence of each PPN28,30. We use a homodyne detector to measure the optical phase of the steady state and map it to the corresponding bit value (i.e., 0 rad → 0 and π rad → 1).
During each cycle, a bit (value 0 or 1) is measured by the homodyne detector, conditioned on the bias value b. This bit, or a collection of bit values (“bitstream”), is then fed into an electronic processing unit to update the bias field value and sample the PPN in the next cycle. In our experiment, the processing unit is either a field-programmable gate array (FPGA) or a graphics processing unit (GPU). The FPGA is better suited to real-time bitstream processing and control of the optical system, while the GPU can accelerate complex machine learning algorithms such as image generation, at the cost of slower system control.
Individual \(p_i\) values are encoded in the phase of the bias field \(b_i\) by applying a calibrated square-wave voltage to a phase modulator in the bias line path. The voltage–probability relation provided by the phase modulator is shown in Fig. 2b. This relation is used in the following computing experiments to control the bias voltage. A detailed description of the experimental setup is given in Supplementary Note 1.
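A minimal sketch of one measurement-and-feedback cycle, assuming a hypothetical sigmoidal calibration in place of the measured curve of Fig. 2b; prob_to_voltage, set_bias_voltage, and read_homodyne_bit are placeholder names for the processor's interface to the modulator and detector, not the authors' API.

```python
import numpy as np

def prob_to_voltage(p, v_scale=1.0):
    # Invert a hypothetical sigmoidal voltage-probability calibration,
    # standing in for the measured relation of Fig. 2b.
    p = np.clip(p, 1e-6, 1 - 1e-6)
    return v_scale * np.log(p / (1 - p))

def feedback_cycle(target_probs, set_bias_voltage, read_homodyne_bit):
    # One pass of the time-multiplexed measurement-and-feedback loop:
    # each time slot programs one PPN at its target probability, then
    # reads the resulting bit from the homodyne detector.
    bits = []
    for p in target_probs:
        set_bias_voltage(prob_to_voltage(p))   # program the bias field
        bits.append(read_homodyne_bit())       # 0 rad -> 0, pi rad -> 1
    return bits
```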
Photonic probabilistic computer for image classification
We now perform probabilistic image classification of MNIST-handwritten digits37 using a pre-trained SBNN model on our optical probabilistic computing platform (Fig. 3a). The SBNN encodes inference uncertainty by substituting the deterministic layer nodes of conventional fully connected neural networks with stochastic binary nodes38. In a conventional fully connected neural network, the jth node value in the (n + 1)th layer, \(X_{j,n+1}\), is calculated in two steps: (1) a matrix–vector multiplication (MVM) between the weight matrix \(W\) and the nth layer \(X_n\), \(z_{j,n} \equiv \sum_i W_{j,i} X_{i,n}\); followed by (2) a nonlinear activation function \(\sigma(\cdot)\): \(X_{j,n+1} = \sigma(z_{j,n})\).
Within our SBNN model, each layer node is represented by a PPN, and a single layer (yellow areas in Fig. 3a) is described as a bitstream of time-multiplexed PPNs. Because of the nonlinear bias–probability relationship (Fig. 2b), sampling a binary output \(X_{j,n}\) with our PPN at a given bias \(b_{j,n}\) (or, equivalently, bias modulator voltage \(V_{j,n}\) in our experiment) naturally corresponds to applying a nonlinear activation function: \(X_{j,n} = B(p_{j,n}) = B[\sigma(V_{j,n})]\). The modulator voltage \(V_{j,n}\) is calculated via an MVM between the weights \(W_{n-1}\) and the (n − 1)th layer \(X_{n-1}\) (gray areas in Fig. 3a), performed by the FPGA in our experiment. In other words, each PPN node binarizes its input, a weighted sum of the previous layer's nodes, with probability \(p_{j,n}\). Because of the stochastic nature of the nodes, their sampled values change for every inference, leading to a probabilistic interpretation of the classification results for an identical input image (Fig. 3a, right).
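The correspondence \(X_{j,n} = B[\sigma(V_{j,n})]\) maps directly onto a few lines of code. The sketch below is ours: a logistic function stands in for the measured bias–probability curve, and numpy sampling stands in for the physical PPNs.

```python
import numpy as np

rng = np.random.default_rng()

def sigma(v):
    # Stand-in logistic activation; in the experiment this role is played
    # by the PPN's measured bias(voltage)-probability curve (Fig. 2b).
    return 1.0 / (1.0 + np.exp(-v))

def sbnn_layer(x_prev, W):
    # One SBNN layer: MVM on the electronic processor, followed by
    # stochastic binarization in the time-multiplexed PPNs.
    v = W @ x_prev                                # V_{j,n} = sum_i W_{j,i} X_{i,n-1}
    p = sigma(v)                                  # p_{j,n} = sigma(V_{j,n})
    return (rng.random(p.shape) < p).astype(int)  # X_{j,n} ~ B(p_{j,n})
```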
To perform image classification of MNIST-handwritten digits with our optical SBNN, we first binarize the original MNIST-handwritten digits (Fig. 3a, left). The original digits (grayscale, pixel values ranging from 0 to 255) are normalized between 0 and 1, and the resulting pixel values serve as the probability value for each input PPN; the grayscale images are binarized by sampling these PPNs. The binary images are then propagated through the network (784 → 128 → 64 → 10), with real-time communication between the PPNs and the FPGA. The output layer \(O_{0,1,\ldots,9}\) is used to interpret the classification result, with a higher \(O_j\) corresponding to a higher probability that the image represents digit “j”. The network is pre-trained in silico and the weights are implemented on the FPGA. A detailed description of the training process and of how the FPGA communicates with the optical setup is given in Supplementary Note 2.
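Building on the layer sketch above, repeated probabilistic inference can be emulated as follows (our illustration; the deterministic readout at the output layer is our simplifying assumption):

```python
def sbnn_inference(image_gray, weights, n_repeats=10):
    # image_gray: flattened MNIST image scaled to [0, 1]; each pixel value
    # is the probability programmed into an input-layer PPN.
    # weights: pre-trained matrices for 784 -> 128 -> 64 -> 10.
    labels = []
    for _ in range(n_repeats):
        # Binarize the grayscale image by sampling the input-layer PPNs.
        x = (rng.random(image_gray.shape) < image_gray).astype(int)
        for W in weights[:-1]:
            x = sbnn_layer(x, W)       # stochastic hidden layers (PPNs)
        scores = weights[-1] @ x       # output layer O_0, ..., O_9
        labels.append(int(np.argmax(scores)))
    return labels  # the spread of labels encodes classification uncertainty
```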
To test the performance of our optical SBNN, a batch of 100 grayscale MNIST-handwritten digits is selected from the test set. By binarizing each grayscale digit 10 times to encode statistical uncertainty, we prepared a total of 1000 binarized MNIST-handwritten digits to be classified by our optical SBNN. While propagating to the output layer, the PPNs in the input and hidden layers encode uncertainty by stochastically sampling binary values from the given probabilities. Once the output layer is reached, we collect statistics from the 10 inference results for each input image. The confusion matrices in Fig. 3b show that the overall experimental classification accuracy (96.5%) is in close agreement with the accuracy obtained from numerical simulations, both for the same batch (97.0%) and for the full set of test images (98.3%) (see Supplementary Note 2). The classification accuracy of our photonic probabilistic computing hardware is also comparable with that of other optical computing platforms, which reach more than 95%21,39,40.
Figure 3c shows how our probabilistic neural network can diagnose the reliability of inference results by harnessing quantum vacuum noise. Unlike deterministic neural networks, the variability of layer nodes in SBNNs yields a different outcome for each inference. One factor that can degrade classification performance is the ambiguity of the image (i.e., how close the image lies to the decision boundary, as shown in Fig. 1a). By encoding uncertainty during inference, our photonic probabilistic computing hardware surfaces all labels under which an ambiguous image may be classified. We choose two ambiguous and two unambiguous images from the test dataset and plot the probability of each binarized grayscale MNIST-handwritten digit being classified under a certain label. Because each grayscale image was binarized 10 times, 10 probability values are shown for each label.
Three different scenarios are described in Fig. 3c. Unambiguous images such as “0” and “9” (achieving 100% classification accuracy) show relatively consistent classification results, with probabilities of correct classification close to 1. In this scenario, probabilistic neural networks behave like deterministic neural networks, which always give the same classification result with a fixed probability. When the input image becomes ambiguous (image “5” underlined in red, achieving 50% classification accuracy), our SBNN model indicates that the image can be either a “3” or a “5”. Accordingly, the distribution of probabilities for each label broadens, with its average value close to 50%. The worst-case scenario is depicted by image “2” (underlined in blue), showing low overall accuracy (20%) and strong inconsistency in the classification results. Such a scenario clearly showcases how probabilistic sampling can provide additional information to the end user. Classification results for labels not included in Fig. 3c can be found in Supplementary Note 2.
Offering both overall accuracy and statistics of the classification results, probabilistic neural networks can diagnose inference results by providing a confidence level for the decision. The complete classification results for each input image can be found in Supplementary Note 2.
Generating images from quantum vacuum noise with photonic generative models
We now turn to the demonstration of generative models on our photonic probabilistic computing platform (Fig. 4), using quantum optical randomness as a source for generative machine learning. We use a type of autoregressive model, the pixelCNN, which models the conditional probability of the current pixel value given the previous pixels41.
Our implementation protocol for the pixelCNN with PPNs is described in Fig. 4a. A binary image with the first N − 1 pixels \(X_{i \le N-1}\) specified is given as input to the network. In principle, N can be any natural number, with N = 1 corresponding to the case where the pixelCNN creates an image using only quantum vacuum noise as a random seed. Given the input image, a pre-trained pixelCNN model on the GPU evaluates the probability \(p_N\) to be encoded on the PPN from the previous pixels \(X_{i \le N-1}\), generating a binary value for the Nth pixel (\(X_N\)). The probability \(p_{N+1}\) is then computed from the previous pixel values \(X_{i \le N}\). This process repeats until the full image is generated (28 × 28 = 784 pixels). Our hybrid optoelectronic computing system can thus generate new images using quantum vacuum noise as a random seed. Details of the network structure and training method can be found in Supplementary Note 3.
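A hedged sketch of this autoregressive sampling loop: cond_prob is a placeholder for the pre-trained pixelCNN evaluated on the GPU, and the physical PPN sampling is emulated with numpy.

```python
import numpy as np

rng = np.random.default_rng()

def generate_image(cond_prob, x_init=(), n_pixels=784):
    # cond_prob(x_prev) stands in for the pre-trained pixelCNN: given the
    # pixels X_{i<N}, it returns the conditional probability p_N.
    x = np.zeros(n_pixels, dtype=int)
    x_init = np.asarray(x_init, dtype=int)
    x[:x_init.size] = x_init                 # first N-1 pixels, if any
    for i in range(x_init.size, n_pixels):
        p_i = cond_prob(x[:i])               # p(X_i = 1 | X_{i' < i})
        x[i] = int(rng.random() < p_i)       # PPN: vacuum noise decides the bit
    return x.reshape(28, 28)
```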
Different MNIST-handwritten digits, all generated from the same incomplete input image, highlight how quantum vacuum noise enables stochastic image sampling (Fig. 4b). Although they all start from the same “ancestor” image, the multiple stochastic pixel samples from the PPNs branch off into different MNIST-handwritten digits with different labels (“descendant” images). It is also possible to generate different images sharing the same label (here, an ancestor likely to be labeled as “2”).
We produced 100 examples of handwritten digit images from quantum vacuum noise using our photonic probabilistic computing platform (Fig. 4c), by providing an empty image as the input to our optical pixelCNN. We also evaluate the negative log-likelihood (NLL) of the generated images, \(\mathrm{NLL} \equiv -\sum_i \left[X_i \ln(p_i) + (1 - X_i)\ln(1 - p_i)\right]\), where the sum runs over the pixel indices i = 1, …, 784. A lower NLL indicates closer statistical similarity to the distribution of training images; we obtain 71.1 ± 18.8 for our experimental results and 64.9 ± 15.4 for numerical simulations. This shows that our system has learned an accurate representation of the image distribution. Details of the performance of image generation can be found in Supplementary Note 3.
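For reference, the NLL above can be computed from a generated binary image and the per-pixel conditional probabilities recorded during generation; this is a straightforward transcription of the formula, with the clipping constant eps added as our numerical safeguard.

```python
import numpy as np

def nll(x, p, eps=1e-9):
    # Negative log-likelihood of a generated binary image x under the
    # per-pixel conditional probabilities p recorded during generation.
    x = np.asarray(x).ravel()
    p = np.clip(np.asarray(p).ravel(), eps, 1 - eps)
    return -np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))
```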
Discussion
In our demonstration of photonic probabilistic machine learning, the speed and energy efficiency were limited by the PPN sampling rate and data transfer bandwidth between electronic processors and PPNs. In the following, we propose an all-optical probabilistic computing platform which can overcome these challenges, and evaluate the potential benefit in terms of speed and energy efficiency compared to the electronic state of the art.
To increase sampling rate and reduce energy consumption, we propose an all-optical implementation. For instance, PPNs can be implemented with injection-seeded vertical-cavity surface-emitting lasers, reaching >1 Gbps42 and providing energy-efficient operation43. Fast control of the probability and state detection can be achieved with high-bandwidth modulators and detectors44,45,46,47,48, suggesting that PPNs achieving 1 Gbps sampling rate are within reach (detailed explanations can be found in Supplementary Note 4).
Furthermore, our programmable stochastic element naturally implements an all-optical nonlinearity through the bias–probability relationship, which has been a historical challenge in the implementation of energy-efficient all-optical ONNs15. Typically, ONNs rely on optoelectronic measurement-feedback schemes to update the network layers39,49. Conversely, in the proposed scheme, an optical signal (the vacuum-level bias) controls the nonlinearity of the layer. Because the bias signal can be derived directly from the accumulated PPN outputs, bypassing active components, the scheme can reduce the energy consumption per multiply-accumulate (MAC) operation to as low as ~5 fJ/MAC. State-of-the-art stochastic electronic devices, such as low-barrier magnetic tunnel junctions and diffusive memristors integrated with conventional CMOS technologies, are expected to achieve ~0.1 Gbps50,51 and consume ~900 fJ/MAC38. Comparatively, our proposed photonic platform can be ~×10 faster and ~×100 more energy efficient. A detailed discussion of this all-optical probabilistic computing platform is found in Supplementary Note 4.
We now compare the speed and energy performance of our photonic platform to a state-of-the-art FPGA52,53 on an image classification task with a binary neural network. The deterministic FPGA implementation demonstrated classification of ~1.6 million images per second with ~23 W power consumption. Adopting the network structure of our SBNN model in Fig. 3, we can calculate the computation time and the number of MAC operations required for each inference. Our estimate gives ~4 ns and ~\(10^5\) MAC operations per classification, which corresponds to ~250 million image classifications per second with a power consumption of ~0.1 W. The suggested all-optical probabilistic computing hardware could therefore perform ×100 faster while consuming ×100 less power. A detailed discussion can be found in Supplementary Note 4.
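These figures can be checked with simple arithmetic (our back-of-envelope transcription of the quoted numbers):

```python
# Back-of-envelope check of the quoted estimates, using the ~4 ns
# inference time and ~5 fJ/MAC figures from the text.
layers = [784, 128, 64, 10]                      # SBNN structure of Fig. 3
macs = sum(a * b for a, b in zip(layers, layers[1:]))
print(macs)                                      # 109,184 -> ~1e5 MACs/inference

t_inf = 4e-9                                     # ~4 ns per classification
print(f"{1 / t_inf:.2e} images/s")               # ~2.5e8 = 250 million images/s
print(f"{macs / t_inf * 5e-15:.2f} W")           # ~0.14 W at ~5 fJ/MAC
```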
One possible extension of our work is to train the network physically54,55. This becomes critical when accurate digital modeling of the physical system is challenging due to its complexity. Without the additional cost of simulating randomness in digital models, several training methods that rely on stochasticity, including stochastic gradient descent56, dropout34, and noise injection57, could potentially be realized with PPNs. Harnessing quantum vacuum noise in optical elements for both training and testing, our PPNs will pave the way for all-optical probabilistic physical neural networks, which can benefit state-of-the-art machine learning applications including large language models58 and diffusion models59.
Our platform could also be used to implement other important computational tasks. The first is interpretable neural network models with trainable activation functions60, which could be implemented with PPNs by taking advantage of their tunable bias–probability relationship. The second is Ising model solvers with external magnetic fields, which can be modeled by injecting a bias field into a network of OPOs61.
Data availability
All data supporting this work are available within the manuscript, the Supplementary Information, and the online repository: https://codeocean.com/capsule/4025993/tree. Raw data generated during the study are available upon request from the corresponding authors. Correspondence and requests should be addressed to S.C. (seouc130@mit.edu) and C.R.-C. (chrc@stanford.edu).
Code availability
The code used in this study is available at https://codeocean.com/capsule/4025993/tree.
References
Nichol, A. et al. GLIDE: towards photorealistic image generation and editing with text-guided diffusion models. Preprint at https://doi.org/10.48550/arXiv.2112.10741 (2021).
Goodfellow, I. et al. Generative adversarial nets. In Advances in Neural Information Processing Systems 27 (NIPS, 2014).
Roques-Carmes, C. et al. Heuristic recurrent algorithms for photonic Ising machines. Nat. Commun. 11, 249 (2020).
Pham, D. & Karaboga, D. Intelligent Optimisation Techniques: Genetic Algorithms, Tabu Search, Simulated Annealing and Neural Networks (Springer Science & Business Media, 2012).
Blundell, C., Cornebise, J., Kavukcuoglu, K. & Wierstra, D. Weight uncertainty in neural network. In International Conference on Machine Learning 1613–1622 (PMLR, 2015).
Neal, R. M. Bayesian Learning for Neural Networks Vol. 118 (Springer Science & Business Media, 2012).
Murphy, K. P. Probabilistic Machine Learning: An Introduction (MIT Press, 2022).
Ghahramani, Z. Probabilistic machine learning and artificial intelligence. Nature 521, 452–459 (2015).
Feng, D., Harakeh, A., Waslander, S. L. & Dietmayer, K. A review and comparative study on probabilistic object detection in autonomous driving. IEEE Trans. Intell. Transp. Syst. 23, 9961–9980 (2021).
Richens, J. G., Lee, C. M. & Johri, S. Improving the accuracy of medical diagnosis with causal machine learning. Nat. Commun. 11, 3923 (2020).
Jiménez-Luna, J., Grisoni, F. & Schneider, G. Drug discovery with explainable artificial intelligence. Nat. Mach. Intell. 2, 573–584 (2020).
Qin, Y. et al. A high-speed true random number generator based on unified selector-RRAM. IEEE Electron Device Lett. (2023).
Chowdhury, S. et al. A full-stack view of probabilistic computing with p-bits: devices, architectures and algorithms. IEEE J. Explor. Solid State Comput. Devices Circuits 9, 1–11 (2023).
Woo, K. S. et al. Probabilistic computing using Cu0.1Te0.9/HfO2/Pt diffusive memristors. Nat. Commun. 13, 5762 (2022).
Wetzstein, G. et al. Inference in artificial intelligence with deep optics and photonics. Nature 588, 39–47 (2020).
McMahon, P. L. The physics of optical computing. Nat. Rev. Phys. 5, 717–734 (2023).
Wang, T. et al. An optical neural network using less than 1 photon per multiplication. Nat. Commun. 13, 123 (2022).
Ma, S.-Y., Wang, T., Laydevant, J., Wright, L. G. & McMahon, P. L. Quantum-noise-limited optical neural networks operating at a few quanta per activation. Preprint at https://doi.org/10.48550/arXiv.2307.15712 (2023).
Mourgias-Alexandris, G. et al. Noise-resilient and high-speed deep learning with coherent silicon photonics. Nat. Commun. 13, 5572 (2022).
Totovic, A., Giamougiannis, G., Tsakyridis, A., Lazovsky, D. & Pleros, N. Programmable photonic neural networks combining WDM with coherent linear optics. Sci. Rep. 12, 5605 (2022).
Bernstein, L. et al. Single-shot optical neural network. Sci. Adv. 9, eadg7904 (2023).
Shen, Y. et al. Deep learning with coherent nanophotonic circuits. Nat. Photonics 11, 441–446 (2017).
Prabhu, M. et al. Accelerating recurrent Ising machines in photonic integrated circuits. Optica 7, 551–558 (2020).
Wu, C. et al. Harnessing optoelectronic noises in a photonic generative network. Sci. Adv. 8, eabm2956 (2022).
Purcell, E. M., Torrey, H. C. & Pound, R. V. Resonance absorption by nuclear magnetic moments in a solid. Phys. Rev. 69, 37 (1946).
Chan, H. B., Aksyuk, V. A., Kleiman, R. N., Bishop, D. J. & Capasso, F. Quantum mechanical actuation of microelectromechanical systems by the Casimir force. Science 291, 1941–1944 (2001).
Sandoghdar, V., Sukenik, C., Hinds, E. & Haroche, S. Direct measurement of the van der Waals interaction between an atom and its images in a micron-sized cavity. Phys. Rev. Lett. 68, 3432 (1992).
Marandi, A., Leindecker, N. C., Vodopyanov, K. L. & Byer, R. L. All-optical quantum random bit generation from intrinsically binary phase of parametric oscillators. Opt. Express 20, 19322–19330 (2012).
Kim, K. et al. Massively parallel ultrafast random bit generation with a chip-scale laser. Science 371, 948–952 (2021).
Roques-Carmes, C. et al. Biasing the quantum vacuum to control macroscopic probability distributions. Science 381, 205–209 (2023).
Wu, C., Yang, X., Chen, Y. & Li, M. Photonic Bayesian neural network using programmed optical noises. IEEE J. Sel. Top. Quantum Electron. 29, 1–6 (2022).
Ma, B., Zhang, J., Li, X. & Zou, W. Stochastic photonic spiking neuron for Bayesian inference with unsupervised learning. Opt. Lett. 48, 1411–1414 (2023).
Jebara, T. Machine Learning: Discriminative and Generative (Springer Science & Business Media, 2012).
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I. & Salakhutdinov, R. Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014).
Silipo, R. & Marchesi, C. Artificial neural networks for automatic ECG analysis. IEEE Trans. Signal Process. 46, 1417–1425 (1998).
Leibig, C., Allken, V., Ayhan, M. S., Berens, P. & Wahl, S. Leveraging uncertainty information from deep neural networks for disease detection. Sci. Rep. 7, 17816 (2017).
LeCun, Y. et al. Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998).
Li, Y. et al. Binary-stochasticity-enabled highly efficient neuromorphic deep learning achieves better-than-software accuracy. Adv. Intell. Syst. 6, 2300399 (2024).
Zhang, H. et al. An optical neural chip for implementing complex-valued neural network. Nat. Commun. 12, 457 (2021).
Zuo, Y. et al. All-optical neural network with nonlinear activation functions. Optica 6, 1132–1137 (2019).
Van Den Oord, A., Kalchbrenner, N. & Kavukcuoglu, K. Pixel recurrent neural networks. In International Conference on Machine Learning 1747–1756 (PMLR, 2016).
Zhao, J. et al. Fast all-optical random number generator. Preprint at https://doi.org/10.48550/arXiv.2201.07616 (2022).
Chen, Z. et al. Deep learning with coherent VCSEL neural networks. Nat. Photonics 17, 723–730 (2023).
Valdez, F., Mere, V. & Mookherjea, S. 100 GHz bandwidth, 1 volt integrated electro-optic Mach-Zehnder modulator at near-IR wavelengths. Optica 10, 578–584 (2023).
He, M. et al. High-performance hybrid silicon and lithium niobate Mach-Zehnder modulators for 100 Gbit s−1 and beyond. Nat. Photonics 13, 359–364 (2019).
Wang, C. et al. Integrated lithium niobate electro-optic modulators operating at CMOS-compatible voltages. Nature 562, 101–104 (2018).
Lischke, S. et al. High bandwidth, high responsivity waveguide-coupled germanium pin photodiode. Opt. Express 23, 27213–27220 (2015).
Lischke, S. et al. Ultra-fast germanium photodiode with 3-dB bandwidth of 265 GHz. Nat. Photonics 15, 925–931 (2021).
Ashtiani, F., Geers, A. J. & Aflatouni, F. An on-chip photonic deep neural network for image classification. Nature 606, 501–506 (2022).
Jiang, H. et al. A novel true random number generator based on a stochastic diffusive memristor. Nat. Commun. 8, 882 (2017).
Chen, X. et al. Magnetic-tunnel-junction-based true random-number generator with enhanced generation rate. Phys. Rev. Appl. 18, L021002 (2022).
Qin, H. et al. Binary neural networks: a survey. Pattern Recognit. 105, 107281 (2020).
Umuroglu, Y. et al. FINN: a framework for fast, scalable binarized neural network inference. In Proc. 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays 65 (ACM, 2017).
Wright, L. G. et al. Deep physical neural networks trained with backpropagation. Nature 601, 549–555 (2022).
Momeni, A., Rahmani, B., Mallejac, M., Del Hougne, P. & Fleury, R. Backpropagation-free training of deep physical neural networks. Science 382, 1297–1303 (2023).
Bottou, L. Large-scale machine learning with stochastic gradient descent. In Proc. COMPSTAT 2010 177–186 (Springer, 2010).
Zur, R. M., Jiang, Y., Pesce, L. L. & Drukker, K. Noise injection for training artificial neural networks: a comparison with weight decay and early stopping. Med. Phys. 36, 4810–4818 (2009).
Naveed, H. et al. A comprehensive overview of large language models. Preprint at https://doi.org/10.48550/arXiv.2307.06435 (2023).
Ho, J., Jain, A. & Abbeel, P. Denoising diffusion probabilistic models. In Advances in Neural Information Processing Systems 33, 6840–6851 (2020).
Liu, Z. et al. KAN: Kolmogorov-Arnold networks. Preprint at https://doi.org/10.48550/arXiv.2404.19756 (2024).
Horodynski, M. et al. Stochastic logic in biased coupled photonic probabilistic bits. Preprint at https://doi.org/10.48550/arXiv.2406.04000 (2024).
Acknowledgements
The authors thank Wei Wang (Peng Cheng Laboratory) for providing resources used during the experimental demonstration of SBNNs. The authors also acknowledge Joe Steinmeyer for helpful discussions on FPGA. S.C. acknowledges support from the Korea Foundation for Advanced Studies Overseas PhD Scholarship. Y.S. acknowledges support from the Swiss National Science Foundation (SNSF) through the Early Postdoc Mobility Fellowship No. P2EZP2-188091. C.R.-C. is supported by a Stanford Science Fellowship. D.L., Z.C., and M.S. acknowledge support from the National Science Foundation under Cooperative Agreement PHY-2019786 (The NSF AI Institute for Artificial Intelligence and Fundamental Interactions, http://iaifi.org/). M.H. acknowledges funding by the Austrian Science Fund (FWF) through grant J4729. J.S. acknowledges earlier support from a Mathworks Fellowship. This material is based upon work supported by the U.S. Department of Energy, Office of Science, National Quantum Information Science Research Centers, Co-design Center for Quantum Advantage (C2QA) under contract number DE-SC0012704. This material is also based upon work sponsored in part by the U.S. Army DEVCOM ARL Army Research Office through the MIT Institute for Soldier Nanotechnologies under Cooperative Agreement number W911NF-23-2-0121.
Author information
Authors and Affiliations
Contributions
S.C., Y.S., C.R.-C., J.S., and M.S. conceived the original idea. R.D. and D.L. contributed to the development of machine learning algorithms. S.C. and Y.S. built the experimental setup with contributions from C.R.-C., M.H., and S.Z.U.; S.C. acquired and analyzed the data. S.C. developed the code for the electronic processing unit and trained the neural networks with contributions from R.D., D.L., and Z.C.; M.S. supervised the project. The manuscript was written by S.C., Y.S., and C.R.-C., with inputs from all authors.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Leong Chuan Kwek, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Choi, S., Salamin, Y., Roques-Carmes, C. et al. Photonic probabilistic machine learning using quantum vacuum noise. Nat Commun 15, 7760 (2024). https://doi.org/10.1038/s41467-024-51509-0