Table of Contents
- Introduction
- Why Image Recognition Matters
- Classical Challenges in Visual AI
- Quantum Advantages for Image Processing
- Encoding Images into Quantum Circuits
- Angle, Basis, and Amplitude Encoding
- Quantum Feature Extraction
- Variational Quantum Classifiers for Images
- Quantum Convolutional Neural Networks (QCNNs)
- Hybrid CNN-QNN Architectures
- Quantum Support Vector Machines (QSVMs)
- Quantum Kernels for Visual Similarity
- Image Datasets for Quantum ML
- Example: Quantum MNIST Classifier
- Implementation in PennyLane and Qiskit
- Training Strategies and Cost Functions
- Hardware Constraints and Circuit Efficiency
- Performance Benchmarks and Results
- Limitations and Future Research
- Conclusion
1. Introduction
Quantum machine learning (QML) is beginning to transform the field of image recognition by offering alternative methods of encoding, processing, and classifying visual data through quantum circuits. With limited but promising results, QML in vision is gaining momentum.
2. Why Image Recognition Matters
- Used in self-driving cars, security, medical imaging, and robotics
- Relies on high-dimensional data and complex patterns
- One of the most computationally demanding areas in ML
3. Classical Challenges in Visual AI
- Scaling CNNs requires massive GPU resources
- Adversarial robustness issues
- Difficulty generalizing across domains
4. Quantum Advantages for Image Processing
- Compact encoding of pixel patterns
- Quantum entanglement for modeling spatial correlation
- Potential quantum speedup in pattern classification
5. Encoding Images into Quantum Circuits
- Flatten images into 1D vectors
- Normalize pixel values to range suitable for qubit rotations
- Reduce dimensionality to fit on available qubits
6. Angle, Basis, and Amplitude Encoding
- Angle: \( x_i
ightarrow RY(x_i) \) - Basis: map binary pixels directly to |0⟩ and |1⟩
- Amplitude: encode pixel values into amplitudes (needs normalization)
7. Quantum Feature Extraction
- Use parameterized quantum circuits to extract high-level features
- Output expectation values from observables as feature maps
8. Variational Quantum Classifiers for Images
- Build VQC with parameterized gates and entanglers
- Train using cross-entropy or hinge loss
- Output is binary or multi-class prediction
9. Quantum Convolutional Neural Networks (QCNNs)
- Inspired by classical CNNs
- Local qubit filters followed by entangling layers and pooling
- Hierarchical quantum representation of image data
10. Hybrid CNN-QNN Architectures
- Use CNN layers for low-level features
- Quantum circuit classifies final embeddings
- Enables transfer learning + quantum classification
11. Quantum Support Vector Machines (QSVMs)
- Compute kernel in Hilbert space using fidelity between quantum states
- Classify images based on similarity in quantum feature space
12. Quantum Kernels for Visual Similarity
- Use quantum circuits to build Gram matrices
- Apply kernel-based algorithms (SVM, KRR) for image tasks
13. Image Datasets for Quantum ML
- MNIST (digit classification)
- Fashion-MNIST
- CIFAR-10 (simplified subsets)
- Custom binary image tasks
14. Example: Quantum MNIST Classifier
- Encode 4×4 or 8×8 patches into qubit registers
- Use VQC for classification
- Benchmark against classical MLP or SVM
15. Implementation in PennyLane and Qiskit
- PennyLane:
qml.qnode
for image VQC - Qiskit: Use
QuantumCircuit
+qiskit_machine_learning
VQC classes
16. Training Strategies and Cost Functions
- Use cross-entropy, hinge loss
- Apply gradient descent with parameter-shift or SPSA
- Hybrid optimizers like Adam + COBYLA
17. Hardware Constraints and Circuit Efficiency
- Limited qubit count restricts image size
- Use dimensionality reduction (PCA) or patching
- Circuit depth must be shallow for NISQ compatibility
18. Performance Benchmarks and Results
- Comparable accuracy on simple tasks (e.g., MNIST-4×4)
- Quantum kernel methods outperform classical on small datasets
- Hybrid models scale better than pure quantum ones
19. Limitations and Future Research
- Scalability to large images remains a challenge
- Encoding overhead is non-trivial
- Need better error mitigation for real devices
20. Conclusion
Quantum ML offers exciting possibilities for image recognition, especially through quantum-enhanced feature extraction, classification, and kernel methods. While limited by current hardware, hybrid approaches show promise, pointing toward a future of quantum-augmented vision systems.