Meister Isef 4

Improving Robustness of the NSLS-II X-ray Scattering
Image Neural Network with Data Augmentation

INTRODUCTION RESEARCH QUESTION PROCEDURE RESULTS
How will Data Augmentation and Instrument-
Specific Noise Injection improve the accuracy of
the Convolutional Neural Network?
Figure 1: Example Synchrotron: Figure 2: Diagram of x-ray
National Synchrotron Light scattering process in
Source II (NSLS-II)
Image taken by bnl.gov
(Brookhaven National Lab) [11]
synchrotron.
Diagram from Kevin Yager [6] PROCEDURE
1. Neural Network
Synchrotrons:
- TensorFlow’s deep CNN (AlexNet) [14] for image labeling [12]
- Allow researchers to peer into the nano-scale to
understand the structure of new materials
- By understanding the structure of materials,
scientists can accelerate their research because Figure 10: Example of x-ray scattering image
they better understand the function of the with 6-fold symmetry. Image taken from Kevin
Yager.
material (form equals function) Figure 4: Architecture for AlexNet. Image taken from Hao Gao [15]
- How Synchrotrons Works:
- To train the CNN, images are passed through initial convolutional Figure 11. Increase in accuracy for each class shows that data augmentation
- Accelerates electrons into a large circular orbit
layers and a subsequent output layer of softmax regression affects different classes and different types of noise differently. The neural
- Use magnetic fields to deflect the electrons network was able to classify certain labels extremely well (Higher Order,
- Stochastic gradient descent algorithm used to minimize the loss
and release extremely bright x-ray light Rings) with a mAP over 80%, and therefore the performance did not increase
- Training terminated when the loss was below 0.10.
- Light is directed to a chamber that contains a much after data augmentation since the initial performance was so high.
sample of interest There were some attributes such as Diffuse High-q that were initially hard to
- X-ray light scatters off molecule classify (40% accuracy) and data augmentation improved the classification of
2. Synthetic Dataset those by 30%.
- A detector picks up the scattered light and - Impossible to tag 50,000 X-ray scattering images
produces X-ray scattering images Figure 6: Chart that demonstrates the types of noise and the varying level of
- Generate pre-tagged images noise applied to the X-ray scattering images. Class-conditional data augmentation as proposed by Salamon and Bello [16]
- Images reveal structural information with could improve the performance of a neural network for specific classes.
patterns of rings, halos, spots, etc. that arise Chart created by the Finalist Class-conditional DA consists of increasing the number of images that the
when scattered waves interfere. neural network cannot classify well (ie. symmetric rings) in training may
- Images provide insight into size, orientation, 4. Training/Testing CNN: Data Augmentation increase the overall accuracy (mAP).
packing of atoms
Figure created by the Finalist.
• Generate 50k noise-free x-ray scattering images
Train • Train the Neural Network
(Baseline)
• Generate 5k x-ray scattering images
• Apply Noise to Images using Python script
CONCLUSION
Apply
Noise • Evaluate image performance in neural network
- Newly trained CNN is successful in recognizing images
• Generate 50k noise-free images
• Generate 50k images with random amounts of noise with noise
Figure 3: Example of material with 6-fold symmetry Data
structure from an X-ray scattering image (left) Augment- • Retrain Neural Network with 100k images - Newly trained CNN always outperform the previous
compared to image from an electron microscope ation • Evaluate and compare the performance of the CNN that was only trained on noise-free images
(right). The x-ray scattering image reveals much more original 5k images in the newly trained neural - Data augmentation, specifically noise injection, is a
structural detail. Images taken from Kevin Yager [6] Evaluate network
Figure 5: Chart that compares the real X-ray scattering images and successful regularization technique for recognizing X-
synthetic X-ray scattering images to illustrate the synthetic and real Figure 7: Figure created by the Finalist
images look very similar. ray scattering images
Applications:
- Many different fields depend on X-ray scattering
Image taken from Kevin Yager [6]
RESULTS - This discovery advances the novel and groundbreaking
research scientists are performing with synchrotrons
images to accelerate their research Comparing CNN Before and After Training with because the image labeling process is now automated
- Biology: 3. Noise Injection Augmented Data and the algorithm can classify images with noise.
- Observe molecules undergo conformational Gaussian Noise: image pixels take on values that are Gaussian- 0.7
- With higher accuracy, we are closer to solving the big
mAP (Mean Average Precision)
changes (from the environment) [2] distributed 0.6

• Cause: beam line doesn’t align correctly, blurry lenses data issue and taking advantage of a synchrotron’s full
- Design novel drugs (understand how the 0.5
molecule of interest fits into biological • Apply Gaussian distribution to values in image array to 0.4 potential
No Noise
0.3
systems) [4]
- Chemistry: Salt and Pepper Noise: sparsely occurring black and white pixels 0.2 Data
Augmentation
- Future Work:
- How molecules interact with each other • Cause: sharp sudden disturbances in signal 0.1
- How enzymes perform catalysis [3] • Apply Poisson distribution to randomly corrupt values in image array 0
- Train the CNN with real NSLS-II X-ray scattering
- Electronics: images
- Design microelectronics [5] Poisson Counting Statistic : capturing an x-ray scattering image is a - Transfer learning: apply information learned from
- Material Science: chance observation [10] synthetic images to classify real image data [13]
- Apply changes in pressure or temperature to • Cause: If a few photons in a beam of light hit a detector every
second, the fluctuations in the number of photons is significant.
- Class Conditional Data Augmentation
see how a molecule reacts (ex: developing a Figure 8: Comparison of CNN performance with and without data
material harder than diamond) • If the pixel has a higher brightness, it has a higher chance to be augmentation. After data augmentation, the neural network
varied
• λ = value in image array
consistently better classifies images with different types of noise and
different levels of noise. The performance neural network increases REFERENCES
Issue: Big Data • Poisson distribution: right tailed distribution the most on images with Gaussian noise, then Salt and Pepper noise,
the least Poisson Statistics
• Apply Poisson Distribution (right tailed distribution) to model a [1] B. Wang, K. G. Yager, D. Yu, and M. Hoai, "X-ray scattering image classification using deep
- Synchrotrons produce 1-4 TB/day
chance observation Figure created by the Finalist.
learning" 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), 1: 697–704,
- 60 beam lines (60 concurrent experiments) 2017. [2] B. A. Wallace, “Conformational changes by synchrotron radiation circular dichroism
spectroscopy.” Nature Structural Biology 7, 708–709 (2000), doi:10.1038/78915 [3] J. Hajdu, K.R.
- Manual image analysis is complex Acharya, D.I. Stuart, P.J. McLaughlin, D. Barford, N.G. Oikonomakos, H. Klein, and L.N. Johnson,
- Image streams outpace manual image analysis
METHODOLOGY “Catalysis in the crystal: synchrotron radiation studies with glycogen phosphorylase B.” The EMBO
Journal 6 (2): 539-546, 1987. [4] A. C. Anderson, “The process of structure-based drug design.”
Chemistry & Biology 10 (9): 787–97, 2013. [5] J. Jordan-Sweet, “Synchrotron X-ray scattering
techniques for microelectronics-related materials studies” IBM Journal of Research and
Convolutional Neural Networks (CNN): X-ray Scattering Image CNN: Development 44(4): 457 – 476, 2000. [6] K. Yager, “Deep learning for automated synchrotron
experiments”. IEEE 2017 New York Scientific Data Summit, New York.
- Issue: http://www.esrf.eu/about/synchrotron-science/synchrotron [7] I. Goodfellow, Y. Bengio, and A.
- CNNs are specifically useful for processing data
- Low accuracy Courville, “Deep learning.” MIT Press, 2016. [8] J. Sietsma and R. J. F. Dow, “Creating artificial
that has a “grid-like topology” such as “image neural networks that generalize.” Neural Networks 4 (1): 67–79, 1991. [9] R. M. Zur, Y. Jiang, L. L.
- Neural Network is not robust to noise Pesce, and K. Drukker, “Noise injection for training artificial neural networks: A comparison with
data, which can be thought of as a 2D grid of
- Regularization Technique: Specialized Data Augmentation (DA) and weight decay and early stopping”. Medical Physics, 36(10), 4810–4818, 2009. [10] D. V.
pixels” [7] Perepelitsa, “Johnson noise and shot noise”. Dept. of Physics, MIT, 2006.
Noise Injection (NI) [8] http://web.mit.edu/dvp/Public/noise-paper.pdf [11] Department of Energy Approves
- CNNs simulate how the brain interprets images Figure 9. Impact of the different types of noise on the CNN’s ability to
- DA: Increase size of training dataset classify the images. After DA, the performance of the CNN does not Construction Start of NSLS-II Project | BNL Newsroom. (2018). [12]
- As an infant learns, he/she trains with https://www.tensorflow.org/tutorials/deep_cnn [13] S. J. Pan and Q. Yang, “A survey on transfer
- NI: Train neural network with deliberately distorted images to deteriorate as much as before DA when exposed to images with higher learning”. IEEE Transactions on Knowledge and Data Engineering 22(10):1345 - 1359 · November
auditory cues to learn how to identify the levels of Gaussian noise and Salt and Pepper noise. This demonstrates
allow the CNN to view images with greater diversity [9] 2010. [14] Krizhevsky, A., Sutskever, I., and Hinton, G. E. ImageNet classification with deep
world around them that the network has truly become more robust to noise. After trained convolutional neural networks. In NIPS, pp. 1106–1114, 2012 [15] Gao, H. (2018). A Walk-through
- Traditional DA: Linear Transformations (translation, cropping), of AlexNet – Hao Gao – Medium. [16] J. Salamon and J. P. Bello, “Deep convolutional neural
on noisy images, it becomes better at generalizing and classifying
rotation, change in saturation or hue, etc. different types of images. networks and data augmentation for environmental sound classification” IEEE Signal Processing
Letters, 4(3):279-283, 2017.
- Specialized DA: Apply instrument-specific noise
Figure created by the Finalist.

Meister Isef 4

Hochgeladen von

Dokumentinformationen

Originaltitel

Copyright

Verfügbare Formate

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Copyright:

Verfügbare Formate

Meister Isef 4

Hochgeladen von

Copyright:

Verfügbare Formate

Improving Robustness of the NSLS-II X-ray Scattering

Image Neural Network with Data Augmentation

changes (from the environment) [2] distributed 0.6

Das könnte Ihnen auch gefallen