Beruflich Dokumente
Kultur Dokumente
??? ???
What do we need?
TRAINING DATA
Training phase
DOG (1)
NOT DOG (0)
87% DOG
13% NOT DOG
Byte
Kilobyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Megabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Gigabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Terabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Petabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
One Byte
Exabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean Zettabyte
Byte : one grain of rice
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Zettabyte : Fills the Pacific Ocean Yottabyte
Yottabyte : A EARTH SIZE RICE BALL!
Byte : one grain of rice
Hobbyist
Kilobyte : cup of rice
Megabyte : 8 bags of rice
Desktop
Gigabyte : 3 Semi trucks
Terabyte : 2 Container Ships
Internet
Petabyte : Blankets Manhattan
Exabyte : Blankets west coast states
Big Data
Zettabyte : Fills the Pacific Ocean
Yottabyte : A EARTH SIZE RICE BALL! The Future
1 Yottabyte
1 Xenottabyte
1 Shilentnobyte
1 Domegemegrottebyte
1 Icosebyte
1 Monoicosebyte
Where does all this data
come from?
The Power of the Crowd
Machine Learning
Exploratory
Iterative
Quality of data: Garbage In Garbage Out
Multi-disciplinary team
D3.js - Open Source
Examples in Transport & Logistics
IBM Watson
Ahlers Supply Network Innovation & Analytics (ASNIA)
Transmetrics big data cargo platform – a rigorous approach deriving benefits from current and future data
1 2 3 4
Data uptake Demand AI Execution
cleansing forecast optimizatio Controlling
and modelling n
enrichment
Challenging Good to
very good
• Other issues (e.g. senders with • Grouped senders by AI; among others found a
multiple name spellings) customer with 330+ different accounts/names
Methodology / Behind the scenes
Real situation
(as observed on
the warehouse
floor)
sven.verstrepen@ahlers.com
http://www.linkedin.com/in/svenverstrepen
http://www.mentat-it.be
Frank Salliau
Independent Data Scientist & Machine Learning Expert
frank.salliau@mentat-it.be
https://be.linkedin.com/in/franksalliau