Willkommen bei Scribd!

Scaling MySQL and Java in High Write Throughput Environments Presentation

Hochgeladen von

100% fanden dieses Dokument nützlich (6 Abstimmungen)

2K Ansichten20 Seiten

We present the backend architecture behind Spinn3r – our scalable web and blog crawler. Most existing work in scaling MySQL has been around high read throughput environments similar to web applications. In contrast, at Spinn3r we needed to complete thousands of write transactions per second in order to index the blogosphere at full speed. We have achieved this through our ground up development of a fault tolerant distributed database and compute infstructure all built on top of cheap commodity hardware. We’ve built out a number of technologies on top of MySQL that help enable us to easily scale operations. We’ve implemented an Open Source load balancing JDBC driver named lbpool. (http://code.tailrank.com/lbpool). Lbpool allows us to loosely couple our MySQL slaves which allow us to gracefully handle system failures. It also supports load balancing, reprovisioning, slave lag, and other advanced features not available in the stock MySQL JDBC driver. We’ve also built out a sharded database similar to infrastructure built at other companies such as Google (Adwords) and Yahoo (Flickr). Our sharded DB has a number of interesting properties including ultra high throughput requirements (we process 52TB per month), distributed sequence generation, and query plan execution. - Kevin Burton (Tailrank), Jonathan Moore (Tailrank/spinn3r)

Copyright

Verfügbare Formate

PDF, TXT oder online auf Scribd lesen

Dieses Dokument teilen

Dokument teilen oder einbetten

Freigabeoptionen

Stufen Sie dieses Dokument als nützlich ein?

Sind diese Inhalte unangemessen?

Dieses Dokument melden

Copyright:

Attribution Non-Commercial (BY-NC)

Verfügbare Formate

Als PDF, TXT herunterladen oder online auf Scribd lesen

Markieren Sie unangemessene Inhalte

100% fanden dieses Dokument nützlich (6 Abstimmungen)

2K Ansichten20 Seiten

Scaling MySQL and Java in High Write Throughput Environments Presentation

Hochgeladen von

zmg

Copyright:

Attribution Non-Commercial (BY-NC)

Verfügbare Formate

Als PDF, TXT herunterladen oder online auf Scribd lesen

Markieren Sie unangemessene Inhalte

Zu Seite

Sie sind auf Seite 1von 20

Im Dokument suchen

Scaling MySQL and Java in

High Write Throughput

Environments
How we built Spinn3r

1
What is Spinn3r?
• Licensed weblog crawler
• 500k posts per hour (RSS+HTML)
• 3.5TB of content
• 10 months of blog archives
• 3B documents
• 80Mb /s - 24/7

2
Hardware
• ~40 servers
– Quad Core
– 8GB memory
– Gigabit ethernet
– Dual SATA (software RAID 0)
• Moving to SSD

3
Write Throughput
• 90% write, 10% read
• MyISAM didn’t scale
– Too many seeks in high write load
• InnoDB with write ahead log
– 1/5th of effective disk bandwidth
– Improve the fuzzy checkpointing logic
– Just continually write memory images (log
structured)
– 1.5 minutes to write an 8G image

4
Database Sharding
• Split data across shards based on PK
– hashcode of URL
• Range routing
• Limitations
– No triggers
– No foreign keys
– No transactions
• Similar philosophy to Bigtable, S3,
Dynamo, etc

5
Shard Architecture

6
Query Limitations
• No functions in WHERE clauses
• LIMIT required
• Query should be deterministic
– ORDER BY
– ID = N
• Must order by some column to page
• No offset
• No aggregate functions

7
Shard Insertion
• Bulk insert data
– Custom API
– Operate on lists, commit every N records
or T minutes.
– INSERT … ON DUPLICATE KEY UPDATE
• Parallel dispatch architecture

8
In-memory Storage
• Metadata
– queue
– graph
• Deprecated memcached
• Allows InnoDB to execute at speed
• WAL allows disk to write at about
40MB/s

9
On-disk Storage
• 2.5 TB of content (full HTML and RSS)
• Numerous backup copies
• RAID caching controllers with BBU
• InnoDB blobs with to append-only and
‘eventually immutable’ tables.
• Gzip compressed (3x savings)
– Reduces the # of IOs by trading CPU

10
Resource/Primary Key
• Key is truncate(SHA1(resource+secret))
• Deterministic mechanism for key
generation
– works across robots
• Works well with shards
• Routable
• Decentralized
• Avoid clustered indexes

11
Distributed Lock Manager
• acquire( lock )
• renew( lock )
• Similar to Google’s chubby
• See Paxos algorithm for distributed
consensus
• Good for master servers, failover, etc.
• We use this for master queue promotion

12
Sequence Generation
• Need monotonically increasing
sequences
– Paging through results
• Settled on global prefix+local suffix with
a distributed lock manager
• Used in shards to page across results.
– paging on time is hard/impossible due to
collision

13
Task/Queue
• Similar to MapReduce
• Central queue
– Fault tolerant
– Sharded for scale
• Distributed tasks
• Executes robot jobs over 30 machines
• Supports heterogeneous machines

14
JDBC Load Balancing
• Created lbpool
– Licensed to MySQL (Open Source)
• Load balanced connection pool
• Replication aware
• Handles runtime rebalancing
– slave lag
– broken slaves
• Fault tolerant

15
User Defined Functions
• Necessary for distributed databases
• Row level locks to avoid race conditions
• Increment
• Bloom filters
• Zeta codes
• Histographs

16
Solid State Storage
• NAND based flash devices
• SUPER fast reads
– 15k 4k reads per second
– ~250/s for HDDs
• Regular performance writes
– Small InnoDB buffer pool
• Historically avoided to due high MTBF

17
Current SSD state
• $30 / GB
• 16/32/64 GB capacity
• Mtron
• Memoright
• STEC
• ~ 100MB/s sequential write
• ~ 120MB/s sequential read

18
The Future of DB Storage
• SSD for in-memory data
• 10x performance boost for 20% cost
increase.
– $30/GB now -> $15/GB in Q2-Q3
• Mainstream in 2009
• MUCH more data per node
• Log structured databases
• See benchmarks

19
Questions
• Further reading:
– feedblog.org
– spinn3r.com
– feedblog.org/category/ssd/
– code.google.com/p/mysql-lbpool/
– Paxos algorithm
– Chubby

Das könnte Ihnen auch gefallen

The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Von Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Bewertung: 4 von 5 Sternen
4/5 (5794)
Capacity Planning For Web Operations Presentation
Dokument52 Seiten
Capacity Planning For Web Operations Presentation
zmg
100% (5)
The Little Book of Hygge: Danish Secrets to Happy Living
Von Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Bewertung: 3.5 von 5 Sternen
3.5/5 (399)
The Top 20 Design Tips For MySQL Enterprise Data Architects
Dokument38 Seiten
The Top 20 Design Tips For MySQL Enterprise Data Architects
Oleksiy Kovyrin
91% (11)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Von Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Bewertung: 3.5 von 5 Sternen
3.5/5 (231)
Portable Scale-Out Benchmarks For MySQL Presentation
Dokument45 Seiten
Portable Scale-Out Benchmarks For MySQL Presentation
zmg
100% (3)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Von Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Bewertung: 4 von 5 Sternen
4/5 (894)
Monitoring Scale-Out With The MySQL Enterprise Monitor
Dokument26 Seiten
Monitoring Scale-Out With The MySQL Enterprise Monitor
Oleksiy Kovyrin
100% (2)
The Yellow House: A Memoir (2019 National Book Award Winner)
Von Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Bewertung: 4 von 5 Sternen
4/5 (98)
Real World Web: Performance & Scalability
Dokument189 Seiten
Real World Web: Performance & Scalability
Oleksiy Kovyrin
100% (26)
Shoe Dog: A Memoir by the Creator of Nike
Von Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Bewertung: 4.5 von 5 Sternen
4.5/5 (537)
DBSlayer A Simpler Way To Proxy Presentation
Dokument28 Seiten
DBSlayer A Simpler Way To Proxy Presentation
zmg
100% (1)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Von Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Bewertung: 4.5 von 5 Sternen
4.5/5 (474)
MySQL Proxy: The Complete Tutorial (Full Day) Presentation
Dokument90 Seiten
MySQL Proxy: The Complete Tutorial (Full Day) Presentation
Oleksiy Kovyrin
100% (4)
Never Split the Difference: Negotiating As If Your Life Depended On It
Von Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Bewertung: 4.5 von 5 Sternen
4.5/5 (838)
Monitoring Scale-Out With The MySQL Enterprise Monitor
Dokument26 Seiten
Monitoring Scale-Out With The MySQL Enterprise Monitor
Oleksiy Kovyrin
100% (2)
Grit: The Power of Passion and Perseverance
Von Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Bewertung: 4 von 5 Sternen
4/5 (587)
Building Scalable: High Performance Datamarts With MySQL
Dokument103 Seiten
Building Scalable: High Performance Datamarts With MySQL
Oleksiy Kovyrin
100% (3)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Von Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Bewertung: 4.5 von 5 Sternen
4.5/5 (265)
Architecture of Maria A New Storage Engine With A Transactional Design Presentation
Dokument19 Seiten
Architecture of Maria A New Storage Engine With A Transactional Design Presentation
yejr
Noch keine Bewertungen
Yes Please
Von Everand
Yes Please
Amy Poehler
Bewertung: 4 von 5 Sternen
4/5 (1891)
23-34&5) ,-1367&8) 9,0&: ) (/&?1341&@#"A343" (3& BCCD EF4, &GHIGJ7&BCCD
Dokument31 Seiten
23-34&5) ,-1367&8) 9,0&: ) (/&?1341&@#"A343" (3& BCCD EF4, &GHIGJ7&BCCD
warwithin
Noch keine Bewertungen
Angela's Ashes: A Memoir
Von Everand
Angela's Ashes: A Memoir
Frank McCourt
Bewertung: 4.5 von 5 Sternen
4.5/5 (440)
9AKK106103A1925 - E Anchorage 2015 Symphony Plus
Dokument65 Seiten
9AKK106103A1925 - E Anchorage 2015 Symphony Plus
Format_C
Noch keine Bewertungen
The Emperor of All Maladies: A Biography of Cancer
Von Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Bewertung: 4.5 von 5 Sternen
4.5/5 (271)
Log
Dokument76 Seiten
Log
Wilfredo Noa Ore
Noch keine Bewertungen
On Fire: The (Burning) Case for a Green New Deal
Von Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Bewertung: 4 von 5 Sternen
4/5 (73)
CSE522 5 Sp14 Scheduling Aperiodic
Dokument29 Seiten
CSE522 5 Sp14 Scheduling Aperiodic
ANNAPUREDDY RAVINDER REDDY
Noch keine Bewertungen
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Von Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Bewertung: 4.5 von 5 Sternen
4.5/5 (344)
ETABS Pulling Results Code Probably
Dokument13 Seiten
ETABS Pulling Results Code Probably
Chirag Joshi
Noch keine Bewertungen
Team of Rivals: The Political Genius of Abraham Lincoln
Von Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Bewertung: 4.5 von 5 Sternen
4.5/5 (234)
iTN8601 (A) Configuration Guide (LCT) (Rel - 01)
Dokument82 Seiten
iTN8601 (A) Configuration Guide (LCT) (Rel - 01)
Jose Antonio Garza Gonzalez
Noch keine Bewertungen
Fear: Trump in the White House
Von Everand
Fear: Trump in the White House
Bob Woodward
Bewertung: 3.5 von 5 Sternen
3.5/5 (738)
Java EnvSetup
Dokument18 Seiten
Java EnvSetup
myth.superking
Noch keine Bewertungen
The Glass Castle: A Memoir
Von Everand
The Glass Castle: A Memoir
Jeannette Walls
Bewertung: 4.5 von 5 Sternen
4.5/5 (1712)
Win Plot For Al Got Ex Getting Started
Dokument24 Seiten
Win Plot For Al Got Ex Getting Started
Dora BA
Noch keine Bewertungen
Rise of ISIS: A Threat We Can't Ignore
Von Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Bewertung: 3.5 von 5 Sternen
3.5/5 (137)
Activate Windows 10 for FREE
Dokument1 Seite
Activate Windows 10 for FREE
Achmad Ardiyanto Amzak
100% (1)
Principles: Life and Work
Von Everand
Principles: Life and Work
Ray Dalio
Bewertung: 4 von 5 Sternen
4/5 (599)
Pathloss Files Conversion Methods 0 PDF
Dokument5 Seiten
Pathloss Files Conversion Methods 0 PDF
Kyle Diop
Noch keine Bewertungen
The Unwinding: An Inner History of the New America
Von Everand
The Unwinding: An Inner History of the New America
George Packer
Bewertung: 4 von 5 Sternen
4/5 (45)
Squirrel Installation
Dokument4 Seiten
Squirrel Installation
Aravind Bs
Noch keine Bewertungen
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Von Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Bewertung: 3.5 von 5 Sternen
3.5/5 (2219)
States of A Thread in Java - GeeksforGeeks
Dokument15 Seiten
States of A Thread in Java - GeeksforGeeks
Fousiya Fousi
Noch keine Bewertungen
Steve Jobs
Von Everand
Steve Jobs
Walter Isaacson
Bewertung: 4.5 von 5 Sternen
4.5/5 (806)
Methodology of AUTOSAR and Communication in AUTOSAR-1-1 PDF
Dokument49 Seiten
Methodology of AUTOSAR and Communication in AUTOSAR-1-1 PDF
Nandini
Noch keine Bewertungen
John Adams
Von Everand
John Adams
David McCullough
Bewertung: 4.5 von 5 Sternen
4.5/5 (2409)
Intro License Smart PDF
Dokument48 Seiten
Intro License Smart PDF
Abdelhai
Noch keine Bewertungen
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Von Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Bewertung: 4 von 5 Sternen
4/5 (1090)
RPC - Remote Procedure Call Overview
Dokument26 Seiten
RPC - Remote Procedure Call Overview
Jayaprabha Kanase
Noch keine Bewertungen
Bad Feminist: Essays
Von Everand
Bad Feminist: Essays
Roxane Gay
Bewertung: 4 von 5 Sternen
4/5 (1015)
AN ESC Comparison V1i7
Dokument16 Seiten
AN ESC Comparison V1i7
mike
Noch keine Bewertungen
The Outsider: A Novel
Von Everand
The Outsider: A Novel
Stephen King
Bewertung: 4 von 5 Sternen
4/5 (1839)
CS8493 Operating Systems Question Bank
Dokument38 Seiten
CS8493 Operating Systems Question Bank
dred
Noch keine Bewertungen
Brooklyn: A Novel
Von Everand
Brooklyn: A Novel
Colm Toibin
Bewertung: 3.5 von 5 Sternen
3.5/5 (1937)
Operation Manual EnUS 2667326603
Dokument168 Seiten
Operation Manual EnUS 2667326603
Freddy A. Meza Diaz
Noch keine Bewertungen
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Von Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Bewertung: 4.5 von 5 Sternen
4.5/5 (119)
Kd67x 4 Manual
Dokument108 Seiten
Kd67x 4 Manual
luis009
Noch keine Bewertungen
A Man Called Ove: A Novel
Von Everand
A Man Called Ove: A Novel
Fredrik Backman
Bewertung: 4.5 von 5 Sternen
4.5/5 (4609)
Student Guide NetApp Accredited Storage Architecture Professional Workshop
Dokument540 Seiten
Student Guide NetApp Accredited Storage Architecture Professional Workshop
Owen
Noch keine Bewertungen
The Light Between Oceans: A Novel
Von Everand
The Light Between Oceans: A Novel
M.L. Stedman
Bewertung: 4.5 von 5 Sternen
4.5/5 (789)
Flex Pod
Dokument32 Seiten
Flex Pod
Tom Baker
Noch keine Bewertungen
The Woman in Cabin 10
Von Everand
The Woman in Cabin 10
Ruth Ware
Bewertung: 3.5 von 5 Sternen
3.5/5 (2322)
Load Balancing in Cloud Computing Seminar Report
Dokument13 Seiten
Load Balancing in Cloud Computing Seminar Report
Heena Mehra
100% (1)
Manhattan Beach: A Novel
Von Everand
Manhattan Beach: A Novel
Jennifer Egan
Bewertung: 3.5 von 5 Sternen
3.5/5 (792)
Apuntador Logitech R400
Dokument2 Seiten
Apuntador Logitech R400
Ivonne Santiago
Noch keine Bewertungen
The Perks of Being a Wallflower
Von Everand
The Perks of Being a Wallflower
Stephen Chbosky
Bewertung: 4.5 von 5 Sternen
4.5/5 (2099)
Timers and Counterschapter5
Dokument15 Seiten
Timers and Counterschapter5
Anup Fearless
Noch keine Bewertungen
Wolf Hall: A Novel
Von Everand
Wolf Hall: A Novel
Hilary Mantel
Bewertung: 4 von 5 Sternen
4/5 (3811)
Vulnerabilities in TCP/IP Protocols
Dokument61 Seiten
Vulnerabilities in TCP/IP Protocols
Saif Ullah
Noch keine Bewertungen
Little Women
Von Everand
Little Women
Louisa May Alcott
Bewertung: 4 von 5 Sternen
4/5 (104)
CANopen 30
Dokument14 Seiten
CANopen 30
sridevi
Noch keine Bewertungen
The Art of Racing in the Rain: A Novel
Von Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Bewertung: 4 von 5 Sternen
4/5 (4200)
B APIC NXOS CLI User Guide Chapter 0111
Dokument178 Seiten
B APIC NXOS CLI User Guide Chapter 0111
ponco wiseno
Noch keine Bewertungen
Sing, Unburied, Sing: A Novel
Von Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Bewertung: 4 von 5 Sternen
4/5 (1103)
G1701-90112 GCMS Software Installation
Dokument65 Seiten
G1701-90112 GCMS Software Installation
Alaa Bassyouny
Noch keine Bewertungen
A Tree Grows in Brooklyn
Von Everand
A Tree Grows in Brooklyn
Betty Smith
Bewertung: 4.5 von 5 Sternen
4.5/5 (1929)
Clusterware Administration and Deployment Guide
Dokument767 Seiten
Clusterware Administration and Deployment Guide
Jack Wang
Noch keine Bewertungen
The Constant Gardener: A Novel
Von Everand
The Constant Gardener: A Novel
John le Carre
Bewertung: 3.5 von 5 Sternen
3.5/5 (104)
Discovery Improvements
Dokument22 Seiten
Discovery Improvements
prdelong
Noch keine Bewertungen
Her Body and Other Parties: Stories
Von Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Bewertung: 4 von 5 Sternen
4/5 (821)
Manual OmniDrive USB2 CF V1-21E
Dokument2 Seiten
Manual OmniDrive USB2 CF V1-21E
Juan Rios
Noch keine Bewertungen