Sie sind auf Seite 1von 30
A New Strategy for Integrating E&P Unstructured Data Lonnie Chin (Talisman Energy) Terry Gardner (Schlumberger
A New Strategy for Integrating E&P
Unstructured Data
Lonnie Chin (Talisman Energy)
Terry Gardner (Schlumberger Information Systems)
PPDM 2009 Fall Conference
Top Secret! 1800 1600 1400 1200 1000 800 600 400 200 0 2006 2007 2008
Top Secret!
1800
1600
1400
1200
1000
800
600
400
200
0
2006
2007
2008
2009
2010
2011
Top Secret!
Exabytes
121
281
410
670
988
1610
What the $#@!% is a Zettabyte! 2011 - 1.61Zettabytes! 1 000 000 000 000 000
What the $#@!% is a Zettabyte!
2011 - 1.61Zettabytes!
1 000 000 000 000 000 000 000 bytes
zetta
exa
peta
tera
giga
mega
kilo
• 1 118 060 000 000 000 - 3.5 inch disks
• 2 300 000 000 000 - 700 MB CD
• 171 277 000 000 – 9.4 GB DVD
The Bad News in 0’s and 1’s • 2007 first year data creation outpaces data
The Bad News in 0’s and 1’s
• 2007 first year data creation outpaces
data storage*
• Est. 60-70% of all data is unstructured
• “Digital Shadow” now larger than
“Digital Footprint”**
• Forecasted size increase of 10X in 5
years and rate is increasing**
• Info containers growing 50% faster than
data volume**
Finding Meaning • Physical and environmental factors are a reality • Knowledge base and decision
Finding Meaning
• Physical and environmental factors are a
reality
• Knowledge base and decision making
capacity driven by unstructured data.
• Individuals create unstructured data but
organizations responsible for regulation**
• A time of “Perpetual Info-motion”
• Volume is not the only cause, volume +
variety is the problem
Keys to Success Understanding Corporate Structured Responsibility Foundation Automation/ Technology Integration
Keys to Success
Understanding
Corporate
Structured
Responsibility
Foundation
Automation/
Technology
Integration and
Business
Strategy
Why Am I Here? Problem Statement Current methods of unstructured data management cannot provide the
Why Am I Here?
Problem Statement
Current methods of unstructured data
management cannot provide the sophistication
or accommodate the volume and variety we are
encountering.
If we cannot keep up with today how do we
prepare for the issues of tomorrow?
Know the Players Unstructured data/information is computerized information that is either not bound by a
Know the Players
Unstructured data/information is computerized
information that is either not bound by a defined
data model or has one that is not easily defined
by logical programming.
Traditional Solution:
•Read the document, collect and store key
metadata that can be searched.
•EXTREMELY resource intensive.
Corporate Responsibility • CLEAR Policy • Governance • Defined Stewardship • Education • Enforcement
Corporate Responsibility
• CLEAR Policy
• Governance
• Defined Stewardship
• Education
• Enforcement
Structured Foundation • MDM development also vital for unstructured data management • Wellmaster MDM provides
Structured Foundation
• MDM development also vital for unstructured
data management
• Wellmaster MDM provides foundation to
impart structure
• Consistent taxonomy plays key role in
nongeographic tagging
• Wellmaster MDM will also leverage from
better unstructured data.
Automation and Technology • Developed by In-Q-Tel for CIA antiterrorism efforts • Proprietary natural language
Automation and Technology
• Developed by In-Q-Tel for CIA antiterrorism efforts
• Proprietary natural language processing technology
• Read a document and tag for geographic references
and assign coordinates
Search results are displayed on the
map
Click the document and you access it
instantly
No one has to read and catalog the
document for it to be searchable
Know the Players • Production Operations and Maintenance. • Reservoir Description. – Operating and maintenance
Know the Players
• Production Operations and Maintenance.
• Reservoir Description.
– Operating and maintenance objectives.
The Field Life Cycle.
– Reservoir geology.
– Production operations input to the FDP.
– Reservoir fluids.
- Exploration phase.
– Maintenance engineering input to the FDP.
– Data gathering.
- Appraisal phase.
– Data interpretation.
• Project and Contract Management.
– Phasing and organisation.
- Development planning.
• Volumetric Estimation.
– Planning and control.
– Deterministic methods.
- Production phase.
– Cost estimation and budgets.
– Expressing uncertainty.
- Decommissioning.
– Reasons for contracting.
• Field Appraisal.
– Types of contract.
– The role of appraisal in the field life cycle.
• Petroleum Economics.
– Identifying and quantifying sources of uncertainty.
• Exploration.
– Basic principles of development economics.
– Appraisal tools.
– Hydrocarbon accumulations.
– Constructing a project cashflow.
– Expressing reduction of uncertainty.
– Calculating a discounted cashflow.
– Exploration methods and techniques.
– Cost-benefit calculations for appraisal.
– Profitability indicators.
– Practical aspects of appraisal.
• Drilling Engineering.
– Project screening and ranking.
• Reservoir Dynamic Behaviour.
– Well planning.
– Per barrel costs.
– The driving force for production.
– Sensitivity analysis.
– Rig types and rig selection.
– Reservoir drive mechanisms.
– Exploration economics.
– Drilling systems and equipment.
– Gas reservoirs.
• Managing the Producing Field.
– Fluid displacement in the reservoir.
– Site preparation.
– Managing the subsurface.
– Reservoir simulation.
– Drilling techniques.
– Managing the surface facilities.
– Estimating the recovery factor.
– Managing the external factors.
– Casing and cementing.
– Estimating the production profile.
– Managing the internal factors.
– Enhanced oil recovery.
– Drilling problems.
• Managing Decline.
• Well Dynamic Behaviour.
– Costs and contracts.
– Infill drilling.
– Estimating the number of development wells.
• Safety and The Environment.
– Workover activity.
– Fluid flow near the wellbore.
– Enhanced oil recovery.
– Horizontal wells.
– Safety awareness.
– Production debottlenecking.
– Production testing and bottom hole pressure testing.
– Safety management systems.
– Incremental development.
– Tubing performance.
– Environment.
– Well completions.
• Decommissioning.
– Current environmental concerns.
– Artificial lift.
– Legislation.
– Economic lifetime.
• Surface Facilities.
– Decommissioning funding.
– Oil and gas processing.
– Decommissioning methods.
– Facilities.
Finding Documents in Multiple Repositories Reality more complicated
Finding Documents in Multiple Repositories
Reality more complicated
What is the Solution? Specifically for the oil and gas industry… You’d make all of
What is the Solution?
Specifically for the oil and gas
industry…
You’d make all of the documents, files,
and web content show up on a map…
This is what MetaCarta does.
MetaCarta: A Map-Based Search Appliance eSearch Appliance, connectors
MetaCarta: A Map-Based Search Appliance
eSearch
Appliance, connectors
Automation and Technology Nongeographic tagging • Example 1 – Tagging of documents to geologic time
Automation and Technology
Nongeographic tagging
• Example 1
– Tagging of documents to geologic time scale
– Custom entries in Gazetteer to create links
between keywords and taxonomy
• Example 2
– Tagging of documents to geographic coordinate
scheme designed for spatial analysis
– Custom entries in Gazetteer to create links
between keywords and predefined coordinates
Automation and Technology Engineering Geological Drilling Geophysical
Automation and Technology
Engineering
Geological
Drilling
Geophysical
Automation and Technology - eDios •Administration •Integrated Viewing •Asset Allocation •Classification
Automation and Technology - eDios
•Administration
•Integrated Viewing
•Asset Allocation
•Classification
•Metadata Collection
•Loading
•Records Integration
Automation and Technology - Datafinder •Integrated Viewing •Data Delivery •User Friendly Access •Portable
Automation and Technology - Datafinder
•Integrated Viewing
•Data Delivery
•User Friendly Access
•Portable
Integration Strategy Triage • Capability to identify high priority material • Simplification of file
Integration Strategy
Triage
• Capability to identify high priority material
• Simplification of file differentiation
Metadata Collection
• Reduce focus on manual entry
• Create extensive unbiased collection
Tier 2 repositories
• Data can have context and without being
fully processed
• Accessible from anywhere in virtual MDM
Integration Strategy Tier 1 Tier 2 Manual processing Automated processing Manually reviewed documentation No manual
Integration Strategy
Tier 1
Tier 2
Manual processing
Automated processing
Manually reviewed
documentation
No manual review
Structured repository with
access and search
capability
Bulk repository with
access and search
capability
Specific metadata
manually identified,
verified and captured
Automated indexing and
tagging based on
Gazetteer content
Integration Schematic Tier 1 PPDM eDios R_XX Triage Index & Data DM Scan Staging PPDM
Integration Schematic
Tier 1
PPDM
eDios
R_XX
Triage
Index &
Data
DM
Scan
Staging
PPDM
MDM
Tier 2
Integration Strategy Augmentation of Facilitated Process • Prefiltering and selection to reduce effort to allocate
Integration Strategy
Augmentation of Facilitated Process
• Prefiltering and selection to reduce effort
to allocate assets (ie. Wells and Areas)
• Automating classification process with by
including nongeographic tagging
• Extract key words to summarize content to
eliminate need to read all content
Setting the Stage Pilot Project Goals • Install appliance and access test data • Successfully
Setting the Stage
Pilot Project Goals
• Install appliance and access test data
• Successfully process test data set with
default configuration
• Add MDM data and observe difference in
results and performance
• Add nongeographic tagging and observe
difference in results and performance
• Evaluate accuracy of results
Results Controlled Test Parameters • 60 524 files in a typical dataset 867 GB •
Results
Controlled Test Parameters
60 524 files in a typical dataset 867 GB
Unix file system
Mix of structured and unstructured data
Totals
Default
+MDM
+MDM +Nongeo
Files
60
524
60
524
60
524
Time (hrs)
4
4.07
4.62
Processed
28
500
28
500
28
500
Tagged
3650
12
500
14
100
Hit Rate
1 in 7.9
1 in 2.3
1 in 2.0
The Bottom Line Quantifiable Targets • Current avg manual processing time 7 min • Target
The Bottom Line
Quantifiable Targets
• Current avg manual processing time 7 min
• Target = 50% reduction (3.5min/doc)
• Potential 822.5 hr reduction
• Full target 1097.25 (65% tagged)
Not Directly Quantifiable
• Searchable access to all documents
• All metadata extracted in one pass
• Ability to target data management
activities
Next Steps • Develop and execute technical and business integration plan • Identify methods to
Next Steps
• Develop and execute technical and
business integration plan
• Identify methods to improve tagging
accuracy
• Refine MDM foundation to add more
content to Gazetteer
• Identify priority unstructured data stores for
processing and integration
• Implementing nongeographic tagging
Long Term Targets • Min. 65% of readable files tagged • Min. 50% reduction in
Long Term Targets
• Min. 65% of readable files tagged
• Min. 50% reduction in processing time
• Integration of nongeological MDM stores
• Custom nongeographic tagging module to
assist in taxonomic classification
Conclusion • Unstructured E&P data forms the majority of the volume and decision making impact.
Conclusion
• Unstructured E&P data forms the majority of
the volume and decision making impact.
• Current methods of management are
inadequate and recourse intensive
• Early results from combining new practices
with updated technologies look promising.
• Structured data MDM solution already
implemented is a tremendous advantage.
References • * The Expanding Digital Universe – A Forecast of Worldwide Information Growth Through
References
• * The Expanding Digital Universe – A Forecast of Worldwide
Information Growth Through 2010; John F. Gantz, et. al. (IDC
Whitepaper, Mar 2007)
• Computer World, Oct 30, 2006, “Where Size is Opportunity,” pg.
22
• ** The Diverse and Exploding Digital Universe – An Updated
Forecast of Worldwide Information Growth Through 2011; John
F. Gantz, et. al. (IDC Whitepaper, Mar 2008)