
A New Strategy for Integrating E&P Unstructured Data
Lonnie Chin (Talisman Energy)
Terry Gardner (Schlumberger Information Systems)

PPDM 2009 Fall Conference


Expansion of The Digital Universe
[Figure: bar chart of the size of the digital universe in exabytes, 2006 to 2011. Vertical axis 0 to 1800 exabytes. Data labels in ascending order: 121, 281, 410, 670, 988, 1610.]
What the $#@!% is a Zettabyte!

2011 - 1.61 Zettabytes!
1 000 000 000 000 000 000 000 bytes
zetta exa peta tera giga mega kilo

1 118 060 000 000 000 - 3.5 inch disks
2 300 000 000 000 - 700 MB CDs
171 277 000 000 - 9.4 GB DVDs
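
The media counts above can be checked with a few lines of arithmetic. This is a minimal sketch, assuming decimal units and a 1.44 MB capacity for the 3.5 inch disk (the slide gives no floppy capacity; 1.44 MB is the value that appears to reproduce its figure). The names `media_bytes` and `media_needed` are illustrative only.

```python
# Size of the 2011 digital universe quoted on the slide: 1.61 zettabytes.
ZETTABYTE = 10**21                        # 1 000 000 000 000 000 000 000 bytes
universe_bytes = 161 * ZETTABYTE // 100   # 1.61 ZB as an exact integer

# Media capacities in bytes (decimal units).
# ASSUMPTION: 1.44 MB per 3.5 inch disk; the slide does not state it.
media_bytes = {
    "3.5 inch disk": 1_440_000,
    "700 MB CD": 700_000_000,
    "9.4 GB DVD": 9_400_000_000,
}


def media_needed(total_bytes, capacity_bytes):
    """Number of media units needed to hold total_bytes (not rounded up)."""
    return total_bytes / capacity_bytes


counts = {
    name: media_needed(universe_bytes, capacity)
    for name, capacity in media_bytes.items()
}

# Print each count to six significant digits, the precision used on the slide.
for name, count in counts.items():
    print(f"{name}: {count:.6g}")
```

To six significant digits the results agree with the figures on the slide.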
The Bad News in 0s and 1s

2007 is the first year in which data creation outpaces data storage*
Est. 60-70% of all data is unstructured
Digital Shadow now larger than Digital Footprint**
Forecasted size increase of 10X in 5 years, and the rate is increasing**
Info containers growing 50% faster than data volume**
Finding Meaning

Physical and environmental factors are a reality
Knowledge base and decision making capacity are driven by unstructured data
Individuals create unstructured data, but organizations are responsible for regulation**
A time of Perpetual Info-motion
Volume is not the only cause; volume + variety is the problem
Keys to Success

Understanding
Corporate Responsibility
Structured Foundation
Integration and Business Strategy
Automation/Technology
Why Am I Here?

Problem Statement

Current methods of unstructured data management cannot provide the sophistication or accommodate the volume and variety we are encountering.

If we cannot keep up with today, how do we prepare for the issues of tomorrow?
Know the Players

Unstructured data/information is computerized information that is either not bound by a defined data model or has one that is not easily defined by logical programming.

Traditional Solution:
Read the document, collect and store key metadata that can be searched.
EXTREMELY resource intensive.
Corporate Responsibility

CLEAR Policy
Governance
Defined Stewardship
Education
Enforcement
Structured Foundation

MDM development is also vital for unstructured data management
Wellmaster MDM provides the foundation to impart structure
Consistent taxonomy plays a key role in nongeographic tagging
Wellmaster MDM will also benefit from better unstructured data
Automation and Technology
Developed by In-Q-Tel for CIA antiterrorism efforts
Proprietary natural language processing technology
Reads a document, tags geographic references, and assigns coordinates
Search results are displayed on the map
Click the document and you access it instantly
No one has to read and catalog the document for it to be searchable
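
The idea of tagging a document with coordinates can be illustrated with a plain dictionary lookup. This is only a sketch under stated assumptions: the product described above uses proprietary natural language processing, which this simple exact-name match does not reproduce. The gazetteer contents, the approximate coordinates, and the function name `geotag` are all hypothetical.

```python
import re

# HYPOTHETICAL gazetteer: place name -> approximate (latitude, longitude)
# in decimal degrees. A real gazetteer would be far larger.
GAZETTEER = {
    "Calgary": (51.05, -114.07),
    "Aberdeen": (57.15, -2.09),
    "Houston": (29.76, -95.37),
}


def geotag(text, gazetteer=GAZETTEER):
    """Return (place, lat, lon) for every gazetteer name found in the text.

    Matching is a case-insensitive whole-word search, a stand-in for
    real natural language processing.
    """
    tags = []
    for place, (lat, lon) in gazetteer.items():
        if re.search(rf"\b{re.escape(place)}\b", text, flags=re.IGNORECASE):
            tags.append((place, lat, lon))
    return tags


report = "Well test results were reviewed in Calgary and sent to the Houston office."
tags = geotag(report)
for place, lat, lon in tags:
    print(f"{place}: ({lat}, {lon})")
```

Each tag carries coordinates, so a search front end could plot the document at those points without anyone having cataloged it by hand.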
Know the Players
The Field Life Cycle: Exploration phase; Appraisal phase; Development planning; Production phase; Decommissioning.
Exploration: Hydrocarbon accumulations; Exploration methods and techniques.
Drilling Engineering: Well planning; Rig types and rig selection; Drilling systems and equipment; Site preparation; Drilling techniques; Casing and cementing; Drilling problems; Costs and contracts.
Safety and The Environment: Safety awareness; Safety management systems; Environment; Current environmental concerns.
Reservoir Description: Reservoir geology; Reservoir fluids; Data gathering; Data interpretation.
Volumetric Estimation: Deterministic methods; Expressing uncertainty.
Field Appraisal: The role of appraisal in the field life cycle; Identifying and quantifying sources of uncertainty; Appraisal tools; Expressing reduction of uncertainty; Cost-benefit calculations for appraisal; Practical aspects of appraisal.
Reservoir Dynamic Behaviour: The driving force for production; Reservoir drive mechanisms; Gas reservoirs; Fluid displacement in the reservoir; Reservoir simulation; Estimating the recovery factor; Estimating the production profile; Enhanced oil recovery.
Well Dynamic Behaviour: Estimating the number of development wells; Fluid flow near the wellbore; Horizontal wells; Production testing and bottom hole pressure testing; Tubing performance; Well completions; Artificial lift.
Surface Facilities: Oil and gas processing; Facilities.
Production Operations and Maintenance: Operating and maintenance objectives; Production operations input to the FDP; Maintenance engineering input to the FDP.
Project and Contract Management: Phasing and organisation; Planning and control; Cost estimation and budgets; Reasons for contracting; Types of contract.
Petroleum Economics: Basic principles of development economics; Constructing a project cashflow; Calculating a discounted cashflow; Profitability indicators; Project screening and ranking; Per barrel costs; Sensitivity analysis; Exploration economics.
Managing the Producing Field: Managing the subsurface; Managing the surface facilities; Managing the external factors; Managing the internal factors.
Managing Decline: Infill drilling; Workover activity; Enhanced oil recovery; Production debottlenecking; Incremental development.
Decommissioning: Legislation; Economic lifetime; Decommissioning funding; Decommissioning methods.
Finding Documents in Multiple Repositories

Reality more complicated


What is the Solution?

Specifically for the oil and gas industry
You'd make all of the documents, files, and web content show up on a map

This is what MetaCarta does.


MetaCarta: A Map-Based Search Appliance

eSearch
Appliance, connectors
Automation and Technology

Nongeographic tagging
Example 1
Tagging of documents to geologic time scale
Custom entries in Gazetteer to create links
between keywords and taxonomy
Example 2
Tagging of documents to geographic coordinate
scheme designed for spatial analysis
Custom entries in Gazetteer to create links
between keywords and predefined coordinates
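
Example 1 can be sketched as a lookup from keywords to nodes in a taxonomy. This is an illustration, not the vendor's Gazetteer format, which the slides do not describe. The entry structure, the sample entries, and the function name `tag_geologic_time` are hypothetical. Example 2 would follow the same pattern with predefined coordinates in place of taxonomy paths.

```python
import re

# HYPOTHETICAL custom entries: keyword -> path in a geologic time taxonomy
# (eon, era, period). Only three periods are listed for illustration.
TIME_SCALE_ENTRIES = {
    "Cretaceous": ("Phanerozoic", "Mesozoic", "Cretaceous"),
    "Jurassic": ("Phanerozoic", "Mesozoic", "Jurassic"),
    "Devonian": ("Phanerozoic", "Paleozoic", "Devonian"),
}


def tag_geologic_time(text, entries=TIME_SCALE_ENTRIES):
    """Return the sorted taxonomy paths whose keyword occurs in the text."""
    found = set()
    for keyword, path in entries.items():
        if re.search(rf"\b{re.escape(keyword)}\b", text, flags=re.IGNORECASE):
            found.add(path)
    return sorted(found)


summary = "Core from the Upper Devonian reef and overlying Cretaceous sands."
for path in tag_geologic_time(summary):
    print(" > ".join(path))
```

Because each keyword resolves to a full path, a document tagged "Devonian" can also be found by a search on "Paleozoic".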
Automation and Technology
[Figure labels: Geological, Engineering, Drilling, Geophysical]
Automation and Technology - eDios

Administration Metadata Collection


Integrated Viewing
Loading
Asset Allocation
Records Integration
Classification
Automation and Technology - Datafinder
Integrated Viewing
Data Delivery

User Friendly Access


Portable
Integration Strategy

Triage
Capability to identify high priority material
Simplification of file differentiation
Metadata Collection
Reduce focus on manual entry
Create extensive unbiased collection
Tier 2 repositories
Data can have context without being fully processed
Accessible from anywhere in virtual MDM
Integration Strategy
Tier 1 | Tier 2
Manual processing | Automated processing
Manually reviewed documentation | No manual review
Structured repository with access and search capability | Bulk repository with access and search capability
Specific metadata manually identified, verified and captured | Automated indexing and tagging based on Gazetteer content
Integration Schematic
[Figure: integration schematic. Recoverable labels: Triage, Index & Scan, Data Staging, DM, Tier 1 (eDios, PPDM R_XX), Tier 2, PPDM MDM.]
Integration Strategy

Augmentation of Facilitated Process
Prefiltering and selection to reduce the effort needed to allocate assets (i.e. wells and areas)
Automating the classification process by including nongeographic tagging
Extracting key words to summarize content, eliminating the need to read all content
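
Key word extraction for summarizing content can be sketched as a simple term-frequency count. This is a minimal illustration only: the slides do not say which extraction method is used, and the stopword list and the function name `top_keywords` are assumptions.

```python
import re
from collections import Counter

# ASSUMPTION: a tiny illustrative stopword list; real lists are much longer.
STOPWORDS = {
    "the", "a", "an", "and", "of", "to", "in", "is", "was",
    "for", "on", "with", "at", "by", "show",
}


def top_keywords(text, n=3):
    """Return the n most frequent non-stopword terms, most frequent first."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS and len(w) > 2)
    return [word for word, _ in counts.most_common(n)]


document = (
    "The well was cased and cemented. Casing pressure at the well was logged. "
    "Well logs show the casing is intact."
)
print(top_keywords(document, n=2))
```

A reviewer could scan the extracted terms to decide whether a document needs a full read.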
Setting the Stage

Pilot Project Goals
Install appliance and access test data
Successfully process test data set with default configuration
Add MDM data and observe difference in results and performance
Add nongeographic tagging and observe difference in results and performance
Evaluate accuracy of results
Results
Controlled Test Parameters
60 524 files in a typical dataset (867 GB)
Unix file system
Mix of structured and unstructured data

Totals | Default | +MDM | +MDM +Nongeo
Files | 60 524 | 60 524 | 60 524
Time (hrs) | 4 | 4.07 | 4.62
Processed | 28 500 | 28 500 | 28 500
Tagged | 3650 | 12 500 | 14 100
Hit Rate | 1 in 7.9 | 1 in 2.3 | 1 in 2.0
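
The hit rate row appears to be the number of processed files divided by the number of tagged files. The sketch below recomputes it from the table, assuming that definition (the slide does not state it). With the counts as printed, the default configuration comes out at about 1 in 7.8 rather than 1 in 7.9, which may simply reflect rounding of the counts on the slide.

```python
# Counts taken from the results table.
processed = 28_500
tagged = {"Default": 3_650, "+MDM": 12_500, "+MDM +Nongeo": 14_100}


def hit_rate(processed_files, tagged_files):
    """Processed files per tagged file, i.e. the N in '1 in N'.

    ASSUMPTION: this is how the slide defines hit rate.
    """
    return processed_files / tagged_files


rates = {config: hit_rate(processed, count) for config, count in tagged.items()}

for config, n in rates.items():
    share = tagged[config] / processed
    print(f"{config}: 1 in {n:.1f} ({share:.0%} of processed files tagged)")
```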
The Bottom Line
Quantifiable Targets
Current avg manual processing time: 7 min/doc
Target = 50% reduction (3.5 min/doc)
Potential 822.5 hr reduction
Full target 1097.25 hr (65% tagged)

Not Directly Quantifiable
Searchable access to all documents
All metadata extracted in one pass
Ability to target data management activities
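
The quantifiable targets can be reproduced with a short calculation. This sketch assumes the saving applies once per tagged document, which the slide implies but does not state. Under that assumption the 14 100 documents tagged in the pilot give exactly 822.5 hours. Applying the same formula to 65% of the 28 500 processed files gives about 1080.6 hours, slightly below the 1097.25 hours quoted. The quoted figure corresponds to 18 810 documents, or 66% of processed files, and the slide does not explain the difference.

```python
MANUAL_MIN_PER_DOC = 7.0     # current average manual processing time
TARGET_REDUCTION = 0.50      # target reduction in processing time
SAVED_MIN_PER_DOC = MANUAL_MIN_PER_DOC * TARGET_REDUCTION  # 3.5 min/doc


def hours_saved(tagged_docs):
    """Hours saved if each tagged document saves SAVED_MIN_PER_DOC minutes.

    ASSUMPTION: savings accrue once per tagged document.
    """
    return tagged_docs * SAVED_MIN_PER_DOC / 60


processed = 28_500

pilot_hours = hours_saved(14_100)             # tagged count from the pilot
docs_at_65_percent = processed * 65 // 100    # 65% of processed files
target_hours = hours_saved(docs_at_65_percent)

# Number of documents implied by the 1097.25 hr figure on the slide.
docs_for_slide_figure = 1097.25 * 60 / SAVED_MIN_PER_DOC

print(f"Pilot: {pilot_hours} hr")
print(f"65% tagged: {target_hours} hr for {docs_at_65_percent} docs")
print(f"Slide figure implies {docs_for_slide_figure:.0f} docs "
      f"({docs_for_slide_figure / processed:.0%} of processed files)")
```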
Next Steps

Develop and execute technical and business integration plan
Identify methods to improve tagging accuracy
Refine MDM foundation to add more content to Gazetteer
Identify priority unstructured data stores for processing and integration
Implement nongeographic tagging
Long Term Targets

Min. 65% of readable files tagged
Min. 50% reduction in processing time
Integration of nongeological MDM stores
Custom nongeographic tagging module to assist in taxonomic classification
Conclusion

Unstructured E&P data accounts for the majority of data volume and of decision making impact.
Current methods of management are inadequate and resource intensive.
Early results from combining new practices with updated technologies look promising.
The structured data MDM solution already implemented is a tremendous advantage.
References
* The Expanding Digital Universe: A Forecast of Worldwide Information Growth Through 2010; John F. Gantz, et al. (IDC Whitepaper, Mar 2007)

Computer World, Oct 30, 2006, Where Size is Opportunity, pg. 22

** The Diverse and Exploding Digital Universe: An Updated Forecast of Worldwide Information Growth Through 2011; John F. Gantz, et al. (IDC Whitepaper, Mar 2008)
