Can Data Science Revolutionize Oil in Bakken Shale?

The integration of data science into the oil industry has the potential to dramatically transform operations, particularly within the Bakken Shale. This rich oil-bearing formation in North Dakota has witnessed significant production increases due to advancements in drilling techniques. Since 2012, the deployment of large multistage completions in horizontal wells has surged oil extraction, making the region a critical player in the United States’ energy landscape. With over 18,000 wells stimulated, there has been extensive experimentation in well spacing and stimulation intensity, aiming for cost efficiency without compromising output. Nonetheless, even large-scale interventions have not consistently yielded expected outcomes, revealing gaps in the understanding of the multifaceted factors contributing to well productivity. As a result, there is an opportunity to leverage data science tools to fine-tune completion strategies.

The North Dakota Industrial Commission has meticulously accumulated extensive completion and production data, accessible for analytical use. When combined with cutting-edge advances in data science, this data becomes a powerful asset in refining drilling practices through statistical analysis and predictive modeling. These methods allow for the extraction of insights from the vast historical data of Bakken wells, offering the potential to further enhance oil recovery rates. This analysis targets completion designs and parameter interactions, emphasizing the need for a comprehensive understanding to optimize production strategies effectively. Examples of data science applications in this context highlight the ways these tools can influence completion decisions, driving the future of oil production within the Bakken Shale.

Completion Design Optimization with Data Science

Predictive modeling has emerged as a cornerstone in optimizing completion designs within the Bakken Shale. An initial study involving data from over 12,000 wells conducted in 2020 demonstrated the complexity of the task, as simple statistical interpretations fell short of capturing the intricate relationships between performance and completion parameters. These nonlinear dependencies necessitate advanced data-mining techniques capable of handling complexity and incomplete data. Among such methods, gradient boosting tools have been employed to derive predictive models, focusing on various completion design parameters. These include the perforated interval, fluid volume, proppant amount, stage count, injection rate, pressure, proppant type, and completion type, which collectively influence well performance, particularly cumulative six-month production.

The preliminary models indicated a phenomenon known as overfitting, wherein the predictive models performed differently for test datasets compared to their training counterparts. This issue arises when a single statistical model is applied to diverse Bakken locations, each with distinct geological and reservoir characteristics. To counteract this, wells were categorized into three groups based on their geographical and productivity distinctions, and separate predictive models were constructed for each. This stratification allowed for more accurate predictions by aligning completion strategies with localized geological influences. The analyses revealed total proppant and fluid volumes as primary influencers of well performance across the subareas, albeit with variations in optimal configurations depending on local geological conditions.

Geology-Driven Well Classification and Analysis

Beyond initial modeling efforts, incorporating geological factors into completion analysis offers a more holistic optimization methodology. Cluster analysis, leveraging data from 14,700 wells, utilized publicly available geological and geochemical information to classify wells into subareas with similar characteristics. Parameters such as formation depth, thickness, temperature, pressure gradient, porosity, and permeability were considered alongside geochemical data, like the hydrogen index, Tmax, total organic carbon, and bulk oil and gas properties. The K-means clustering algorithm facilitated the grouping of these wells, leading to well-defined subareas with minimal overlap, which were consistent with recognized geological trends in the Bakken.

The stratified analysis enhanced well evaluation capabilities, extending into broader drilling spacing unit (DSU) optimizations. By aggregating completion and production metrics and integrating well spacing and count, the evaluations accounted for wells’ interaction within DSUs. Essentially, this approach provided a comprehensive understanding of how different completion strategies impacted oil production, all anchored in years of operational data. The statistical methods, although not physically modeling the processes, effectively distilled insights from historical performance, guiding completion decisions that considered both technological similarities and geological variance.

Evolving Strategies and Future Perspectives

The evolution of completion technologies within the Bakken, from 2005 onward, underscores an ongoing journey toward optimization. Technological advances have included increased stimulation intensity, extended lateral sections, diverters, and recompletion strategies, each contributing to more efficient oil production. To ensure effective optimization, datasets must reflect consistent technological applications, often facilitated by filtering data according to completion year. Notably, 2012 and 2017 stand out as milestone years representing substantial technological shifts, serving as benchmarks for evaluating progress.

Looking ahead, partial dependence plots used in optimization can be advanced by integrating visualizations like heat maps or 3D charts. These tools can depict the interplay of multiple parameters on production outcomes, providing a richer analytical framework. Moreover, a shift from static parameter values in partial plots to sensitivity analyses using simulated datasets can illustrate broader scenario ranges. Given the dynamics of unconventional oil development, both the geological framework and evolving completion techniques will continually inform adaptive strategies, maximizing recovery in a sustainable manner.

Insights for Future Exploration

Integrating data science into the oil industry holds tremendous potential, especially for the Bakken Shale in North Dakota. This oil-rich area has experienced significant boosts in production, largely due to advancements in drilling technologies like large multistage completions in horizontal wells. Since 2012, such technologies have elevated oil extraction, establishing the region as a key player in America’s energy sector. Despite over 18,000 wells being stimulated and extensive efforts on well spacing and stimulation intensity aimed at cost efficiency, the results haven’t always met expectations. This inconsistency highlights gaps in understanding all the diverse factors affecting well productivity. Consequently, data science tools offer an opportunity to refine completion strategies.

The North Dakota Industrial Commission has collected comprehensive completion and production data, which is ripe for analysis. By applying modern data science techniques, this data can be transformed into a valuable resource for improving drilling practices. Methods such as statistical analysis and predictive modeling enable insightful extraction from the historical data of Bakken wells, which can lead to enhanced oil recovery rates. The focus is on understanding completion designs and parameter interactions to optimize production processes efficiently. Data science applications in this realm showcase how these tools can shape completion decisions, steering the future of oil production in the Bakken Shale forward effectively.

Subscribe to our weekly news digest.

Join now and become a part of our fast-growing community.

Invalid Email Address
Thanks for Subscribing!
We'll be sending you our best soon!
Something went wrong, please try again later