Predicting Bitcoin Prices: A Data-Driven Approach to Crypto Trends 🚀📈 | by Si Thu Aung

Bitcoin, the world’s main cryptocurrency, has proven excessive volatility through the years. Traders and analysts continually search methods to foretell value actions utilizing data-driven approaches. On this article, we’ll discover analyze Bitcoin value developments utilizing Exploratory Information Evaluation (EDA), statistical strategies, and a easy predictive mannequin.

We’ll use Python and the Bitcoin Value Dataset from Kaggle to cowl historic value actions. By the top, you’ll perceive key insights about Bitcoin’s value developments and learn to apply predictive modeling.

Step 1: Load the Dataset

First, obtain the dataset from Kaggle and cargo it into Python.
Dataset Hyperlink: Click here.

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns# Load the dataset
df = pd.read_csv("bitcoin_2017_to_2023.csv")
# Show first few rows
df.head()

Widespread columns in Bitcoin value datasets:

Date/Time: The timestamp of the recorded value.
Open, Excessive, Low, Shut (OHLC): Costs throughout completely different time frames.
Quantity: Whole variety of transactions in a given interval.

Step 2: Information Understanding and Cleansing

Earlier than diving into the evaluation, let’s discover the dataset.

# Examine for lacking values
df.data()


RangeIndex: 3126000 entries, 0 to 3125999
Information columns (complete 10 columns):
#   Column                        Dtype  
---  ------                        -----  
0   timestamp                     object 
1   open                          float64
2   excessive                          float64
3   low                           float64
4   shut                         float64
5   quantity                        float64
6   quote_asset_volume            float64
7   number_of_trades              int64  
8   taker_buy_base_asset_volume   float64
9   taker_buy_quote_asset_volume  float64
dtypes: float64(8), int64(1), object(1)
reminiscence utilization: 238.5+ MB

# Abstract statistics
df.describe()

Dealing with Lacking Values

We fill lacking values utilizing forward-fill to take care of knowledge continuity.

# Convert date column to datetime format
df['timestamp'] = pd.to_datetime(df['timestamp'])# Fill lacking values
df.fillna(technique='ffill', inplace=True)

Step 3: Exploratory Information Evaluation (EDA)

Visualizing Bitcoin Value Tendencies

plt.determine(figsize=(12, 6))
plt.plot(df['timestamp'], df['close'], label="Closing Value", colour='blue')
plt.xlabel("Date")
plt.ylabel("Value (USD)")
plt.title("Bitcoin Value Over Time")
plt.legend()
plt.present()

Every day Returns Distribution

df['Daily Return'] = df['close'].pct_change()
plt.determine(figsize=(8, 5))
df['Daily Return'].hist(bins=50, alpha=0.7)
plt.title("Distribution of Bitcoin Every day Returns")
plt.xlabel("Every day Return")
plt.ylabel("Frequency")
plt.present()

Step 4: Shifting Averages for Pattern Evaluation

Shifting averages clean out value fluctuations to determine developments.

# Calculate Shifting Averages
df['MA_50'] = df['close'].rolling(window=50).imply()
df['MA_200'] = df['close'].rolling(window=200).imply()plt.determine(figsize=(12, 6))
plt.plot(df['timestamp'], df['close'], label="Closing Value", colour='blue')
plt.plot(df['timestamp'], df['MA_50'], label="50-day MA", colour='pink')
plt.plot(df['timestamp'], df['MA_200'], label="200-day MA", colour='inexperienced')
plt.xlabel("Date")
plt.ylabel("Value (USD)")
plt.title("Bitcoin Value with Shifting Averages")
plt.legend()
plt.present()

Step 5: Correlation Evaluation

Let’s study the relationships between value and quantity.

plt.determine(figsize=(8, 6))
sns.heatmap(df.corr(), annot=True, cmap="coolwarm", fmt=".2f")
plt.title("Correlation Heatmap")
plt.present()

Step 6: Easy Predictive Mannequin

We’ll use Linear Regression to foretell Bitcoin’s value primarily based on historic developments.

from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression# Convert dates to numeric days since begin
df['Days'] = (df['timestamp'] - df['timestamp'].min()).dt.days
X = df[['Days']]
y = df['close']
# Practice-test break up
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
# Practice the mannequin
mannequin = LinearRegression()
mannequin.match(X_train, y_train)
# Predict
y_pred = mannequin.predict(X_test)
# Plot Predictions
plt.determine(figsize=(10, 6))
plt.scatter(X_test, y_test, colour='blue', label="Precise Costs")
plt.plot(X_test, y_pred, colour='pink', label="Predicted Costs")
plt.xlabel("Days since begin")
plt.ylabel("Bitcoin Value (USD)")
plt.title("Bitcoin Value Prediction utilizing Linear Regression")
plt.legend()
plt.present()

sns.displot((y_test-y_pred), kde=True)

from sklearn import metrics
print('MAE ', metrics.mean_absolute_error(y_test, y_pred))
print('MSE ', metrics.mean_squared_error(y_test, y_pred))
print('RMSE ', np.sqrt(metrics.mean_squared_error(y_test, y_pred)))

MAE  9812.94312732686
MSE  156391499.1181766
RMSE  12505.658683898926

On this article, we:

Loaded and cleaned Bitcoin value knowledge
Explored historic value developments utilizing EDA
Analyzed correlations and transferring averages
Constructed a easy predictive mannequin utilizing Linear Regression

The mannequin demonstrates appreciable room for enchancment. The comparatively excessive MAE, MSE, and RMSE values recommend that the mannequin is struggling to foretell Bitcoin’s value with excessive accuracy. It could be helpful to strive extra superior fashions like a classical time sequence mannequin ARIMA (AutoRegressive Built-in Shifting Common) or a deep studying mannequin LSTM (Lengthy Brief-Time period Reminiscence) higher to seize the temporal patterns inherent in Bitcoin value actions.

For those who discovered this insightful, share your ideas within the feedback beneath! 📝

GitHub Hyperlink: Click here!

Source link

Meanwhile in Europe: How We Learned to Stop Worrying and Love the AI Angst | by Andreas Maier | Jul, 2025

Handling Big Git Repos in AI Development | by Rajarshi Karmakar | Jul, 2025

A Technical Overview of the Attention Mechanism in Deep Learning | by Silva.f.francis | Jun, 2025

How Smart Entrepreneurs Turn Mid-Year Tax Reviews Into Long-Term Financial Wins

I Tried Buying a Car Through Amazon: Here Are the Pros, Cons

Amazon and eBay to pay ‘fair share’ for e-waste recycling

Artificial Intelligence Concerns & Predictions For 2025

Barbara Corcoran: Entrepreneurs Must ‘Embrace Change’

Most Popular

2026 Will Be The Year of Data + AI Observability

How to get people to eat more veggies? Add meat

The Future of Automation: How AI and ML Solutions Simplify Complex Tasks | by Noseberry Digitals | Jun, 2025

Our Picks