PyTrader

This project dives into the world of algorithmic trading by blending classic strategies like SMA and MACD with cutting-edge machine-learning models. From automating data collection to building predictive GRU models, it’s all about finding the edge in market movements.

Features Implemented:

  • Built automated data pipelines for the collection and preprocessing of stock data.

  • Developed a backtesting framework with transaction costs and slippage simulation.

  • Implemented SMA Crossover, Mean Reversion, and MACD trading strategies used as features.

  • Trained Random Forest and GRU models for predicting buy/sell/hold signals.

  • Addressed class imbalance with weighted loss functions and resampling techniques.

  • Evaluated portfolio performance using Sharpe Ratio, CAGR, and Max Drawdown.

  • Visualized portfolio value, trade signals, and key metrics.

Skills Used:

  • Data collection and preprocessing for financial data.

  • Feature engineering using technical indicators (SMA, RSI, MACD, etc.).

  • Algorithm design and backtesting for trading strategies.

  • Machine learning model training (Random Forest, GRU) for market prediction.

  • Performance evaluation using Sharpe Ratio, CAGR, and precision/recall metrics.

  • Data visualization for portfolio performance and trade signals.

Technical Specification:

  • Python

  • Pandas

  • Numpy

  • Scikit-learn

  • Torch

  • Ta

  • Matplotlib

Planning and Execution

The first step in this project was building robust data pipelines to collect and preprocess historical stock data. Ensuring clean, consistent, and feature-rich data was critical, so I focused on handling missing values, scaling, and normalizing the data while generating technical indicators like SMA, RSI, and MACD. This provided a solid foundation for both backtesting strategies and training machine learning models.

Once the data pipelines were in place, I designed a backtesting framework to evaluate trading strategies under realistic market conditions, accounting for transaction costs and slippage. With this framework, I implemented traditional trading strategies such as SMA Crossover and MACD, and later integrated machine learning models like Random Forest and GRU to predict buy, sell, or hold signals. The design was guided by the goal of creating a seamless flow from raw data to actionable insights, with performance metrics and visualizations tying everything together.

Challenges and lessons Learned

  • Class Imbalance: Balancing the dataset for buy, sell, and hold signals was a significant challenge. Addressing this required dynamic class weighting and resampling techniques, which improved model performance for minority classes.

  • Model Interpretability: Understanding why machine learning models made specific predictions was initially difficult. This led to a focus on tracking predictions and visualizing trade signals to better understand model behaviour.

  • Hyperparameter Tuning: Finding the right combination of hyperparameters for the GRU and Random Forest models, such as sequence length, learning rate, and dropout, involved extensive experimentation and analysis.

  • Overfitting: Ensuring the models generalized well across unseen data required techniques like cross-validation, regularization, and early stopping.

  • Data Pipeline Optimization: Building automated pipelines for fetching, cleaning, and preprocessing stock data highlighted the importance of efficient workflows, especially for handling large datasets.

  • Integration of Strategies and ML Models: Combining traditional strategies like SMA and MACD with machine learning predictions presented challenges in ensuring smooth integration and alignment with the backtesting framework.

  • Visualization Complexity: Representing portfolio performance and trade signals in an intuitive and meaningful way required iterative improvements to visualizations.

  • Iterative Learning: Each challenge reinforced the importance of iterative learning—refining code, improving metrics, and understanding the trade-offs between simplicity and complexity in trading systems.

Conclusion

This project was a deep dive into algorithmic trading, combining traditional strategies with machine learning to tackle real-world challenges. While the GRU model struggled to predict buy and sell signals accurately due to class imbalance and data complexity, the process provided valuable insights into working with time-series data, feature engineering, and model evaluation. From building robust data pipelines to designing a backtesting framework, this project was a hands-on journey that sharpened skills in trading strategy development, machine learning, and problem-solving in dynamic environments.