Poster
in
Workshop: Advances in Financial AI: Opportunities, Innovations, and Responsible AI

AlphaQuant: LLM-Driven Automated Robust Feature Engineering for Quantitative Finance

Kamer Yuksel · Hassan Sawaf

Project Page [ OpenReview]

Abstract

Feature engineering is critical to predictive modeling, transforming raw data into meaningful features that enhance performance. However, traditional feature engineering is labor-intensive and prone to biases, while automated methods often lack robustness and interpretability. This paper introduces a novel framework that combines large language models (LLMs) with evolutionary optimization to automate robust feature discovery. The framework integrates LLMs for domain specific feature generation and a rigorous evaluation loop using machine-learning models hyper-tuned with time-series cross-validation on historical asset performance to ensure their robustness. The key contributions: (1) an LLM-powered system for generating domain-relevant, interpretable feature extraction functions, (2) an evolutionary illumination process that iteratively refines the feature-set based on importance scores from hyper-tuned models, and (3) empirical validation on financial data demonstrating significant improvements in predictive accuracy and feature robustness. The results highlight the potential of LLMs to revolutionize feature engineering, paving the way for interpretable machine-learning models. The set of discovered features is open-sourced for the reproducibility of results.

Chat is not available.