Poster
in
Workshop: Advances in Financial AI: Opportunities, Innovations, and Responsible AI
AlphaQuant: LLM-Driven Automated Robust Feature Engineering for Quantitative Finance
Kamer Yuksel · Hassan Sawaf
Feature engineering is critical to predictive modeling, transforming raw data into meaningful features that enhance performance. However, traditional feature engineering is labor-intensive and prone to biases, while automated methods often lack robustness and interpretability. This paper introduces a novel framework that combines large language models (LLMs) with evolutionary optimization to automate robust feature discovery. The framework integrates LLMs for domain specific feature generation and a rigorous evaluation loop using machine-learning models hyper-tuned with time-series cross-validation on historical asset performance to ensure their robustness. The key contributions: (1) an LLM-powered system for generating domain-relevant, interpretable feature extraction functions, (2) an evolutionary illumination process that iteratively refines the feature-set based on importance scores from hyper-tuned models, and (3) empirical validation on financial data demonstrating significant improvements in predictive accuracy and feature robustness. The results highlight the potential of LLMs to revolutionize feature engineering, paving the way for interpretable machine-learning models. The set of discovered features is open-sourced for the reproducibility of results.