Precision meets Intelligence

FinSightAI: Small Models. Big Financial Insight.

Fine-tuning SmolLM2 with QLoRA to redefine domain-specific reasoning in finance.

FinSightAI was engineered to bring financial literacy and analytical depth to smaller, efficient models. Through QLoRA fine-tuning, it bridges the gap between performance and practicality.

Designed for advisory systems, report summarization, and real-time financial chat agents, FinSightAI proves that you don't need massive compute to achieve massive insight.

SmolLM2 · QLoRA · PEFT · UnSloth
Figure: FinSightAI radar chart
Model Efficiency

Precision without the overhead

FinSightAI achieves enterprise-grade reasoning on consumer hardware.

Efficient Fine-tuning

Fine-tuned on an RTX 3050 GPU with just 6GB of VRAM, completed within 8 hours.

Parameter Efficiency

Only 280MB of additional storage through LoRA adapters.

Precision Boost

Improves on base SmolLM2's BLEU score by up to 135% (relative gain).

Inference Speed

Retains near-identical latency to base model during inference.
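The small adapter footprint follows from how LoRA works: each adapted weight matrix of shape d_out × d_in gains two low-rank factors holding rank × (d_in + d_out) parameters in total. The sketch below shows that arithmetic only. The layer shapes, layer count, rank, and set of adapted projections are hypothetical placeholders, not FinSightAI's actual configuration, so the result is not meant to reproduce the storage figure above.

```python
def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameters added by one LoRA adapter: A is (rank x d_in), B is (d_out x rank)."""
    return rank * (d_in + d_out)


def adapter_size_mb(shapes, n_layers: int, rank: int, bytes_per_param: int = 4) -> float:
    """Total adapter size in MB when every listed projection in every layer is adapted."""
    per_layer = sum(lora_params(d_in, d_out, rank) for d_in, d_out in shapes)
    return per_layer * n_layers * bytes_per_param / 1e6


# Hypothetical dimensions for illustration (assumptions, not taken from the project).
HIDDEN, MLP, LAYERS, RANK = 2048, 8192, 24, 16

# Four attention projections plus three MLP projections per layer (also an assumption).
SHAPES = [(HIDDEN, HIDDEN)] * 4 + [(HIDDEN, MLP)] * 2 + [(MLP, HIDDEN)]

if __name__ == "__main__":
    size = adapter_size_mb(SHAPES, LAYERS, RANK)
    print(f"rank={RANK}: about {size:.1f} MB of fp32 adapter weights")
```

Adapter size grows linearly with rank and with the number of adapted projections, so those two choices largely determine how much storage a LoRA checkpoint needs.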

Architecture

Technical Snapshot

A breakdown of the QLoRA fine-tuning pipeline.

Base Model

SmolLM2-1.7B-Instruct

Fine-tuning Method

QLoRA + PEFT + UnSloth

Quantization Format

4-bit NF4 (NormalFloat)

Training Hardware

NVIDIA RTX 3050 (6GB VRAM)

Training Time

~16 hours

Data Tokens

50M+ tokens from 30K+ conversations
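To give a feel for what 4-bit NF4 storage means, here is a minimal pure-Python sketch of blockwise absmax quantization against a 16-level NormalFloat code book. The level values are rounded approximations of the published NF4 code book, and the helper names are made up for illustration. Real QLoRA training relies on optimized library kernels rather than anything like this.

```python
# Approximate NF4 code book (rounded): 16 levels in [-1, 1], spaced more densely near zero.
NF4_LEVELS = [
    -1.0, -0.6962, -0.5251, -0.3949, -0.2844, -0.1848, -0.0911, 0.0,
    0.0796, 0.1609, 0.2461, 0.3379, 0.4407, 0.5626, 0.7230, 1.0,
]


def quantize_block(weights):
    """Return (absmax scale, 4-bit indices) for one block of weights."""
    scale = max(abs(w) for w in weights) or 1.0  # avoid dividing by zero on an all-zero block
    indices = [
        min(range(16), key=lambda i: abs(NF4_LEVELS[i] - w / scale))
        for w in weights
    ]
    return scale, indices


def dequantize_block(scale, indices):
    """Map 4-bit indices back to approximate weight values."""
    return [NF4_LEVELS[i] * scale for i in indices]


def weight_memory_gb(n_params: float, bits_per_param: float) -> float:
    """Memory for the weights alone, ignoring scales, activations, and optimizer state."""
    return n_params * bits_per_param / 8 / 1e9


if __name__ == "__main__":
    scale, idx = quantize_block([0.5, -0.25, 0.0, 0.1])
    print(scale, idx, dequantize_block(scale, idx))
    print(weight_memory_gb(1.7e9, 16), weight_memory_gb(1.7e9, 4))
```

For a 1.7B-parameter model, the weights alone take roughly 3.4 GB at 16 bits and roughly 0.85 GB at 4 bits, which is what leaves room for adapters, activations, and optimizer state on a 6GB card.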

Performance

Metrics that speak for themselves

Quantitative improvements across BLEU, ROUGE, and response quality.
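For readers unfamiliar with BLEU, the sketch below shows a simplified sentence-level variant (clipped unigram and bigram precision with a brevity penalty) and how a relative improvement between two scores is computed. It is an illustration only, not the project's evaluation script. The example sentences and the two scores passed to `relative_improvement` are hypothetical, chosen to show what a 135% relative gain looks like.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """All contiguous n-grams in a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, reference, n):
    """Share of candidate n-grams found in the reference, with counts clipped."""
    cand = Counter(ngrams(candidate, n))
    ref = Counter(ngrams(reference, n))
    total = sum(cand.values())
    if total == 0:
        return 0.0
    clipped = sum(min(count, ref[gram]) for gram, count in cand.items())
    return clipped / total


def simple_bleu(candidate, reference, max_n=2):
    """Simplified BLEU: geometric mean of 1..max_n precisions times a brevity penalty."""
    precisions = [modified_precision(candidate, reference, n) for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        return 0.0
    log_mean = sum(math.log(p) for p in precisions) / max_n
    if len(candidate) > len(reference):
        penalty = 1.0
    else:
        penalty = math.exp(1 - len(reference) / len(candidate))
    return penalty * math.exp(log_mean)


def relative_improvement(base_score, tuned_score):
    """Percentage gain of the tuned score over the base score."""
    return (tuned_score - base_score) / base_score * 100.0


if __name__ == "__main__":
    reference = "the ratio divides share price by earnings per share".split()
    candidate = "the ratio divides price by earnings".split()
    print(round(simple_bleu(candidate, reference), 3))
    # Hypothetical scores: a tuned score 2.35x the base score is a 135% relative gain.
    print(round(relative_improvement(0.10, 0.235), 1))
```

A relative gain above 100% means the fine-tuned score is more than double the baseline, which is a different statement from a gain in absolute BLEU points.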

Overall Metric Distribution

Multi-dimensional radar view showing improvement across all evaluation metrics.

Live Comparison

Base Model vs. FinSightAI

Watch the fine-tuned model reason through a real financial question.

Question

Explain the price-to-earnings (P/E) ratio and how investors use it to evaluate stocks.

Base Model (SmolLM2)
The price-to-earnings ratio is a metric. It is calculated by dividing stock price by earnings. Investors look at it. Higher P/E can mean growth. Lower P/E can mean value. It depends on the industry.
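The ratio in the question above comes down to one division, sketched here together with its inverse, the earnings yield. The function names and the sample price and earnings figures are hypothetical, and the code is not part of FinSightAI.

```python
def pe_ratio(price_per_share: float, earnings_per_share: float) -> float:
    """Price-to-earnings ratio: what investors pay per unit of annual earnings."""
    if earnings_per_share <= 0:
        raise ValueError("P/E is not meaningful for zero or negative earnings")
    return price_per_share / earnings_per_share


def earnings_yield_pct(price_per_share: float, earnings_per_share: float) -> float:
    """Inverse of P/E, expressed as a percentage of the share price."""
    return earnings_per_share / price_per_share * 100.0


if __name__ == "__main__":
    # Hypothetical stock: $50.00 per share with $2.50 earnings per share.
    print(pe_ratio(50.0, 2.5))            # 20.0
    print(earnings_yield_pct(50.0, 2.5))  # about 5 percent
```

A P/E of 20 means investors pay 20 units of currency for each unit of annual earnings. Whether that is high or low depends on growth expectations and on what peers in the same industry trade at.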
Reflection

Why FinSightAI matters

A statement that precision doesn't need scale.

"FinSightAI is the first compact model we trust for compliance reviews. It hits enterprise accuracy while running on a single 3050 laptop—our analysts ship insights faster without waiting for the big cluster."

Nia Mensim — Director of Quant Innovation, Helix Capital