Vadim's blog | Vadim's blog

Slaying Bullish Bias - A Market Wizards Playbook

May 19, 2025 · 8 min read

Senior Software Engineer at Vitrifi

“The markets are never wrong; opinions often are.”
—Jesse Livermore (quoted by Bruce Kovner in Market Wizards)

2025 is a cognitive trap for equity bulls. The Ukraine front barely moves, President Trump’s blanket 10 % tariff rattles importers, and German GDP just printed –0.6 % QoQ—yet the S&P 500 hovers north of 5 500.
If that disconnect feels comfortable, your built-in bullish bias (the reflex that “prices should rise”) is probably steering the wheel.

Below you’ll find the fully annotated 30-question audit that the original Market Wizards might run if they sat at your terminal today. Each line now includes:

Wizard Insight – the lesson Schwager’s interviewees hammered home.
2025 Angle – why the trap is live right now.
Real-World Example – an actual 2025 tape or trade vignette.

Paste the checklist into your trading journal, sprint through one block per week, and watch your P/L detach from hope-fuelled drift.

1 Self-Diagnosis & Mind-Set

#	Question	Wizard Insight	2025 Angle	Real-World Example
1	Do you scan for longs first?	Mark Cook forced students to open a bearish filter before coffee.	All major U.S. broker dashboards open on “Top Gainers.”	11 Mar 2025: NVDA +6 % headlined your grid; bottom losers list showed LUMN –13 % (a better 2-R short you never saw).
2	5 % drop—curiosity or dip euphoria?	Paul Tudor Jones cut leverage 50 % within minutes on 19 Oct 1987.	15 Mar 2025: SPX –5.1 %, VIX 34 → index kept sliding another –2 % before basing.	You felt “great entry” and bought QQQ, stopped out –1 R next day.
3	Does shorting feel “un-American”?	Tom Baldwin joked “The pits only cheer the upside.”	Media framed every 2024 sell-off as “unpatriotic betting.”	You posted a bearish tweet on Apple and got piled-on for “fighting innovation.”
4	Dips = noise, rallies = trends?	Ed Seykota logged only % risk and ATR multiples—no adjectives.	CNBC still calls –2 % a “slump” but +2 % a “rally.”	23 Apr 2025 journal: “just a blip lower” (SPX –1.8 %), “solid up-trend” (+1.6 %).
5	Is self-worth tied to rising curves?	Seykota kept family money in T-Bills.	Real college costs +6 % YoY; equity drift no longer guarantees coverage.	You increased size after your kid’s tuition invoice hit inbox.

2 Historical Perspective & Narrative Traps

#	Question	Wizard Insight	2025 Angle	Real-World Example
6	How did you fare in each mini-crash?	Jones was green in ’87; Raschke flat in ’98.	2022 bear (–27 %) still on broker statement.	Your 2022 curve: –18 % vs CTA index +13 %.
7	Tested your edge with drift = 0?	Seykota’s systems worked on pork bellies—no drift.	Forward SPX drift est. < 4 %.	Your momentum back-test Sharpe fell from 1.2 ➜ 0.48.
8	Rely on “Don’t bet against America”?	Kovner warns empires rotate.	Proposed 2 % buy-back tax in House bill HR-1735.	Removing buy-backs in DCF knocked 7 turns off Apple PE.
9	Ignoring survivorship in Wizard lore?	Schwager himself says thousands blew up.	TikTok “profit porn” hides losers.	Your Telegram group shares only green screenshots.
10	Studied markets that never bounced?	Japanese believers held Nikkei bags for 34 yrs.	Greek ASE –85 % from ’07 peak even now.	Your Europe ETF overweight assumes 7 % CAGR.

3 Quantitative Evidence

#	Question	Wizard Insight	2025 Angle	Real-World Example
11	Shorts share of tickets & P/L?	Cook: “Trade both sides or half your vision is gone.”	Q1-25 had strongest 3-day down-impulse since Covid lows.	9 shorts out of 112 trades; net P/L –2 R.
12	Invert your long signal—result?	Seykota’s “whipsaw song” works both ways.	High-short-interest anomaly revived with expensive rates.	Inverted signal on same universe scored Sharpe 0.32.
13	Price vs log-return testing?	Wizards think in % risk.	Nasdaq 100 raw-point rise masks compounding.	Strategy CAGR fell from 18 % ➜ 11 % in log mode.
14	Stop symmetry?	Raschke: 2 ATR both sides.	Meme squeezes tempt 1 ATR shorts, 3 ATR longs.	Last month: 6 short stop-outs at –1 ATR, 2 long at –3 ATR.
15	Monte-Carlo μ = 0 survival?	Jones funds vol desks to weather drift drought.	Commodity volatility doubles path risk now.	10 000 paths: median curve flatlines by month 22.

4 Risk & Capital Allocation

#	Question	Wizard Insight	2025 Angle	Real-World Example
16	Exposure cap symmetric?	Seykota could flip net ±200 %.	Short borrow fees sub-1 % for 80 % of S&P names.	You allow +150 % long, –25 % short.
17	Averaging down losers?	Kovner: “Losers average losers.”	AI chip names drop 18 % intraday regularly.	Added twice to AMD at –3 % and –6 %; closed –2 R.
18	Cover shorts first in vol spikes?	Tudor held shorts through crash until vol bled.	Post-VIX-34 drift negative for 12 sessions.	Closed TSLA short on spike, kept long tech—lost 1.4 R.
19	Put hedge value?	Jones buys vol only when skew cheap.	1-month ATM put cost 1.8 % in Mar 2025.	Last year: spent 3.4 R in premium, saved 1.1 R in crashes.
20	Squeezes breach worst-case loss?	Baldwin sized by dollar vol.	Feb 2025 GME +40 % gap.	Short lost 2.3 R overnight.

5 Process & Decision Architecture

#	Question	Wizard Insight	2025 Angle	Real-World Example
21	UI bias toward gainers?	Seykota coded neutral dash.	Broker UIs show green first.	Missed FSLY –12 % fail because list buried.
22	Short checklist depth?	Raschke rehearses shorts like longs.	Easier borrows post-reg changes.	Long checklist 12 items; short only 5.
23	Narrative only for shorts?	Wizards trust price.	News calls every dip an “overreaction.”	Skipped META short for lack of “fundamental story”; missed –8 %.
24	Post-mortem balance?	Cook logs every miss.	Feb 2025: three perfect failed-break short signals unreviewed.	Reviewed 7 missed longs, 0 shorts.
25	Auto-flip after failed breakout?	“Failed move = fast move” —Soros.	AI names fake breakouts weekly.	Long NVDA fake-out –1 R, no flip; price dropped another 4 %.

6 Psychology & Continuous Improvement

#	Question	Wizard Insight	2025 Angle	Real-World Example
26	Bias tags clustering on longs?	Jones hired risk coach.	AI tools auto-tag sentiment now.	65 % optimism tags on long entries, 15 % on shorts.
27	Real-time beta alerts?	Tudor’s board lit red at β > 0.7.	Slack webhooks trivial.	Hit 0.78 beta on 9 Apr, noticed next day.
28	Gap-down rehearsal?	Basso ran crash sims monthly.	Turkey ETF gap –12 % overnight, Feb 2025.	Panicked exit + slippage –1 R; never rehearsed scenario.
29	Forced-flat longs feeling?	Seykota welcomes dry powder.	Broker outage flushed longs 14 Jan.	Felt panic → identity fusion with bull thesis.
30	Preparing for lower drift?	Wizards add new edges.	Demographics & reshoring compress margins.	Equity CAGR model still at 8 %.

7 Wrap-Up

Bullish bias survives because it pays most of the time—until it erases years of gains in a single macro grenade.
The Market Wizards neutralised the bias through symmetry: equal screens, stops, reviews, and above all, equal respect for up and down tape.

Run this playbook once per quarter:

Audit each question honestly.
Patch the weakest habit or policy.
Re-test your edge in a zero-drift simulation.

Do that, and the next tariff volley, energy spike, or AI bubble unwind becomes just another tradeable regime—not a career-ending ambush.

Happy (bias-free) trading!

Contributing a Safer MarketIfTouchedOrder to Nautilus Trader — Hardening Conditional Orders in Rust

May 3, 2025 · 3 min read

Vadim Nicolai

Senior Software Engineer at Vitrifi

TL;DR – PR #2577 introduces a fallible constructor, complete domain-level checks, and four focussed tests for MarketIfTouchedOrder, thereby closing long-standing Issue #2529 on order-validation consistency.

1 Background

MarketIfTouchedOrder (MIT) is effectively the reverse of a stop-market order: it lies dormant until price touches a trigger, then fires as an immediate market order.
Because a latent trigger feeds straight into an instant fill path, robust validation is non-negotiable—any silent mismatch becomes a live trade.

2 Why the Change Was Necessary

Problem	Impact
Partial positivity checks on `quantity`, `trigger_price`, `display_qty`	Invalid values propagated deep into matching engines before exploding
`TimeInForce::Gtd` accepted `expire_time = None`	Programmer thought they had “good-til-date”; engine treated it as GTC
No check that `display_qty ≤ quantity`	Iceberg slice could exceed total size, leaking full inventory
Legacy `new` API only panicked	Call-site couldn’t surface errors cleanly

Issue #2529 demanded uniform, fail-fast checks across all order types; MIT was first in line.

3 What PR #2577 Delivers

Area	Before (`v0`)	After (`v1`)
Constructor	`new` → panic on error	`new_checked` → `anyhow::Result<Self>`; `new` now wraps it
Positivity checks	Partial	Guaranteed for `quantity`, `trigger_price`, (optional) `display_qty`
GTD orders	`expire_time` optional	Required when `TIF == GTD`
Iceberg rule	None	`display_qty ≤ quantity`
Error channel	Opaque panics	Precise `anyhow::Error` variants
Tests	0	4 rstest cases (happy-path + 3 failure modes)

Diff stats: +159 / −13 – one file crates/model/src/orders/market_if_touched.rs.

4 File Walkthrough Highlights

new_checked – all domain guards live here; returns Result.
Guard helpers – re-uses check_positive_quantity, check_positive_price, check_predicate_false.
Legacy compatibility – new() simply calls Self::new_checked(...).expect(FAILED).
apply() tweak – slippage is recomputed immediately after a fill event.
Tests – ok, quantity_zero, gtd_without_expire, display_qty_gt_quantity.

6 Order-Lifecycle Diagram

7 Using the New API

let mit = MarketIfTouchedOrder::new_checked(
    trader_id,
    strategy_id,
    instrument_id,
    client_order_id,
    OrderSide::Sell,
    qty,
    trigger_price,
    TriggerType::LastPrice,
    TimeInForce::Gtc,
    None,          // expire_time
    false, false,  // reduce_only, quote_quantity
    None, None,    // display_qty, emulation_trigger
    None, None,    // trigger_instrument_id, contingency_type
    None, None,    // order_list_id, linked_order_ids
    None,          // parent_order_id
    None, None,    // exec_algorithm_id, params
    None,          // exec_spawn_id
    None,          // tags
    init_id,
    ts_init,
)?;

Prefer new_checked in production code; if you stick with new, you’ll still get clearer panic messages.

8 Impact & Next Steps

Fail-fast safety – all invariants enforced before the order leaves your code.
Granular error reporting – propagate Result outward instead of catching panics.
Zero breaking changes – legacy code continues to compile.

Action items: migrate to new_checked, bubble the Result, and sleep better during live trading.

9 References

Type	Link
Pull Request #2577	https://github.com/nautechsystems/nautilus_trader/pull/2577
Issue #2529	https://github.com/nautechsystems/nautilus_trader/issues/2529

Happy (and safer) trading!

Contributing a Safer LimitIfTouchedOrder to Nautilus Trader — A Small Open-Source Win for Rust Trading

May 3, 2025 · 3 min read

Vadim Nicolai

Senior Software Engineer at Vitrifi

Introduction

LimitIfTouchedOrder (LIT) is a conditional order that sits between a simple limit order and a stop-limit order: it rests inactive until a trigger price is touched, then converts into a plain limit at the specified limit price. Because it straddles two distinct price levels and multiple conditional flags, robust validation is critical—any silent mismatch can manifest as unwanted executions in live trading.

Pull Request #2533 standardises and hardens the validation logic for LIT orders, bringing it up to the same quality bar as MarketOrder and LimitOrder. The PR was merged into develop on May 1 2025 by @cjdsellers (+207 / −9 across one file). (GitHub, [GitHub][2])

Why the Change Was Needed

Inconsistent invariants – quantity, price, and trigger_price were not always checked for positivity.
Edge-case foot-guns – TimeInForce::Gtd could be set with a zero expire_time, silently turning a “good-til-date” order into “good-til-cancel”.
Side/trigger mismatch – A BUY order with a trigger above the limit price (or SELL with trigger below limit) yielded undefined behaviour.
Developer frustration – Consumers of the SDK had to replicate guard clauses externally; a single canonical constructor removes that burden.

Key Enhancements

Area	Before	After
Constructor API	`new` (panic-on-error)	`new_checked` (returns `Result`) + `new` now wraps it
Positivity checks	Only partial	Guaranteed for `quantity`, `price`, `trigger_price`, and optional `display_qty`
Display quantity	Not validated	Must be ≤ `quantity`
GTD orders	No expire validation	Must supply `expire_time` when `TimeInForce::Gtd`
Side/trigger rule	Undefined	`BUY ⇒ trigger ≤ price`, `SELL ⇒ trigger ≥ price`
Unit-tests	0 dedicated tests	5 focused tests (happy-path + 4 failure modes)

Implementation Highlights

new_checked – a fallible constructor returning anyhow::Result<Self>. All invariants live here.
Guard helpers – leverages check_positive_quantity, check_positive_price, and check_predicate_false from nautilus_core::correctness.
Legacy behaviour preserved – the original new now calls new_checked().expect("FAILED"), so downstream crates that relied on panics keep working.
Concise Display impl – human-readable string that shows side, quantity, instrument, prices, trigger type, TIF, and status for quick debugging.
Test suite – written with rstest; covers ok, quantity_zero, gtd_without_expire, buy_trigger_gt_price, and sell_trigger_lt_price.

Code diff stats: 207 additions, 9 deletions, affecting crates/model/src/orders/limit_if_touched.rs. ([GitHub][2])

Impact on Integrators

If you only called LimitIfTouchedOrder::new nothing breaks—you’ll merely enjoy better error messages if you misuse the API. For stricter compile-time safety, switch to the new new_checked constructor and handle Result<T> explicitly.

let order = LimitIfTouchedOrder::new_checked(
    trader_id,
    strategy_id,
    instrument_id,
    client_order_id,
    OrderSide::Buy,
    qty,
    limit_price,
    trigger_price,
    TriggerType::LastPrice,
    TimeInForce::Gtc,
    None,          // expire_time
    false, false,  // post_only, reduce_only
    false, None,   // quote_qty, display_qty
    None, None,    // emulation_trigger, trigger_instrument_id
    None, None,    // contingency_type, order_list_id
    None,          // linked_order_ids
    None,          // parent_order_id
    None, None,    // exec_algorithm_id, params
    None,          // exec_spawn_id
    None,          // tags
    init_id,
    ts_init,
)?;

Conclusion

PR [#2533] dramatically reduces the surface area for invalid LIT orders by centralising all domain rules in a single, auditable place. Whether you’re building discretionary tooling or a fully automated strategy on top of Nautilus Trader, you now get fail-fast behaviour with precise error semantics—no more mystery fills in production.

Next steps: adopt new_checked, make your own wrappers return Result, and enjoy safer trading.

How to Integrate OpenAI TTS with FFmpeg in a FastAPI Service

March 6, 2025 · 5 min read

Vadim Nicolai

Senior Software Engineer at Vitrifi

Introduction

OpenAI offers powerful text-to-speech capabilities, enabling developers to generate spoken audio from raw text. Meanwhile, FFmpeg is the de facto standard tool for audio/video processing—used heavily for tasks like merging audio files, converting formats, and applying filters. Combining these two in a FastAPI application can produce a scalable, production-ready text-to-speech (TTS) workflow that merges and manipulates audio via FFmpeg under the hood.

This article demonstrates how to:

Accept text input through a FastAPI endpoint
Chunk text and use OpenAI to generate MP3 segments
Merge generated segments with FFmpeg (through the pydub interface)
Return or store a final MP3 file, ideal for streamlined TTS pipelines

By the end, you’ll understand how to build a simple but effective text-to-speech microservice that leverages the power of OpenAI and FFmpeg.

1. Why Combine OpenAI and FFmpeg

Chunked Processing: Long text might exceed certain API limits or timeouts. Splitting into smaller parts ensures each piece is handled reliably.
Post-processing: Merging segments, adding intros or outros, or applying custom filters (such as volume adjustments) becomes trivial with FFmpeg.
Scalability: A background task system (like FastAPI’s BackgroundTasks) can handle requests without blocking the main thread.
Automation: Minimizes manual involvement—one endpoint can receive text and produce a final merged MP3.

2. FastAPI Endpoint and Background Tasks

Below is the FastAPI code that implements a TTS service using the OpenAI API and pydub (which uses FFmpeg internally). It splits the input text into manageable chunks, generates MP3 files per chunk, then merges them:

import os
import time
import logging
from pathlib import Path

from dotenv import load_dotenv
from fastapi import APIRouter, HTTPException, Request, BackgroundTasks
from fastapi.responses import JSONResponse
from pydantic import BaseModel
from openai import OpenAI
from pydub import AudioSegment

load_dotenv(".env.local")

OPENAI_API_KEY = os.environ.get("OPENAI_API_KEY")
client = OpenAI(api_key=OPENAI_API_KEY)

router = APIRouter()

logging.basicConfig(
    level=logging.DEBUG,  # Set root logger to debug level
    format='%(levelname)s | %(name)s | %(message)s'
)
logger = logging.getLogger(__name__)
logger.setLevel(logging.DEBUG)

class AudioRequest(BaseModel):
    input: str

def chunk_text(text: str, chunk_size: int = 4096):
    """
    Generator that yields `text` in chunks of `chunk_size`.
    """
    for i in range(0, len(text), chunk_size):
        yield text[i:i + chunk_size]

@router.post("/speech")
async def generate_speech(request: Request, body: AudioRequest, background_tasks: BackgroundTasks):
    """
    Fires off the TTS request in the background (fire-and-forget).
    Logs are added to track progress. No zip file is created.
    """
    model = "tts-1"
    voice = "onyx"

    if not body.input:
        raise HTTPException(
            status_code=400,
            detail="Missing required field: input"
        )

    # Current time for folder naming or logging
    timestamp = int(time.time() * 1000)

    # Create a folder for storing output
    output_folder = Path(".") / f"speech_{timestamp}"
    output_folder.mkdir(exist_ok=True)

    # Split the input into chunks
    chunks = list(chunk_text(body.input, 4096))

    # Schedule the actual speech generation in the background
    background_tasks.add_task(
        generate_audio_files,
        chunks=chunks,
        output_folder=output_folder,
        model=model,
        voice=voice,
        timestamp=timestamp
    )

    # Log and return immediately
    logger.info(f"Speech generation task started at {timestamp} with {len(chunks)} chunks.")
    return JSONResponse({"detail": f"Speech generation started. Timestamp: {timestamp}"})

def generate_audio_files(chunks, output_folder, model, voice, timestamp):
    """
    Generates audio files for each chunk. Runs in the background.
    After all chunks are created, merges them into a single MP3 file.
    """
    try:
        # Generate individual chunk MP3s
        for index, chunk in enumerate(chunks):
            speech_filename = f"speech-chunk-{index + 1}.mp3"
            speech_file_path = output_folder / speech_filename

            logger.info(f"Generating audio for chunk {index + 1}/{len(chunks)}...")

            response = client.audio.speech.create(
                model=model,
                voice=voice,
                input=chunk,
                response_format="mp3",
            )

            response.stream_to_file(speech_file_path)
            logger.info(f"Chunk {index + 1} audio saved to {speech_file_path}")

        # Merge all generated MP3 files into a single file
        logger.info("Merging all audio chunks into one file...")
        merged_audio = AudioSegment.empty()

        def file_index(file_path: Path):
            # Expects file names like 'speech-chunk-1.mp3'
            return int(file_path.stem.split('-')[-1])

        sorted_audio_files = sorted(output_folder.glob("speech-chunk-*.mp3"), key=file_index)
        for audio_file in sorted_audio_files:
            chunk_audio = AudioSegment.from_file(audio_file, format="mp3")
            merged_audio += chunk_audio

        merged_output_file = output_folder / f"speech-merged-{timestamp}.mp3"
        merged_audio.export(merged_output_file, format="mp3")
        logger.info(f"Merged audio saved to {merged_output_file}")

        logger.info(f"All speech chunks generated and merged for timestamp {timestamp}.")
    except Exception as e:
        logger.error(f"OpenAI error (timestamp {timestamp}): {e}")

Key Takeaways

AudioRequest model enforces the presence of an input field.
chunk_text ensures no chunk exceeds 4096 characters (you can adjust this size).
BackgroundTasks offloads the TTS generation so the API can respond promptly.
pydub merges MP3 files (which in turn calls FFmpeg).

3. Using FFmpeg Under the Hood

Installing pydub requires FFmpeg on your system. Ensure FFmpeg is in your PATH—otherwise you’ll get errors when merging or saving MP3 files. For Linux (Ubuntu/Debian):

sudo apt-get update
sudo apt-get install ffmpeg

For macOS (using Homebrew):

brew install ffmpeg

If you’re on Windows, install FFmpeg from FFmpeg’s official site or use a package manager like chocolatey or scoop.

4. Mermaid JS Diagram

Below is a Mermaid sequence diagram illustrating the workflow:

Explanation:

User sends a POST request with text data.
FastAPI quickly acknowledges the request, then spawns a background task.
Chunks of text are processed via OpenAI TTS, saving individual MP3 files.
pydub merges them (calling FFmpeg behind the scenes).
Final merged file is ready in your output directory.

5. Conclusion

Integrating OpenAI text-to-speech with FFmpeg via pydub in a FastAPI application provides a robust, scalable way to automate TTS pipelines:

Reliability: Chunk-based processing handles large inputs without overloading the API.
Versatility: FFmpeg’s audio manipulation potential is nearly limitless.
Speed: Background tasks ensure the main API remains responsive.

With the sample code above, you can adapt chunk sizes, add authentication, or expand the pipeline to include more sophisticated post-processing (like watermarking, crossfading, or mixing in music). Enjoy building richer audio capabilities into your apps—OpenAI and FFmpeg make a powerful duo.

How to Set Up and Run DeepSeek-R1 Locally With Ollama and FastAPI

January 30, 2025 · 5 min read

Vadim Nicolai

Senior Software Engineer at Vitrifi

Introduction

DeepSeek-R1 is a family of large language models (LLMs) known for advanced natural language capabilities. While hosting an LLM in the cloud can be convenient, local deployment provides greater control over latency, privacy, and resource utilization. Tools like Ollama simplify this process by handling model downloading and quantization. However, to truly scale or integrate these capabilities into other services, you often need a robust REST API layer—FastAPI is perfect for this.

This article covers the entire pipeline:

Installing and configuring Ollama to serve DeepSeek-R1 locally
Interacting with DeepSeek-R1 using the CLI, Python scripts, or a FastAPI endpoint for streaming responses
Demonstrating a minimal FastAPI integration, so you can easily wrap your model in a web service

By the end, you’ll see how to run DeepSeek-R1 locally while benefiting from FastAPI’s scalability, logging, and integration features—all without sending your data to external servers.

1. Why Run DeepSeek-R1 Locally?

Running DeepSeek-R1 on your own machine has multiple advantages:

Privacy & Security: No data is sent to third-party services
Performance & Low Latency: Local inference avoids remote API calls
Customization: Fine-tune or adjust inference parameters as needed
No Rate Limits: In-house solution means no usage caps or unexpected cost spikes
Offline Availability: Once downloaded, the model runs even without internet access

2. Setting Up DeepSeek-R1 Locally With Ollama

2.1 Installing Ollama

Download Ollama from the official website.
Install it on your machine, just like any application.

note

Check Ollama’s documentation for platform-specific support. It’s available on macOS and some Linux distributions.

2.2 Download and Test DeepSeek-R1

Ollama makes model retrieval simple:

ollama run deepseek-r1

This command automatically downloads DeepSeek-R1 (the default variant). If your hardware cannot handle the full 671B-parameter model, specify a smaller distilled version:

ollama run deepseek-r1:7b

info

DeepSeek-R1 offers different parameter sizes (e.g., 1.5B, 7B, 14B, 70B, 671B) for various hardware setups.

2.3 Running DeepSeek-R1 in the Background

To serve the model continuously (useful for external services like FastAPI):

ollama serve

By default, Ollama listens on http://localhost:11434.

3. Using DeepSeek-R1 Locally

3.1 Command-Line (CLI) Inference

You can chat directly with DeepSeek-R1 in your terminal:

ollama run deepseek-r1

Type a question or prompt; responses stream back in real time.

3.2 Accessing DeepSeek-R1 via API

If you’re building an application, you can call Ollama’s REST API:

curl http://localhost:11434/api/chat -d '{
  "model": "deepseek-r1",
  "messages": [{ "role": "user", "content": "Solve: 25 * 25" }],
  "stream": false
}'

note

Set "stream": true to receive chunked streaming responses—a feature you can integrate easily into web apps or server frameworks like FastAPI.

3.3 Python Integration

Install the ollama Python package:

pip install ollama

Then use:

import ollama

response = ollama.chat(
    model="deepseek-r1",
    messages=[
        {"role": "user", "content": "Explain Newton's second law of motion"},
    ],
)
print(response["message"]["content"])

4. FastAPI Integration and Streaming Responses

To wrap DeepSeek-R1 in a fully customizable FastAPI service, you can define streaming endpoints for advanced usage. Below is an example that sends chunked responses to the client:

import os
import json
from typing import List
from pydantic import BaseModel
from dotenv import load_dotenv
from fastapi import FastAPI, Query
from fastapi.responses import StreamingResponse
from openai import OpenAI

from .utils.prompt import ClientMessage, convert_to_openai_messages
from .utils.tools import get_current_weather  # example tool
from .utils.tools import available_tools  # hypothetical dict of tool funcs

load_dotenv(".env.local")

app = FastAPI()
client = OpenAI(api_key="ollama", base_url="http://localhost:11434/v1/")

class Request(BaseModel):
    messages: List[ClientMessage]

def stream_text(messages: List[ClientMessage], protocol: str = 'data'):
    stream = client.chat.completions.create(
        messages=messages,
        model="deepseek-r1",
        stream=True,
    )

    if protocol == 'text':
        for chunk in stream:
            for choice in chunk.choices:
                if choice.finish_reason == "stop":
                    break
                else:
                    yield "{text}".format(text=choice.delta.content)

    elif protocol == 'data':
        draft_tool_calls = []
        draft_tool_calls_index = -1

        for chunk in stream:
            for choice in chunk.choices:
                if choice.finish_reason == "stop":
                    continue
                elif choice.finish_reason == "tool_calls":
                    for tool_call in draft_tool_calls:
                        yield f'9:{{"toolCallId":"{tool_call["id"]}","toolName":"{tool_call["name"]}","args":{tool_call["arguments"]}}}\n'

                    for tool_call in draft_tool_calls:
                        tool_result = available_tools[tool_call["name"]](**json.loads(tool_call["arguments"]))
                        yield (
                            f'a:{{"toolCallId":"{tool_call["id"]}","toolName":"{tool_call["name"]}","args":{tool_call["arguments"]},'
                            f'"result":{json.dumps(tool_result)}}}\n'
                        )
                elif choice.delta.tool_calls:
                    for tool_call in choice.delta.tool_calls:
                        id = tool_call.id
                        name = tool_call.function.name
                        arguments = tool_call.function.arguments
                        if id is not None:
                            draft_tool_calls_index += 1
                            draft_tool_calls.append({"id": id, "name": name, "arguments": ""})
                        else:
                            draft_tool_calls[draft_tool_calls_index]["arguments"] += arguments
                else:
                    yield f'0:{json.dumps(choice.delta.content)}\n'

            # usage
            if chunk.choices == []:
                usage = chunk.usage
                prompt_tokens = usage.prompt_tokens
                completion_tokens = usage.completion_tokens
                yield (
                    f'd:{{"finishReason":"{"tool-calls" if len(draft_tool_calls) > 0 else "stop"}",'
                    f'"usage":{{"promptTokens":{prompt_tokens},"completionTokens":{completion_tokens}}}}}\n'
                )

@app.post("/api/chat")
async def handle_chat_data(request: Request, protocol: str = Query('data')):
    messages = request.messages
    openai_messages = convert_to_openai_messages(messages)
    response = StreamingResponse(stream_text(openai_messages, protocol))
    response.headers['x-vercel-ai-data-stream'] = 'v1'
    return response

Key Points:

stream=True allows the server to stream content chunk by chunk.
The code handles optional “tool calls” logic—customizable for your own environment.
FastAPI’s StreamingResponse ensures the client receives partial output in real time.

With this setup, you can embed DeepSeek-R1 into more complex microservices or orchestrate multi-step workflows that rely on streaming LLM responses.

6. Conclusion

DeepSeek-R1 combined with Ollama and FastAPI gives you a powerful local LLM service. You can handle all aspects of data ingestion, retrieval, and inference in one place—without relying on third-party endpoints or paying subscription costs. Here’s a recap:

Ollama manages downloading and serving the DeepSeek-R1 models.
FastAPI provides a flexible web layer for streaming responses or building microservices.

Build your local AI solutions confidently and privately—DeepSeek-R1 is now at your fingertips.

Adapting Stock Forecasts with AI

December 29, 2024 · 7 min read

Vadim Nicolai

Senior Software Engineer at Vitrifi

Introduction

Financial markets are dynamic: price trends, volatility, and patterns constantly change. These shifts in data distribution, commonly called concept drift, pose a serious challenge for AI models trained on historical data. When the market regime changes—such as transitioning from a calm to a volatile environment—a “stale” model can drastically lose predictive power.

DDG-DA (Data Distribution Generation for Predictable Concept Drift Adaptation) addresses this by forecasting how the data distribution might evolve in the future, instead of only reacting to the most recent data. The approach is rooted in meta-learning (via Qlib’s Meta Controller framework) and helps trading or investment models stay ahead of new trends.

By the end of this article, you will understand:

Why concept drift complicates forecasting in stocks and other financial time series
How DDG-DA uses a future distribution predictor to resample training data
How to incorporate this into Qlib-based workflows to improve stock return and risk-adjusted performance

Concept Drift in Stock Markets

Concept drift refers to changes in the underlying distribution of stock market data. These changes can manifest in multiple ways:

Trends: Bull or bear markets can shift faster or slower than expected
Volatility: Sudden spikes can invalidate models calibrated during calmer periods
Patterns: Market microstructure changes or new correlations can emerge, causing old patterns to wane

Traditional methods often react after drift appears (by retraining on recent data). However, if the drift is somewhat predictable, we can model its trajectory—and proactively train models on future conditions before they fully materialize.

Diagram: Concept Drift Overview

Here, a continuous market data stream (A) encounters distribution shifts (B). These can appear as new trends (C), volatility regimes (D), or changed patterns (E). As a result, a previously trained model (F) gradually loses accuracy (G) if not adapted.

DDG-DA: High-Level Approach

The core principle behind DDG-DA is to forecast the distribution shift itself. Specifically:

Predict Future Distributions
- A meta-model observes historical tasks (for example, monthly or daily tasks in which you train a new stock-prediction model).
- This meta-model estimates how the data distribution might move in the next period, such as anticipating an uptick in volatility or a shift in factor exposures.
Generate Synthetic Training Samples
- Using the distribution forecast, DDG-DA resamples historical data to emulate the expected future conditions.
- It might assign higher weights to periods with similar volatility or market conditions so the final training set reflects what the market might soon become.
Train or Retrain the Forecasting Model
- Your usual forecasting model (for example, LightGBM or LSTM) is then retrained on these forward-looking samples, aligning better with the next period’s actual data distribution.
- As a result, the model remains more accurate when concept drift occurs.

Diagram: DDG-DA Core Steps

This process repeats periodically (for example, each month) to keep your forecasting models aligned with upcoming market conditions.

How It Integrates with Qlib

Qlib provides an AI-oriented Quantitative Investment Platform that handles:

Data: Collecting and structuring historical pricing data, factors, and fundamentals
Modeling: Building daily or intraday forecasts using built-in ML or custom models
Meta Controller: A specialized component for tasks like DDG-DA, which revolve around higher-level meta-learning and distribution adaptation

Diagram: Qlib plus DDG-DA Integration

Qlib Data Layer (A): Feeds into a standard ML pipeline for daily or intraday forecasting (B).
DDG-DA sits in the Meta Controller (C), analyzing tasks, predicting distribution changes, and adjusting the pipeline.
Results circle back into Qlib for backtesting and analysis (D).

Example: Monthly Stock Trend Forecasting

Setting the Tasks
- Suppose you update your stock-ranking model every month, using the last 2 years of data.
- Each month is a “task” in Qlib. Over multiple months, you get a series of tasks for training and validation.
Train the Meta-Model
- DDG-DA learns a function that maps old data distribution patterns to new sample weights.
- This ensures the next month’s training data distribution is closer to the actual conditions that month.
Evaluate
- Compare the results to standard approaches:
  - Rolling Retrain: Only uses the most recent data, ignoring the predictable drift pattern
  - Gradual Forgetting: Weighted by how recent data is, but no direct distribution forecast
  - DDG-DA: Weighs data by predicted future distribution, leading to stronger alignment when drift is not purely random

Diagram: Monthly Task Workflow

Performance and Findings

Research in the associated DDG-DA paper and Qlib examples shows:

Better Signal Quality: Higher Information Coefficient (IC) for stock selection
Enhanced Portfolio Returns: Larger annual returns, improved Sharpe Ratio, and lower drawdowns in backtests
Versatility: Works with a wide range of ML models (Linear, LightGBM, neural networks)
Limitations: If concept drift is completely random or abrupt (no pattern), DDG-DA’s advantages diminish. Predictability is key

Diagram: Performance Improvement

Practical Steps

Install Qlib and ensure you have the dataset (for example, Alpha158) set up

Clone the DDG-DA Example from the Qlib GitHub:

git clone https://github.com/microsoft/qlib.git
cd qlib/examples/benchmarks_dynamic/DDG-DA

Install Requirements:
```
pip install -r requirements.txt
```
Run the Workflow:
```
python workflow.py run
```
- By default, it uses a simple linear forecasting model
- To use LightGBM or another model, specify the --conf_path argument, for example:
```
python workflow.py --conf_path=../workflow_config_lightgbm_Alpha158.yaml run
```
Analyze Results:
- Qlib’s recorder logs signal metrics (IC, ICIR) and backtest performance (annual return, Sharpe)
- Compare with baseline methods (Rolling Retrain, Exponential Forgetting, etc.)

Diagram: Running DDG-DA Workflow

Conclusion

DDG-DA shows how AI can proactively tackle concept drift in stock forecasting. Instead of merely reacting to new data, it anticipates potential distribution changes, producing a more robust, forward-looking training set. When integrated into Qlib’s Meta Controller, it seamlessly fits your existing pipelines, from data ingestion to backtesting.

For practical use:

Ensure your market conditions exhibit some predictability. Random, sudden changes are harder to model
Combine with conventional best practices (risk management, hyperparameter tuning) for a holistic pipeline
Monitor performance: If drift patterns shift, you may need to retrain or retune the DDG-DA meta-model

By forecasting future market states and adapting ahead of time, DDG-DA helps your quantitative strategies remain agile and profitable in evolving financial environments.

Leveraging Qlib and MLflow for Unified Experiment Tracking

December 29, 2024 · 5 min read

Vadim Nicolai

Senior Software Engineer at Vitrifi

Introduction

Financial markets present a dynamic environment where active research and experimentation are critical. Qlib offers a complete “AI-oriented” solution for quantitative investment—covering data loaders, feature engineering, model training, and evaluation. Meanwhile, MLflow provides robust functionality for experiment tracking, handling metrics, artifacts, and hyperparameters across multiple runs. You can further enhance your documentation using specialized syntax for highlighting important information, such as notes or warnings, to help readers navigate complex workflows.

This article shows how to integrate Qlib and MLflow to manage your entire workflow—from data ingestion and factor engineering to model storage and versioning—under a single, unified experiment system. It also demonstrates various ways to emphasize notes or warnings to help readers explore the complexities of this setup.

By the end of this article, you will learn:

How Qlib manages data and modeling in a typical quant workflow
How MLflow tracks experiment artifacts, logs metrics, and organizes multiple runs
How to integrate Qlib’s “Recorder” concept with MLflow’s tracking

1. Qlib Overview

Qlib is a powerful open-source toolkit designed for AI-based quantitative investment. It streamlines common challenges in this domain:

Data Layer: Standardizes daily or intraday bars, fundamental factors, and alpha signals
Feature Engineering: Offers an expression engine (alpha modeling) plus factor definitions
Modeling: Easily pluggable ML models (LightGBM, Linear, RNN, etc.) with out-of-the-box training logic
Evaluation and Backtest: Includes modules for analyzing signals, computing IC/RankIC, and running trading strategies in a backtest simulator

Diagram: Qlib Architecture

Below is a high-level view of Qlib’s architecture—how data flows from raw sources into Qlib’s data handlers, transforms into features, and ultimately fuels model training.

note

Some Qlib features—like intraday data handling or advanced factor expressions—may require additional configuration. Double-check your data paths and environment setup to ensure all pieces are properly configured.

2. MLflow Overview

MLflow is an experiment-tracking tool that organizes runs and artifacts:

Tracking: Logs params, metrics, tags, and artifacts (model checkpoints, charts)
UI: A local or remote interface (mlflow ui) for comparing runs side by side
Model Registry: Version controls deployed models, enabling easy rollback or re-deployment

Diagram: MLflow Overview

warning

When configuring MLflow on remote servers, remember to secure the tracking server appropriately. Unsecured endpoints may expose logs and artifacts to unintended parties.

3. Combining Qlib and MLflow

In typical usage, Qlib handles data ingestion, feature transformations, and model training. MLflow complements it by capturing:

Run Metadata: Each Qlib “Recorder” maps to an MLflow run
Metrics & Params: Qlib logs metrics like Sharpe Ratio or Information Coefficient (IC); MLflow’s UI centralizes them
Artifacts: Saved model files, prediction results, or charts are stored in MLflow’s artifact repository

Diagram: Qlib + MLflow Integration

Below is a top-down diagram showing how user code interacts with Qlib, which in turn leverages MLflow for run logging.

4. Minimal Example

Here’s a simplified script showing the synergy among the three components:

import qlib
from qlib.workflow import R
from qlib.utils import init_instance_by_config

# 1) Init Qlib and MLflow
qlib.init(
    exp_manager={
        "class": "MLflowExpManager",
        "module_path": "qlib.workflow.expm",
        "kwargs": {
            "uri": "file:/path/to/mlruns",
            "default_exp_name": "QlibExperiment"
        },
    }
)

# 2) Start experiment and train
with R.start(experiment_name="QlibExperiment", recorder_name="run1"):
    # Basic config
    model_config = {"class": "LightGBMModel", "kwargs": {"learning_rate": 0.05}}
    dataset_config = {...}

    model = init_instance_by_config(model_config)
    dataset = init_instance_by_config(dataset_config)
    model.fit(dataset)

    # Evaluate
    predictions = model.predict(dataset)

    # log some metrics
    R.log_metrics(Sharpe=1.2, IC=0.03)

    # Save artifacts
    R.save_objects(**{"pred.pkl": predictions, "trained_model.pkl": model})

info

The snippet above logs metrics like Sharpe or IC, making them easily comparable across multiple runs. You can further log hyperparameters via R.log_params(...) for more granular comparisons.

Results:

A new MLflow run named “run1” under “QlibExperiment”
MLflow logs parameters/metrics (learning_rate, Sharpe, IC)
Artifacts “pred.pkl” and “trained_model.pkl” appear in MLflow’s artifact UI

5. Best Practices

Organize Qlib tasks: Use Qlib’s SignalRecord or PortAnaRecord classes to store signals/backtest results, ensuring logs are automatically tied to MLflow runs
Parameter Logging: Send hyperparameters or relevant config to R.log_params(...) for easy comparison in MLflow
Artifact Naming: Keep artifact names consistent (e.g., "pred.pkl") across multiple runs
Model Registry: Consider pushing your best runs to MLflow’s Model Registry for versioned deployment

danger

A mismatch between your local Qlib environment and remote MLflow server can cause logging errors. Ensure both environments are in sync (same Python versions, same library versions).

6. Conclusion

By connecting Qlib’s experiment pipeline to MLflow’s tracking features—and documenting everything thoroughly—you get the best of all worlds:

Qlib: AI-centric quant platform automating data handling, factor engineering, and modeling
MLflow: A robust interface for comparing runs, storing artifacts, and version-controlling the entire process

This synergy simplifies large-scale experimentation—especially when you frequently iterate over factor definitions, hyperparameters, or new trading strategies.

Qlib’s Nested Execution for High-Frequency Trading with AI

December 27, 2024 · 6 min read

Vadim Nicolai

Senior Software Engineer at Vitrifi

Introduction

High-Frequency Trading (HFT) involves handling large volumes of orders at extremely high speeds—often measured in microseconds or milliseconds. AI (machine learning and reinforcement learning, in particular) has become pivotal in capturing fleeting market opportunities and managing real-time decisions in these ultra-fast trading environments.

In Qlib, the Nested Decision Execution Framework simplifies building multi-level HFT strategies, allowing a high-level (daily or weekly) strategy to nest an intraday (or sub-intraday) executor or sub-workflow. This design enables realistic joint backtesting: daily portfolio selection and intraday HFT execution interact seamlessly, ensuring that real slippage, partial fills, and transaction costs are accurately accounted for.

By the end of this guide, you’ll understand:

How Qlib structures multi-level workflows (daily vs. intraday).
How AI techniques (supervised and reinforcement learning) slot into Qlib’s design.
How to set up an Executor sub-workflow for high-frequency order splitting and real-time decision-making.

Multi-Level Strategy Workflow

Below is an overview diagram (adapted from Qlib’s documentation) depicting how daily strategies can nest intraday sub-strategies or RL agents:

Daily Strategy: Generates coarse decisions (e.g., “Buy X shares by day’s end”).
Executor: Breaks decisions into smaller actions. Within it, a Reinforcement Learning policy (or any other AI model) can run at minute or sub-minute intervals.
Simulator/Environment: Provides intraday data, simulates order fills/slippage, and feeds rewards back to the RL policy.

This nesting allows realistic interaction between daily allocation goals and intraday fill performance.

Key Components

1. Information Extractor (Intraday)

For HFT, Qlib can store data at 1-minute intervals, or even tick/orderbook-level data, using specialized backends (e.g., Arctic). An example below shows how Qlib can manage non-fixed-frequency records:

# Example snippet from qlib/examples/orderbook_data
# Download sample data, then import into your local mongo or Arctic DB
python create_dataset.py initialize_library
python create_dataset.py import_data

Once imported, intraday/tick data can be accessed by Qlib’s normal data APIs for feature engineering or direct RL state representation.

2. Forecast Model (Intraday + Daily)

A single Qlib workflow can hold multiple forecast models:

Daily Model: Predicts overnight returns or daily alpha (e.g., LightGBM on daily bars).
Intraday Model: Predicts short-term (minutes/seconds) price movements. This might be a small neural net or an RL policy evaluating states like order-book depth, spread, volume patterns, etc.

Qlib’s reinforcement learning interface (QlibRL) can also handle advanced models:

Policy: Learns from reward signals (e.g., PnL, transaction costs, slippage).
Action Interpreter: Converts policy actions into actual orders.

3. Decision Generator (Daily vs. Intraday)

Daily Decision Generator might produce a target portfolio:

Stock A: +5% allocation
Stock B: -2% allocation

Intraday Decision Generator (within the Executor) can then split these top-level instructions into multiple smaller trades. For example, an RL policy might decide to buy 2% of Stock A during the opening auction, 1% during midday, and 2% near closing, based on real-time microprice signals.

4. Executor & Sub-workflow (Nested)

Executor is where the nested approach truly shines. It wraps a more granular intraday or high-frequency sub-strategy.

This sub-workflow can be as simple as scheduling trades evenly or as advanced as an RL policy that:

Observes short-term price movement.
Acts to minimize slippage and transaction cost.
Receives reward signals from the environment (filled shares, average fill price vs. VWAP, etc.).

5. Environment & Simulator

When applying Reinforcement Learning, Qlib uses an Environment wrapper:

State: Intraday features (latest LOB data, partial fill stats).
Action: The RL agent chooses to place a limit order, market order, or skip.
Reward: Often the negative cost of trading or realized PnL improvement.

You can leverage Qlib’s built-in simulators or customize them for specific market microstructures.

Example Workflow Snippets

Here’s a high-level script illustrating a daily + intraday nested setup. (Pseudocode for demonstration only.)

# daily_intraday_workflow.py

import qlib
from qlib.config import C
from qlib.data import D
from qlib.rl.order_execution_policy import RLOrderExecPolicy
from qlib.strategy.base import BaseStrategy

class DailyAlphaStrategy(BaseStrategy):
    """Generates daily-level decisions (which stocks to buy/sell)."""

    def generate_trade_decision(self, *args, **kwargs):
        # Imagine we have daily predictions from a model...
        scores = self.signal.get_signal()  # daily alpha scores
        # Then produce a dictionary {stock: weight or shares}
        decisions = compute_target_positions(scores)
        return decisions

class NestedExecutor:
    """Executor that calls an intraday RL sub-strategy for each daily decision."""

    def __init__(self, intraday_policy):
        self.intraday_policy = intraday_policy

    def execute_daily_decision(self, daily_decision):
        # Suppose daily_decision = { 'AAPL': +100 shares, 'MSFT': +50 shares }
        # We'll break it into sub-orders via RL
        for stock, shares in daily_decision.items():
            # RL agent decides how to place those shares intraday
            self.intraday_policy.run_execution(stock, shares)

def main():
    qlib.init(provider_uri="your_data_path")  # local data or remote server

    daily_strategy = DailyAlphaStrategy(signal=your_daily_signal)
    intraday_policy = RLOrderExecPolicy()  # RL policy with QlibRL

    executor = NestedExecutor(intraday_policy=intraday_policy)

    # Hypothetical daily loop
    for date in trading_calendar:
        daily_decision = daily_strategy.generate_trade_decision()
        executor.execute_daily_decision(daily_decision)

if __name__ == "__main__":
    main()

Notes:

DailyAlphaStrategy uses a daily alpha model for stock scoring.
NestedExecutor calls RLOrderExecPolicy, which runs intraday steps.
Real code will handle position objects, trade calendars, and backtest frameworks in more detail.

Practical Tips for HFT + AI

Data Freshness: HFT signals must be updated almost in real-time. Ensure your Qlib data pipeline is either streaming or as close to real-time as possible.
Latency Considerations: Real HFT in production must address network latency and order routing. Qlib’s framework focuses on backtesting or simulation; integrating actual exchange connectivity is non-trivial.
Overfitting & Market Regimes: Intraday data is often noisy; guard against overfitting your ML or RL models to fleeting patterns.
Joint Optimization: Tweaking daily portfolio turnover and intraday execution in isolation can be suboptimal. Qlib’s nested design helps you see the whole chain’s PnL effect.
Reinforcement Learning: Start simple (e.g., Q-learning or policy gradient) before moving to complex neural networks. Use carefully designed rewards capturing cost, fill rates, and profit.

Summary

By combining AI (supervised or RL models) with a Nested Decision Execution approach, Qlib lets you:

Unify Daily and Intraday strategies in a single backtest.
Leverage Real-time AI for micro-execution decisions.
Optimize both large-scale allocations and fine-grained order placements simultaneously.

This framework is especially powerful for High-Frequency Trading use cases, where multiple decision layers (portfolio vs. sub-second order slicing) must interact. Whether you’re using classical ML or advanced RL, Qlib streamlines experimentation and helps close the gap between daily trading and ultra-fast intraday execution.

A Comprehensive Guide to Qlib’s Portfolio Strategy, TopkDropoutStrategy, and EnhancedIndexingStrategy

December 25, 2024 · 9 min read

Vadim Nicolai

Senior Software Engineer at Vitrifi

Introduction

In Qlib, portfolio strategies turn prediction scores into actionable orders (buy/sell) for building and rebalancing a portfolio. This article will:

Explain the architecture of key strategy classes.
Demonstrate TopkDropoutStrategy and EnhancedIndexingStrategy in detail.
Present diagrams and code blocks illustrating the step-by-step flows.

By the end, you’ll see how to plug your own predictive model scores into these strategies and make them trade automatically.

Class Hierarchy

Below is a simple diagram showing how these classes inherit from one another:

BaseStrategy: Core abstraction; requires a method to generate a trade decision.
BaseSignalStrategy: Extends BaseStrategy with “signals” (model scores).
TopkDropoutStrategy: Buys the top-K scoring stocks and drops the worst ones.
WeightStrategyBase: Uses target weights (fractions of the portfolio) rather than discrete buy/sell.
EnhancedIndexingStrategy: Adds advanced risk modeling for partial index tracking.

High-Level Trading Flow for Top-K

Here’s a top-down look at a generic daily (or periodic) process once your predictions are ready:

Code Walkthrough

Below we break down the code for Qlib’s portfolio strategies into sections, each supplemented by additional flow diagrams relevant to that part of the code.

1. Imports and Setup

import os
import copy
import warnings
import numpy as np
import pandas as pd

from typing import Dict, List, Text, Tuple, Union
from abc import ABC

from qlib.data import D
from qlib.data.dataset import Dataset
from qlib.model.base import BaseModel
from qlib.strategy.base import BaseStrategy
from qlib.backtest.position import Position
from qlib.backtest.signal import Signal, create_signal_from
from qlib.backtest.decision import Order, OrderDir, TradeDecisionWO
from qlib.log import get_module_logger
from qlib.utils import get_pre_trading_date, load_dataset
from qlib.contrib.strategy.order_generator import OrderGenerator, OrderGenWOInteract
from qlib.contrib.strategy.optimizer import EnhancedIndexingOptimizer

Explanation

Core Python imports for numerical operations, data processing, and type hints.
Qlib-specific imports:
- BaseStrategy, Position, Signal, and TradeDecisionWO for implementing custom strategies and managing trade decisions.
- OrderGenerator and EnhancedIndexingOptimizer for generating orders from target weights and optimizing risk exposure.

2. `BaseSignalStrategy`

Below is a class diagram illustrating BaseSignalStrategy inheriting from BaseStrategy and adding a signal field:

class BaseSignalStrategy(BaseStrategy, ABC):
    def __init__(
        self,
        *,
        signal: Union[Signal, Tuple[BaseModel, Dataset], List, Dict, Text, pd.Series, pd.DataFrame] = None,
        model=None,
        dataset=None,
        risk_degree: float = 0.95,
        trade_exchange=None,
        level_infra=None,
        common_infra=None,
        **kwargs,
    ):
        """
        Parameters
        -----------
        signal :
            Could be a Signal object or raw predictions from a model/dataset.
        risk_degree : float
            Fraction of total capital to invest (default 0.95).
        trade_exchange : Exchange
            Market info for dealing orders, generating reports, etc.
        """
        super().__init__(level_infra=level_infra, common_infra=common_infra, trade_exchange=trade_exchange, **kwargs)

        self.risk_degree = risk_degree

        # For backward-compatibility with (model, dataset)
        if model is not None and dataset is not None:
            warnings.warn("`model` `dataset` is deprecated; use `signal`.", DeprecationWarning)
            signal = model, dataset

        self.signal: Signal = create_signal_from(signal)

    def get_risk_degree(self, trade_step=None):
        """Return the fraction of total value to allocate."""
        return self.risk_degree

Key Points

BaseSignalStrategy extends BaseStrategy and integrates a concept of a signal (predictions).
risk_degree indicates what fraction of the portfolio’s capital is invested (defaults to 95%).

3. `TopkDropoutStrategy`

Here’s a flow diagram specifically for the generate_trade_decision method in TopkDropoutStrategy, showing how the code sorts holdings, identifies “drop” stocks, and selects new buys:

class TopkDropoutStrategy(BaseSignalStrategy):
    def __init__(
        self,
        *,
        topk,
        n_drop,
        method_sell="bottom",
        method_buy="top",
        hold_thresh=1,
        only_tradable=False,
        forbid_all_trade_at_limit=True,
        **kwargs,
    ):
        """
        Parameters
        -----------
        topk : int
            Desired number of stocks to hold.
        n_drop : int
            Number of stocks replaced each rebalance.
        method_sell : str
            Approach to dropping existing stocks (e.g. 'bottom').
        method_buy : str
            Approach to adding new stocks (e.g. 'top').
        hold_thresh : int
            Must hold a stock for at least this many days before selling.
        only_tradable : bool
            Ignore non-tradable stocks.
        forbid_all_trade_at_limit : bool
            Disallow trades if limit up/down is reached.
        """
        super().__init__(**kwargs)
        self.topk = topk
        self.n_drop = n_drop
        self.method_sell = method_sell
        self.method_buy = method_buy
        self.hold_thresh = hold_thresh
        self.only_tradable = only_tradable
        self.forbid_all_trade_at_limit = forbid_all_trade_at_limit

    def generate_trade_decision(self, execute_result=None):
        trade_step = self.trade_calendar.get_trade_step()
        trade_start_time, trade_end_time = self.trade_calendar.get_step_time(trade_step)
        pred_start_time, pred_end_time = self.trade_calendar.get_step_time(trade_step, shift=1)
        pred_score = self.signal.get_signal(start_time=pred_start_time, end_time=pred_end_time)

        # If no score, do nothing
        if pred_score is None:
            return TradeDecisionWO([], self)

        # If multiple columns, pick the first
        if isinstance(pred_score, pd.DataFrame):
            pred_score = pred_score.iloc[:, 0]

        # Helper functions for picking top/bottom stocks...
        ...

        # Copy current position
        current_temp: Position = copy.deepcopy(self.trade_position)
        sell_order_list = []
        buy_order_list = []
        cash = current_temp.get_cash()
        current_stock_list = current_temp.get_stock_list()

        # Sort current holdings by descending score
        last = pred_score.reindex(current_stock_list).sort_values(ascending=False).index

        # Identify new stocks to buy
        ...

        # Figure out which existing stocks to sell
        ...

        # Create Sell Orders
        ...

        # Create Buy Orders
        ...

        return TradeDecisionWO(sell_order_list + buy_order_list, self)

Key Points

The “top-K, drop worst-K” concept is implemented by comparing current holdings to the broader universe of stocks sorted by score.
Some specifics:
- method_sell can be "bottom", so you drop the lowest-scored holdings.
- method_buy can be "top", so you pick the top new stocks that aren’t in the portfolio.

4. `WeightStrategyBase`

Below is a quick diagram for how WeightStrategyBase converts target weights into final orders:

class WeightStrategyBase(BaseSignalStrategy):
    def __init__(
        self,
        *,
        order_generator_cls_or_obj=OrderGenWOInteract,
        **kwargs,
    ):
        super().__init__(**kwargs)
        if isinstance(order_generator_cls_or_obj, type):
            self.order_generator: OrderGenerator = order_generator_cls_or_obj()
        else:
            self.order_generator: OrderGenerator = order_generator_cls_or_obj

    def generate_target_weight_position(self, score, current, trade_start_time, trade_end_time):
        """
        Subclasses must override this to return:
        {stock_id: target_weight}
        """
        raise NotImplementedError()

    def generate_trade_decision(self, execute_result=None):
        trade_step = self.trade_calendar.get_trade_step()
        trade_start_time, trade_end_time = self.trade_calendar.get_step_time(trade_step)
        pred_start_time, pred_end_time = self.trade_calendar.get_step_time(trade_step, shift=1)
        pred_score = self.signal.get_signal(start_time=pred_start_time, end_time=pred_end_time)
        if pred_score is None:
            return TradeDecisionWO([], self)

        current_temp = copy.deepcopy(self.trade_position)
        assert isinstance(current_temp, Position)

        # Let the subclass produce the weights
        target_weight_position = self.generate_target_weight_position(
            score=pred_score, current=current_temp, trade_start_time=trade_start_time, trade_end_time=trade_end_time
        )

        # Convert weights -> Orders
        order_list = self.order_generator.generate_order_list_from_target_weight_position(
            current=current_temp,
            trade_exchange=self.trade_exchange,
            risk_degree=self.get_risk_degree(trade_step),
            target_weight_position=target_weight_position,
            pred_start_time=pred_start_time,
            pred_end_time=pred_end_time,
            trade_start_time=trade_start_time,
            trade_end_time=trade_end_time,
        )
        return TradeDecisionWO(order_list, self)

Key Points

WeightStrategyBase uses a target-weight approach: you specify a final allocation for each stock.
The built-in order_generator calculates how many shares to buy/sell to achieve the target allocation.

5. `EnhancedIndexingStrategy`

Lastly, a diagram shows how this strategy merges model scores with factor data and a benchmark:

class EnhancedIndexingStrategy(WeightStrategyBase):
    """
    Combines active and passive management, aiming to
    outperform a benchmark index while controlling tracking error.
    """

    FACTOR_EXP_NAME = "factor_exp.pkl"
    FACTOR_COV_NAME = "factor_cov.pkl"
    SPECIFIC_RISK_NAME = "specific_risk.pkl"
    BLACKLIST_NAME = "blacklist.pkl"

    def __init__(
        self,
        *,
        riskmodel_root,
        market="csi500",
        turn_limit=None,
        name_mapping={},
        optimizer_kwargs={},
        verbose=False,
        **kwargs,
    ):
        super().__init__(**kwargs)
        self.logger = get_module_logger("EnhancedIndexingStrategy")

        self.riskmodel_root = riskmodel_root
        self.market = market
        self.turn_limit = turn_limit

        self.factor_exp_path = name_mapping.get("factor_exp", self.FACTOR_EXP_NAME)
        self.factor_cov_path = name_mapping.get("factor_cov", self.FACTOR_COV_NAME)
        self.specific_risk_path = name_mapping.get("specific_risk", self.SPECIFIC_RISK_NAME)
        self.blacklist_path = name_mapping.get("blacklist", self.BLACKLIST_NAME)

        self.optimizer = EnhancedIndexingOptimizer(**optimizer_kwargs)
        self.verbose = verbose
        self._riskdata_cache = {}

    def get_risk_data(self, date):
        if date in self._riskdata_cache:
            return self._riskdata_cache[date]

        root = self.riskmodel_root + "/" + date.strftime("%Y%m%d")
        if not os.path.exists(root):
            return None

        factor_exp = load_dataset(root + "/" + self.factor_exp_path, index_col=[0])
        factor_cov = load_dataset(root + "/" + self.factor_cov_path, index_col=[0])
        specific_risk = load_dataset(root + "/" + self.specific_risk_path, index_col=[0])

        if not factor_exp.index.equals(specific_risk.index):
            specific_risk = specific_risk.reindex(factor_exp.index, fill_value=specific_risk.max())

        universe = factor_exp.index.tolist()
        blacklist = []
        if os.path.exists(root + "/" + self.blacklist_path):
            blacklist = load_dataset(root + "/" + self.blacklist_path).index.tolist()

        self._riskdata_cache[date] = factor_exp.values, factor_cov.values, specific_risk.values, universe, blacklist
        return self._riskdata_cache[date]

    def generate_target_weight_position(self, score, current, trade_start_time, trade_end_time):
        trade_date = trade_start_time
        pre_date = get_pre_trading_date(trade_date, future=True)

        outs = self.get_risk_data(pre_date)
        if outs is None:
            self.logger.warning(f"No risk data for {pre_date:%Y-%m-%d}, skipping optimization")
            return None

        factor_exp, factor_cov, specific_risk, universe, blacklist = outs

        # Align score with risk model universe
        score = score.reindex(universe).fillna(score.min()).values

        # Current portfolio weights
        cur_weight = current.get_stock_weight_dict(only_stock=False)
        cur_weight = np.array([cur_weight.get(stock, 0) for stock in universe])
        cur_weight = cur_weight / self.get_risk_degree(trade_date)

        # Benchmark weight
        bench_weight = D.features(
            D.instruments("all"), [f"${self.market}_weight"], start_time=pre_date, end_time=pre_date
        ).squeeze()
        bench_weight.index = bench_weight.index.droplevel(level="datetime")
        bench_weight = bench_weight.reindex(universe).fillna(0).values

        # Track which stocks are tradable and which are blacklisted
        tradable = D.features(D.instruments("all"), ["$volume"], start_time=pre_date, end_time=pre_date).squeeze()
        tradable.index = tradable.index.droplevel(level="datetime")
        tradable = tradable.reindex(universe).gt(0).values
        mask_force_hold = ~tradable
        mask_force_sell = np.array([stock in blacklist for stock in universe], dtype=bool)

        # Optimize based on scores + factor model
        weight = self.optimizer(
            r=score,
            F=factor_exp,
            cov_b=factor_cov,
            var_u=specific_risk**2,
            w0=cur_weight,
            wb=bench_weight,
            mfh=mask_force_hold,
            mfs=mask_force_sell,
        )

        target_weight_position = {stock: w for stock, w in zip(universe, weight) if w > 0}

        if self.verbose:
            self.logger.info(f"trade date: {trade_date:%Y-%m-%d}")
            self.logger.info(f"number of holding stocks: {len(target_weight_position)}")
            self.logger.info(f"total holding weight: {weight.sum():.6f}")

        return target_weight_position

Key Points

Uses riskmodel_root to pull factor exposures, covariances, and specific risk estimates.
Combines your model scores with a benchmark weight to control tracking error via an optimizer.
Produces a final weight map, which Qlib then converts to buy/sell orders.

Summary

BaseSignalStrategy attaches prediction data to a strategy.
TopkDropoutStrategy implements a straightforward “buy top-K, drop worst-K” approach.
WeightStrategyBase generalizes weight-based rebalancing.
EnhancedIndexingStrategy is a powerful extension, combining active signals and passive indexing with risk control.

By customizing just a few methods or parameters, you can adapt these strategies to your own investing style. Simply feed your daily scores (prediction of returns) into Qlib, pick a strategy class, and let Qlib do the rest.

Happy Trading!

Understanding Score IC in Qlib for Enhanced Profit

December 24, 2024 · 6 min read

Vadim Nicolai

Senior Software Engineer at Vitrifi

Introduction

One of the core ideas in quantitative finance is that model predictions—often called “scores”—can be mapped to expected returns on an instrument. In Qlib, these scores are evaluated using metrics like the Information Coefficient (IC) and Rank IC to show how well the scores predict future returns. Essentially, the higher the score, the more profit the instruments—if your IC is positive and statistically significant, the highest-scored stocks should, on average, outperform the lower-scored ones.

1 Self-Diagnosis & Mind-Set​

2 Historical Perspective & Narrative Traps​

3 Quantitative Evidence​

4 Risk & Capital Allocation​

5 Process & Decision Architecture​

6 Psychology & Continuous Improvement​

7 Wrap-Up​

1 Background​

2 Why the Change Was Necessary​

3 What PR #2577 Delivers​

4 File Walkthrough Highlights​

6 Order-Lifecycle Diagram​

7 Using the New API​

8 Impact & Next Steps​

9 References​

Introduction​

Why the Change Was Needed​

Key Enhancements​

Implementation Highlights​

Impact on Integrators​

Conclusion​

Introduction​

1. Why Combine OpenAI and FFmpeg​

2. FastAPI Endpoint and Background Tasks​

Key Takeaways​

3. Using FFmpeg Under the Hood​

4. Mermaid JS Diagram​

5. Conclusion​

Introduction​

1. Why Run DeepSeek-R1 Locally?​

2. Setting Up DeepSeek-R1 Locally With Ollama​

2.1 Installing Ollama​

2.2 Download and Test DeepSeek-R1​

2.3 Running DeepSeek-R1 in the Background​

3. Using DeepSeek-R1 Locally​

3.1 Command-Line (CLI) Inference​

3.2 Accessing DeepSeek-R1 via API​

3.3 Python Integration​

4. FastAPI Integration and Streaming Responses​

Key Points:​

6. Conclusion​

Introduction​

Concept Drift in Stock Markets​

Diagram: Concept Drift Overview​

DDG-DA: High-Level Approach​

Diagram: DDG-DA Core Steps​

How It Integrates with Qlib​

Diagram: Qlib plus DDG-DA Integration​

Example: Monthly Stock Trend Forecasting​

Diagram: Monthly Task Workflow​

Performance and Findings​

Diagram: Performance Improvement​

Practical Steps​

Diagram: Running DDG-DA Workflow​

Conclusion​

Further Reading and References​

Introduction​

1. Qlib Overview​

Diagram: Qlib Architecture​

2. MLflow Overview​

Diagram: MLflow Overview​

3. Combining Qlib and MLflow​

Diagram: Qlib + MLflow Integration​

4. Minimal Example​

5. Best Practices​

6. Conclusion​

Further Reading and References​

Introduction​

Multi-Level Strategy Workflow​

Key Components​

1. Information Extractor (Intraday)​

2. Forecast Model (Intraday + Daily)​

3. Decision Generator (Daily vs. Intraday)​

4. Executor & Sub-workflow (Nested)​

5. Environment & Simulator​

Example Workflow Snippets​

Practical Tips for HFT + AI​

Summary​

Further Reading & References​

Introduction​

1 Self-Diagnosis & Mind-Set

2 Historical Perspective & Narrative Traps

3 Quantitative Evidence

4 Risk & Capital Allocation

5 Process & Decision Architecture

6 Psychology & Continuous Improvement

7 Wrap-Up

1 Background

2 Why the Change Was Necessary

3 What PR #2577 Delivers

4 File Walkthrough Highlights

6 Order-Lifecycle Diagram

7 Using the New API

8 Impact & Next Steps

9 References

Introduction

Why the Change Was Needed

Key Enhancements

Implementation Highlights

Impact on Integrators

Conclusion

Introduction

1. Why Combine OpenAI and FFmpeg

2. FastAPI Endpoint and Background Tasks

Key Takeaways

3. Using FFmpeg Under the Hood

4. Mermaid JS Diagram

5. Conclusion

Introduction

1. Why Run DeepSeek-R1 Locally?

2. Setting Up DeepSeek-R1 Locally With Ollama

2.1 Installing Ollama

2.2 Download and Test DeepSeek-R1

2.3 Running DeepSeek-R1 in the Background

3. Using DeepSeek-R1 Locally

3.1 Command-Line (CLI) Inference

3.2 Accessing DeepSeek-R1 via API

3.3 Python Integration

4. FastAPI Integration and Streaming Responses

Key Points:

6. Conclusion

Introduction

Concept Drift in Stock Markets

Diagram: Concept Drift Overview

DDG-DA: High-Level Approach

Diagram: DDG-DA Core Steps

How It Integrates with Qlib

Diagram: Qlib plus DDG-DA Integration

Example: Monthly Stock Trend Forecasting

Diagram: Monthly Task Workflow

Performance and Findings

Diagram: Performance Improvement

Practical Steps

Diagram: Running DDG-DA Workflow

Conclusion

Further Reading and References

Introduction

1. Qlib Overview

Diagram: Qlib Architecture

2. MLflow Overview

Diagram: MLflow Overview

3. Combining Qlib and MLflow

Diagram: Qlib + MLflow Integration

4. Minimal Example

5. Best Practices

6. Conclusion

Further Reading and References

Introduction

Multi-Level Strategy Workflow

Key Components

1. Information Extractor (Intraday)

2. Forecast Model (Intraday + Daily)

3. Decision Generator (Daily vs. Intraday)

4. Executor & Sub-workflow (Nested)

5. Environment & Simulator

Example Workflow Snippets

Practical Tips for HFT + AI

Summary

Further Reading & References

Introduction