Lecture 3: The Anatomy of Price Discovery
A practical series for the discerning retail trader and the quantitative alchemist on Market Microstructure
đŻïžGreetings, esteemed reader!
In the previous lecture, we rolled up our sleeves and laid out the theory and practical applications of various Liquidity Measure methods.
Now, we lay the last piece of theory you will need before we dive into marketâmaking models: how prices digest information and what âefficiencyââ really means.
I will keep the âđŹâ tags for theoryâheavy passages (great for context, less immediately monetisable) and âđ ïžâ for handsâon ideas you can plug straight into code or trading heuristics.
đ„ Shall we commence?
Why do people trade? đŹ
The market is a game of expectations on the assetsâ value. If a market participant thinks ETH will make 10x, they buy spot from someone who wants to get rid of it, thinking it is overpriced; both participants have their own understanding of the assetâs fundamentals.
Prices move continuously because market participants place orders for three main reasons:
Risk-sharing/rebalancing â move along the efficient frontier to earn risk premiums. Everybody has their own risk profile; if they want to take risks, they want to get paid for it.
Personal liquidity needs â raise cash or deploy capital. Real âgrocery marketâ situation, - people are selling assets to get money, buying assets to invest, expecting long-term growth.
Speculation â act on heterogeneous expectations about the future price that stem from information. Pure information imbalance about the assetâs value.
âŻInformation taxonomy đŹ
Speaking of information market participants base their trades on, it can be classified in a binary way:
Public information: asset valuation moves without trade due to public announcements (press releases, macro data, earnings, etc.), and there is no internal disagreement.
Private information: only some traders possess it, and they reveal it through their trading activities.
Insider info (can be illegal depending on the case! Defo illegal in TradFi in most jurisdictions)
Academic alpha: more knowledge and better tools to convert public information into private.
Famaâs Efficient Market Hypothesis
Letâs define three tiers of price efficiency weâll be referring to later:
Weak: The price reflects historic price information.
Semi-strong: all publicly available info.
Strong form: all public and private info.
Fama (1970) argued that, in equilibrium, prices should reflect all available information (3. Strong form).
Realâlife frictions generate three famous counterâarguments:
Noâtrade theorem (Milgrom & Stokey, 1982): if everyone is rational and riskâneutral, private information alone should never induce trade.
GrossmanâStiglitz paradox (1980): if prices already embed everyoneâs private info, nobody will pay the cost of acquiring it.
Excess volatility: price jumps too large to be justified by public news flow alone.
Information â Price transformation unclear: EMH doesnât explain how information is reflected in the prices.
EMH overall is somewhat VERY questionable!
But itâs still just a model, and like any model, it works under specific conditions, so it would be incorrect to dismiss it outright.
In reality, market making and arbitrage, which are the core areas of algotraderâs interest, rely on EMH-like thinking - they assume price discrepancies between related instruments should converge quickly.
The central paradox with EMH is EMH itself, which is simultaneously foundational and frequently violated in practice.
âŻAsset value vs price đŹ
Now letâs write what we talked about in math.
Let
Ωt â the public information set (the âmarketâs knowledgeââ) at time t.
I(t+1)â â new public info arriving in [t,t+1] so that
We distinguish price pt (what you actually pay) from market value ÎŒ(t) (consensus estimate of âtrueââ worth).
Two common approaches to defining the market value (not price!) of an asset:
Discounted cashâflow value
Thatâs just an expectation of the future cash flow the asset gives, where c(s) future cash flows, ÎŽâ(0,1] is a discount factor.
âŻâŻFundamental (stateâprice) value
ÎŒ(t) is the market makersâ estimate of the securityâs value v as of time t, and Ωt denotes the information available to them at that time. v is the assetâs underlying fundamental payoff (could be liquidation value, longârun dividend sum, etc.).
Informational efficiency in equations đŹ
Assume semiâstrong efficiency (price equals to value estimate equals to value expectation given the information available).
At every instant, the traded price is the marketâs best public estimate of fundamental value.
ptââ= transaction price (last trade or midâquote).
ÎŒtââ= âmarket valueâââshorthand for the conditional expectation.
vâ= fundamental payoff (liquidation value, discounted cashâflow, âŠ).
Ωtââ= all public information known just before time t
âŻGiven everything the crowd collectively knows at t, no other unbiased estimate of v beats the price; if it did, arbitrageurs would trade until the two match.
Valuation innovation
When new info arrives (If tomorrowâs earnings come in better than expected, Ï”(t+1)>0; if the CEO resigns unexpectedly, Ï”(t+1)<0):
News has zero predictable mean. Conditional on todayâs info, the expected size of tomorrowâs shock is zero:
Why? Because conditional expectations are towerâproperty martingales. Just apply the âtower propertyâ, which is a law of integrated expectations:
So no part of tomorrowâs value change is forecastable using information that is already common knowledge today.
Further, for any two different dates s â t
â Innovations are serially uncorrelated. Which means yesterdayâs surprise tells you nothing about todayâs. If it did, yesterdayâs information wouldnât have been fully incorporated, contradicting efficiency
Price innovation equals value innovation
Because p(t)=ÎŒ(t), the same Ï”(t+1) â drives the price:
Taking the conditional expectation again:
â Under informational efficiency, the price process is a martingale.
Add risk aversion and you obtain a âfairâgameââ plus permanent impact framework Ă la Kyle.
A martingale is a process whose next expected value equals the current one, given all available information.
đ ïž If prices are martingales, you cannot design a strategy that forecasts the direction of the next price move using only public dataâedge must come from
superior processing of that data,
private signals, or
supplying liquidity rather than predicting prices.
Sounds reasonable, doesnât it?
âŻLiquidityâŻCostâŻToolkit đ ïž
Now, on this third lecture, letâs sum up a headâfirst catalogue of the priceâbased liquidity measures you will reach for in production.
Pairwise BidâAsk SpreadâŻEstimators
Roll (1984)
Core idea
Use the negative firstâorder autocovariance of price changes to back out the effective spread.Formula
\( \widehat{s} = 2\sqrt{-\operatorname{Cov}(\Delta p_t,\Delta p_{t-1})}\)When it shinesâ
Tickâbyâtick data with reliable sequencing, no need for quotes.Caveatâ
Breaks down when quote revisions are frequent or trade classification is noisy - a typical crypto case.
CorwinâSchultz (2012)
Core ideaâ
Highâlow price range over two overlapping days proxies the spread.Why traders like itâ
Works on daily barsâhandy when you lack highâfreq prints.Watch outâ
Overnight gaps inflate the range; adjust or pair with AbdiâRanaldo.
AbdiâRanaldo (2017)
Enhancementâ
Separates intraâday and overnight volatility to refine CorwinâSchultz.Sweet spotâ
Assets that close each day with sizeable news risk.
ImpactâBased Measures
Kyleâs⯠λ
Modelâ
In Kyleâs auction, price change is linear in signed orderflow\(\Delta p_t = \lambda\,q_t + \eta_t\)where q(t) is the net trade size.
Practical readâoutâ
λ captures permanent impact per share/contract.Use caseâ
Estimate with intraday regression, then size trades so that λQ stays below your risk budget.
SquareâRoot Impact (Empirical law)
Rule of thumb
\(\Delta p \;\approx\; \sigma \sqrt{\tfrac{Q}{V}}\)where Q is your metaâorder size and V the dayâs volume.
Good forâQuick whatâif checks when pitching trade sizes to PMs.
Limitation:âPurely empirical; the coefficient hides regime shifts.
ExecutionâCost Decomposition
âŻImplementation Shortfall (IS)
Definitionâ
(IS) = benchmark price â your average execution price.Decomposes into
Delay cost â waiting to start.
Impact cost â you moved the market.
Opportunity cost â child orders left unfilled.
âŻRealised Spread
Idea
Quote halfâspread earned minus adverse selection.Howâ
Compare execution price to midâprice a short time later (e.g., +1âŻmin).Signalâ
High realised spread â you provide liquidity without getting picked off.
BenchmarkâDeviation Metrics
VWAPâŻ&âŻSlippage
VWAPâŻ(VolumeâWeighted Average Price) is the crowdâs yardâstick.
SlippageâŻ= |your execution â VWAP|.
Use itâŻto tune TWAP/VWAP algos and report to clients who think in benchmarks.
LowâFrequency IlliquidityâŻProxy
âŻAmihudâsâŻIlliquidity (2002)
Statistic
\(I_t \;=\; \frac{|r_t|}{\text{VOL}_t}\)daily absolute return divided by dollar volume.
Interpretationâ
âHow much price move for one dollar traded?ââGreat forâ
Crossâsectional screens when only daily data are available.
âŻProduction pointers đ ïž
Avoid lookâahead bias â use only Ωt when computing any statistic at t.
Volumeâscaling â normalise impact by daily volume to compare across assets.
Highâfreq data quality â misâstamped trades will break serialâcovariance estimators like Roll; clean aggressively.
Marketâmaking models â the martingale property is a baseline; any predictable drift you discover is potential edge, but will shrink once you trade on it (GS paradox in action) - weâll talk about that later.
Next lecture: we switch from theory to action â calibrating a simple dealer marketâmaking model and stressâtesting it on tick data.
Happy coding & good hunting, my dear reader!