Mempool inclusion timings can meaningfully improve causal estimates of short-term price impact, but only when researchers account for important selection effects and infrastructure differences. Observing when a transaction first appears and when it is mined gives a higher-resolution view of market intent than block timestamps alone, yet that resolution is conditional on who sees the mempool and on how miners, relays, and bots interact with that visibility.
How mempool timings add causal leverage
Mempool timestamps expose the latency between broadcast and inclusion, which helps separate order arrival from execution and reduces endogeneity between executed trades and prevailing prices. Research by Philip Daian, Cornell University, and collaborators has shown that transaction propagation and ordering dynamics are central to measurable extraction of value in decentralized exchanges, implying mempool observations are informative about causal sequencing. Using those timings, econometric strategies such as event-time regressions and instrumental approaches can better isolate the effect of a specific transaction on subsequent price moves because the ordering is observed at sub-block granularity rather than inferred.
Important limitations and consequential biases
Improved timing does not automatically yield unbiased causal estimates. Selection bias arises because not all nodes observe the same mempool: some traders send transactions through private relays or directly to miners, and miner-controlled reorderings and censorship introduce endogenous inclusion delays. Work by Dan Robinson, Paradigm, and reporting from Flashbots indicates that private transaction submission and miner extractable value auctions materially change which transactions are visible to public observers. Those practices create unobserved confounders that, if ignored, lead to overstated or misattributed price impacts.
The consequences extend beyond measurement. Public mempool transparency can enable predatory strategies such as front-running and sandwich attacks, altering trader behavior and market design. Geographic and infrastructural disparities matter too: regions with concentrated mining pools or fewer relay operators exhibit different propagation patterns, affecting territorial fairness and environmental incentives for latency-reducing infrastructure.
When applied carefully, mempool inclusion timings are a powerful tool for causal inference in crypto markets, but responsible empirical work must combine those timings with robustness checks, transparency about data coverage, and sensitivity analyses to miner and relay behavior to avoid misleading conclusions.