How ABX testing works
The protocol
ABX is the gold standard for testing whether you can reliably hear a difference between two sounds. You get two known references (A and B) and an unknown X, which is randomly assigned to be A or B each round. Audition each one as many times as you like, then commit to an answer: is X the same as A, or the same as B?
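The round structure can be sketched in a few lines of Python. This is only an illustration of the protocol, not this tool's actual code: the names run_abx_session and answer_fn are hypothetical, and answer_fn stands in for the listener, who auditions A, B, and X and then answers "A" or "B".

```python
import random

def run_abx_session(rounds, answer_fn, rng=None):
    """Run `rounds` ABX trials and return the number of correct answers.

    answer_fn(round_index) is a hypothetical stand-in for the listener;
    it must return "A" or "B" and never sees the hidden identity of X.
    """
    rng = rng or random.Random()
    hits = 0
    for round_index in range(rounds):
        # X is re-drawn every round with equal probability, independent of
        # earlier rounds, so a guessing listener is right half the time.
        x_identity = rng.choice(["A", "B"])
        guess = answer_fn(round_index)
        if guess not in ("A", "B"):
            raise ValueError("answer must be 'A' or 'B'")
        if guess == x_identity:
            hits += 1
    return hits

# A listener who hears no difference is effectively flipping a coin.
guesser = random.Random()
score = run_abx_session(10, lambda i: guesser.choice(["A", "B"]))
print(f"{score}/10 correct")
```

The important property is that X is drawn fresh each round and revealed only after the answer is committed, so the score reflects what was heard rather than what was expected.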
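Sample-accurate switching matters. In this tool the playback position carries over when you switch between A, B, and X, so you can swap mid-phrase. Echoic memory (the short-lived trace of what you just heard) lasts only about 3-4 seconds; if a switch reset the playhead, the trace you were comparing against would fade before playback reached the same passage again.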
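One way to picture position-preserving switching is a single sample index shared by all sources. The sketch below is an assumption about how such a player could be structured, not a description of this tool's internals: the class SharedPlayhead and its methods are hypothetical, and plain lists stand in for decoded audio buffers of equal length.

```python
class SharedPlayhead:
    """Hypothetical player: one sample position shared by every source."""

    def __init__(self, sources):
        # sources maps a label ("A", "B", "X") to a sequence of samples.
        lengths = {len(samples) for samples in sources.values()}
        if len(lengths) != 1:
            raise ValueError("all sources must have the same number of samples")
        self.sources = sources
        self.active = next(iter(sources))
        self.position = 0  # sample index, shared across sources

    def switch(self, label):
        if label not in self.sources:
            raise KeyError(label)
        # Only the active source changes; the position is left untouched,
        # so playback continues from the very next sample.
        self.active = label

    def read(self, count):
        """Return the next `count` samples of the active source."""
        samples = self.sources[self.active]
        chunk = samples[self.position:self.position + count]
        self.position += len(chunk)
        return chunk

a = list(range(0, 10))      # stand-in samples for A
b = list(range(100, 110))   # stand-in samples for B
player = SharedPlayhead({"A": a, "B": b, "X": b})

print(player.read(4))   # [0, 1, 2, 3]
player.switch("B")
print(player.read(3))   # [104, 105, 106]
```

Because switch() never touches the position, the comparison always happens on adjacent audio rather than on two separate playthroughs of the intro.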
The statistics
The p-value is the probability of scoring at least your hit count by pure guessing, where each round is an independent 50/50 chance (a one-sided binomial test). The conventional "this is real" threshold is p < 0.05.
For 10 rounds: 8 hits gives p ≈ 0.055 (borderline), 9 gives p ≈ 0.011 (significant), and 10 gives p ≈ 0.001 (highly significant). Longer tests need a higher absolute hit count but a lower percentage: 15/20 (75%) gives p ≈ 0.021, comfortably significant.
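The figures above can be reproduced with a short calculation. This is a minimal sketch using only the Python standard library; the function names abx_p_value and min_hits_for_significance are illustrative, and the sketch assumes a fair 50/50 chance per round.

```python
from math import comb

def abx_p_value(hits, rounds):
    """One-sided probability of at least `hits` correct out of `rounds`
    when every answer is a 50/50 guess."""
    if not 0 <= hits <= rounds:
        raise ValueError("hits must be between 0 and rounds")
    # Count the outcomes with `hits` or more correct answers, then divide
    # by the 2**rounds equally likely outcomes.
    favourable = sum(comb(rounds, k) for k in range(hits, rounds + 1))
    return favourable / 2 ** rounds

def min_hits_for_significance(rounds, alpha=0.05):
    """Smallest hit count with p < alpha, or None if even a perfect
    score cannot reach it (the test is too short)."""
    for hits in range(rounds + 1):
        if abx_p_value(hits, rounds) < alpha:
            return hits
    return None

print(round(abx_p_value(8, 10), 3))    # 0.055
print(round(abx_p_value(9, 10), 3))    # 0.011
print(round(abx_p_value(10, 10), 3))   # 0.001
print(round(abx_p_value(15, 20), 3))   # 0.021
print(min_hits_for_significance(10))   # 9
print(min_hits_for_significance(20))   # 15
```

The last two lines show the trade-off directly: 10 rounds require 9 hits (90%) to get under 0.05, while 20 rounds require 15 (75%). With 4 rounds or fewer, even a perfect score stays above 0.05.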