Informal description and motivation
I am comparing the output of two search engines. Each engine is searching over the same set of 2,000 documents and returning the top 20 hits. I'm trying to construct a hypothesis test where $H_0$ is the hypothesis that the search results are being randomly generated by both engines (i.e., that the results of one search engine are independent of the results of the other search engine).
My first strategy is to answer the question: If I select 20 numbers at random (and without replacement) from the integers 1 to 2,000 to form one hitlist, and repeat to form another hitlist of 20, how likely is it that they these two lists will share $0 \leq X\leq 20$ hits in common?
More formal, and general, description
Let $S_X:=\{1,2,...,N\},S_Y:=\{1,2,...,M\}$.
Let $X_k\subset S_X:|X_k|=k\leq N$ and $Y_j\subset S_Y:|Y_j|=j \leq M$
Finally, let $P(x\in X_1)=\frac{1}{N}\; \forall x \in S_X, \textrm{and}\; P(y \in Y_1)=\frac{1}{M}\;\forall y \in S_Y$, and, more generally:
$$P(x\in X_{k+1}|x \notin X_k) = \frac{1}{N-k} \; \forall x \notin X_k$$ and
$$P(y\in Y_{k+1}|x \notin Y_k) = \frac{1}{M-k} \; \forall y \notin Y_k$$
Question
Given $0\leq k \leq N$, $0 \leq j \leq M$, and $ 0 \leq c \leq \min \{k,j\}$, find $P(|X_k\cap Y_j|\leq c)$