Bit vector join; Bloom filter join; Bloom join; Hash filter join; Semijoin filter
Semijoin is a technique for processing a join between two tables that are stored at different sites. The basic idea is to reduce the transfer cost by first sending only the projected join column(s) to the other site, where it is joined with the second relation. Then, all matching tuples from the second relation are sent back to the first site to compute the final join result.
The semijoin technique was originally developed by Bernstein et al.  as part of the SDD-1 project as a reduction operator for distributed query processing. The idea of applying hash filtering was proposed by Babb  as well as by Valduriez  particularly for specialized hardware (content addressed file stores and distributed database machines respectively). The theory of semijoin-based distributed query processing was presented in . In  semijoins are also exploited for query...
- 6.Lu H, Carey M. Some experimental results on distributed join algorithms in a local network. In: Proceedings of the 11th International Conference on Very Large Data Bases; 1985. p. 229–304.Google Scholar
- 8.Özsu MT, Valduriez P. Principles of distributed database systems. 2nd ed. Prentice-Hall; 1999.Google Scholar
- 9.Valduriez P. Semi-join algorithms for distributed database machines. In: Schneider J-J, editor. Distributed data bases. Amsterdam: North-Holland; 1982. p. 23–37.Google Scholar