Instrumental insemination of queen honeybees has become a routine technology in honeybee breeding (Laidlaw 1944). The technique overcomes the problems resulting from lack of rigorous mating control under natural conditions. The queen is typically inseminated with semen from 8 to 12 drones (Cobey et al. 2013). Often, bee breeders cross a queen with drones of a single ‘father’ lineage to achieve their selection goals. However, this process imposes extreme constraints on the genotypic composition of the colony. The queen naturally mates with many unrelated males resulting in genotypically much more diverse colonies than after instrumental insemination. High intra-colonial genotypic variance among the workers has been shown to be beneficial for colony efficiency (Tarpy and Seeley 2006; Mattila and Seeley 2007). As genotypically distinct workers specialize in different tasks, reduced genetic variance may negatively impact on division of labour in the colony as well (Lattorff and Moritz 2013). If the combination of specific genotypes results in the desired colony trait, it might be useful to inseminate a queen with a set of drones from different lineages instead of a single one. Workers of the various patrilines in such a multi-lineage colony might complement each other to release a more efficient colony phenotype.

In order to produce such ‘custom design’ colonies, it will however be necessary to reliably reproduce suitable sperm mixes to inseminate the queens. Using individual drones to inseminate a queen will be insufficient. Another queen will be inseminated with a set of semen from different drones. Hence, no single colony will have identical fathers unlike classical animal breeding, where a single siring male can be mated to many females. The replication of the insemination is not possible, which dilutes any assessment of breeding values. Here, we use a new technique for mixing of semen from a very large pool comprising thousands of drones (Brauße and van Praagh 2010) to test if these can be homogeneously mixed and used for replicate inseminations.

The semen of 2037 drones (204 ± 34 per colony) from 10 unrelated honeybee colonies was collected and mixed according to Brauße and van Praagh (2010) resulting in 960 μL mixed semen. Three virgin queens were instrumentally inseminated with 10 μL aliquots of the sperm mix and allowed to produce worker offspring. When the colonies were in full brood production, 32 pupae were sampled from each colony for genotyping. Additionally, semen of one drone from each of the 10 drone producing colonies was sampled for individual genotyping and lineage identification. All samples (sperm and brood) were stored in 70% ethanol till DNA extraction.

The sperm samples were centrifuged for 15 min at 20,800g, the pellets washed twice in 500 μL distilled water for 1 h, and dried at 37 °C. DNA was extracted following the DNeasy Blood & Tissue Kit (QIAGEN) protocol but with 40 μL 1% Proteinase K (instead of 20 μL) and incubation at 56 °C for 12 h. Two legs of each worker pupa were used for DNA extraction, using a modified Chelex protocol (Walsh et al. 1991) as described by Erler and Lattorff (2010). Genotyping details, using five highly polymorphic loci, are given in the Online Resource.

Maternal and paternal genotypes of every worker pupae could be unambiguously assigned by Mendelian inference. The siring drones were assigned to one of the 10 drone providing queens in the semen mix, based on the genotypic information from the single sperm samples (Online Resource Table SI, SII). Further details on allele identification in the sperm mix and statistics are given in the Online Resource.

The marker set used was sufficiently sensitive to address the problem with a non-detection error of 3.4 × 10−3 (Boomsma and Ratnieks 1996). None of the 10 lineages was significantly over- or under-represented neither in the sperm mix (Figure 1a; Kruskal-Wallis test H = 4.13, df = 9, P = 0.90) nor in the comparison offspring worker pupae vs. sperm mix (χ 2 = 14.23, df = 9, P = 0.11; power of 0.85) (Figure 1 a, b). The distribution of drone lineages among the offspring worker pupae did not significantly differ from an expected even distribution (χ 2 = 12.38, df = 9, P = 0.19; power of 0.87). Finally, there were no significant differences among the three analysed colonies regarding the drone lineage distribution (Fisher exact test n = 94, P = 0.97; power of 1.0) (Figure 1b).

Figure 1.
figure 1

a Distribution of drone families within the sperm mix (median with box 25–75% range, and whisker: min-max range, dotted grey line: expected even distribution). b Absolute number of pupae sired by each drone lineage in the three natal colonies (white: colony A (n total = 32), filled grey: colony B (n total = 32), filled black: colony C (n total = 30), dotted grey line: expected even distribution). None of the distributions (colony A-C) deviated significantly from an even distribution (Fisher’s exact tests, P > 0.05).

Neither the distribution of drone lineages within the sperm mix nor the distribution among the offspring worker pupae differed significantly from an even distribution. The repeatability of the technique among the three queens was remarkable with r = 0.45 (Boake 1989). We expect this to be stable over time as any patriline clustering of sperm after having been mixed seems biologically most unlikely (Page et al. 1984). Large volume sperm mixing may set the stage to implement novel breeding strategies. It allows for homogenous mixing of 3000 μL sperm (approx. 7000 drones), sufficient to inseminate up to 250 honeybee queens (van Praagh et al. 2014). The technique may set the stage to establish custom design colonies not just for honey production but also for other colony phenotypes of apicultural interest.