The HIV data sets used for predicting outcomes of HIV combination therapies suffer from several problems: the samples have different treatment backgrounds, the levels of therapy experience are unevenly represented, and the therapies themselves are unevenly represented. In addition, these data sets comprise only the viral strain(s) that can be detected in the patients’ blood serum.