Data mining, computer-aided molecular modeling, descriptor calculation and multiple linear regression techniques were utilized to produce statistically significant and predictive models for O/W and W/O microemulsions. The literature was scanned over the last 20 years, subsequently, 68 phase diagrams from eight different references were collected. Molecular modeling techniques were then applied on the components of the microemulsion systems to generate plausible 3-D structures. Subsequently, various physicochemical descriptors were calculated based on the resulting 3-D structures. The generated descriptors were correlated with microemulsion existence areas utilizing multiple linear regression analysis (MLR). The generated models were statistically cross-validated and were found to be of significant predictive power. Furthermore, the resulting models allowed better understanding of the process of microemulsion formation.