Computer-aided diagnosis has potential in improving radiologists' diagnosis, and presentation of similar images as a reference may provide additional useful information for distinction between benign and malignant lesions. In this study, we evaluated the usefulness of presentation of reference images in observer performance studies and compared the results obtained by groups of observers practicing in the United States and Japan. The results showed that the presentation of the reference images was generally effective for both groups, as the areas under the receiver operating characteristic curves improved from 0.915 to 0.924 for the group in the US and from 0.913 to 0.925 for the group in Japan, although the differences were marginally (p = 0.047) and not (p = 0.13) statistically significant, respectively. There was a slight difference between the two groups in the way that the observers reacted to some benign cases, which might be due to differences in the population of screenees and in the socio-clinical environment. In the future, it may be worthwhile to investigate the development of a customized system for physicians in different socio-clinical environments.