The availability of big data has the potential to transform many areas of the life sciences and usher in new ways of doing research. Here, I argue that big data biology also raises fundamental questions in the philosophy of science: for example, what is a good dataset, and how can reliable knowledge be extracted from big data? Collaborations between biologists, data scientists and philosophers of science will help us to answer these and other questions.