References

Published

2025-11-29

(Note to ADA511 students: in the course’s Canvas page) you can find further information on how to access these references.

Believe nothing, O monks, merely because you have been told it, or because it is traditional, or because you yourselves have imagined it. Do not believe what your teacher tells you merely out of respect for the teacher. (Attributed to Gautama Buddha)

But in the natural sciences, whose conclusions are true and necessary and have nothing to do with human will, one must take care not to place oneself in the defense of error; for here a thousand Demostheneses and a thousand Aristotles would be left in the lurch by every mediocre wit who happened to hit upon the truth for himself. (Galileo Galilei)

The notions, ideas, and rules that you have learned in this course have been presented in such a way as to appear plausible and intuitively understandable. In some places we gave sketch of proofs.

But that is not enough.

A “data mechanic” (see preface) might be excused for using incorrect formulae, and might simply say “this is the procedure I was taught”. You instead, as a data engineer and data scientist, have the duty to check the validity of the theory and principles that you use in developing new algorithms, code, solutions.

In particular, you cannot accept theories or methods simply because

they are commonly used, or used by the majority of some community;
some “authority” or known scientist says they are correct.

In fact, Science began when the two criteria above were discarded as not valid. Galileo’s quote above says this very explicitly. Imagine if Einstein had said “all scientists see no problem with the notion of simultaneity, so it must be correct”, or “great scientists like Maxwell or Poincaré did not see any problem with the notion of simultaneity, so it must be correct”. There is no scientific progress with this kind of reasoning.

Instead, the only two scientific criteria you have to decide on the validity of a method or theory are

logical proof,
experimental corroboration.

which you must do as much as possible by yourself. The more verification you delegate to others, to majority or “authority”, the less you are doing science.

For this reason you have, at some point, go and check for yourself the validity of what you’ve learned in this course. You might in fact find out that something was not correct! Then you’ll correct it and make science advance. Throughout the course We have given references where many proofs can be found. Here are some final references containing the main proofs of what you have learned here; you should check and validate them at some point.

R. A. Briggs (2014/2019): Normative Theories of Rational Choice: Expected Utility.
M. Ben-Ari (1993/2012): Mathematical Logic for Computer Science.
T. M. Cover, J. A. Thomas (1991/2006): Elements of Information Theory.
R. T. Cox (1946): Probability, Frequency and Reasonable Expectation.
M. Cox, A. O’Hagan (2022): Meaningful expression of uncertainty in measurement.
R. M. Dawes, T. L. Smith (1985): Attitude and opinion measurement, pp. 509–566 in G. Lindzey, E. Aronson: Handbook of Social Psychology. Vol. I: Theory and Method.
P. Diaconis, S. Holmes, R. Montgomery (2007): Dynamical Bias in the Coin Toss.
C. Drummond, R. C. Holte (2005): Severe Class Imbalance: Why Better Algorithms Aren’t the Answer.
K. Dyrland, A. S. Lundervold, P.G.L. Porta Mana (2022): Does the evaluation stand up to evaluation?: A first-principle approach to the evaluation of classifiers.
E. Eells (1982/2016): Rational Decision and Causality.
N. Fenton, M. Neil (2019): Risk Assessment and Decision Analysis with Bayesian Networks.
G. Galilei (1632/1967): Dialogue concerning the two chief world systems – Ptolemaic & Copernican
S. J. Gould (1985/2013): The Median Isn’t the Message.
P. C. Gregory (2005): Bayesian Logical Data Analysis for the Physical Sciences.
T. Hailperin (1965): Best Possible Inequalities for the Probability of a Logical Function of Events.
T. Hailperin (1996): Sentential Probability Logic: Origins, Development, Current Status, and Technical Applications.
R. Hastie, R. M. Dawes (2001/2010): Rational Choice in an Uncertain World: The Psychology of Judgment and Decision Making.
D. Heath, W. Sudderth (1976): De Finetti’s Theorem on Exchangeable Variables.
M. R. A. Huth, M. D. Ryan (2000/2004): Logic in Computer Science.
M. Ingham (2012): No More Band-Aids: Integrating FM into the Onboard Execution Architecture.
E. T. Jaynes (1994/2003): Probability Theory: The Logic of Science.
R. L. Keeney, H. Raiffa (1976/1993): Decisions with Multiple Objectives: Preferences and Value Tradeoffs.
W. Kruskal, F. Mosteller (1979): Representative Sampling, I: Non-scientific Literature.
D. V. Lindley (1971/1988): Making Decisions.
D. V. Lindley, M. R. Novick (1981): The role of exchangeability in inference.
D. J. C. MacKay (1992): A practical Bayesian framework for backpropagation networks.
D. J. C. MacKay (1995/2005): Information Theory, Inference, and Learning Algorithms.
G. Malinas, J. Bigelow (2004/2016): Simpson’s paradox.
C. E. Metz (1978): Basic principles of ROC analysis.
D. G. Morrison (1967): On the consistency of preferences in Allais’ paradox.
A. O’Hagan (1988): Probability: Methods and measurement.
M. Ono, A. Nicholas, F. Alibay, J. Parrish (2015): SMART: A propositional logic-based trade analysis and risk assessment tool for a complex mission.
F. J. Pelletier, A. Hazen (2021/2023): Natural Deduction Systems in Logic.
J. W. Pratt, H. Raiffa, R. Schlaifer (1995/1996): Introduction to Statistical Decision Theory.
H. Raiffa (1968/1970): Decision Analysis: Introductory Lectures on Choices under Uncertainty.
S. J. Russell, P. Norvig (1995/2022): Artificial Intelligence: A Modern Approach.
D. S. Sivia (1996/2006): Data Analysis: A Bayesian Tutorial.
H. C. Sox, M. C. Higgins, D. K. Owens (1988/2013): Medical Decision Making.
K. Steele, H. O. Stef{'a}nsson (2015/2020): Decision Theory.
J. A. Swets, W. P. Tanner, Jr., T. G. Birdsall (1961): Decision processes in perception.
G. Takeuti (1975/1987): Proof Theory.
B. C. Williams, M. D. Ingham, S. H. Chung, P. H. Elliott (2003): Model-based programming of intelligent embedded systems and robotic space explorers.
O. Zekri, A. Odonnat, A. Benechehab, L. Bleistein, N. Boullé, I. Redko (2024): Large language models as Markov chains.