Dr.Jiw: กุมภาพันธ์ 2023

วันเสาร์ที่ 25 กุมภาพันธ์ พ.ศ. 2566

AUC vs F1 in Imbalanced data

เพราะ imbalanced data คือมี #TP ต่างจาก #TN มาก

F1 ไม่ดู TN แต่ AUC คิด TN ด้วย ดังนั้นควรใช้ F1 เพื่อเลี่ยง TN ที่น้อยไปจนถ้าเอามาพิจารณาร่วมด้วยผลลัพธ์จะเพี้ยน

Geometric Mean of TPR and TNR

Used in binary classification

G-mean = Sqrt of TPR*TNR

AUC is an error or performance metric very useful for replacing accuracy in binary classification with strong class imbalance. https://thedigitalskye.com/2021/04/19/6-useful-metrics-to-evaluate-binary-classification-models/

Geometric mean in general isn’t an error or performance metric, but is just an alternative to an arithmetic mean that’s robust to different normalization schemes.

Metrics in binary classification https://neptune.ai/blog/evaluation-metrics-binary-classification

วันศุกร์ที่ 24 กุมภาพันธ์ พ.ศ. 2566

Metric in medical testing

ควรวัดค่า negative likelihood ratio (LR-) = fnr/tnr ด้วย เพื่อให้ความสำคัญกับค่า False negative rate i.e. Type II error besides Type I error (FPR)

cf.https://medium.com/@saurabhdhandeblog/confusion-matrix-explained-calculating-accuracy-tpr-fpr-tnr-precision-and-prevalence-87557fe8714d

วันอังคารที่ 21 กุมภาพันธ์ พ.ศ. 2566

Sensitivity and specificity

https://en.m.wikipedia.org/wiki/Sensitivity_and_specificity

Expectation values and Variance and Covariance matrix

Variance is the expectation of the squared deviation of a random variable from its population mean or sample mean. Variance is a measure of dispersion, meaning it is a measure of how far a set of numbers is spread out from their average value.

The expected value should be regarded as the average value. When X is a discrete random variable, then the expected value of X is precisely the mean of the corresponding data. The variance should be regarded as (something like) the average of the difference of the actual values from the average.

https://math.berkeley.edu/~scanlon/m16bs04/ln/16b2lec30.pdf

Covariance matrix is a square matrix that displays the variance exhibited by elements of each of datasets and the covariance between a pair of datasets.

https://www.cuemath.com/algebra/covariance-matrix/