docs: add scoring docs

BioPhoton · BioPhoton · commit 7dc7f7db79de · 2025-07-10T21:36:04.000+02:00
diff --git a/packages/cli/docs/scoring.md b/packages/cli/docs/scoring.md
@@ -0,0 +1,288 @@
+# Scoring
+
+**Scoring strategies** are formulas that convert a raw metric into a normalized score (typically in the 0-1
+range) showing how the metric performs against a target or budget. They are useful in software development
+wherever you track and visualize metrics such as bundle size, test coverage, API latency, or build time
+against established thresholds or baselines. The strategies differ in shape: some map the metric to the score
+linearly, while others add soft caps, logarithmic scaling, or discrete bands.
+
+In the formulas below, **S** is the measured value, **M** is the maximum (the budget or threshold), and **k**
+is a tuning constant.
+
+## Value Scoring Strategies
+
+### Overview Table
+
+| **Strategy Name**     | **Formula**                                                                                                                            | **Description**                                                                |
+| --------------------- | -------------------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------ |
+| **Percent Used**      | $$\text{score}= \max\!\left(0,\,1 - \frac{S}{M}\right)$$                                                                               | Linearly falls from 1→0 as **S** goes 0→**M**, then clamps at 0.               |
+| **Linear Overshoot**  | $$\text{score}= \begin{cases}1,&S\le M\\[4pt]1 - \dfrac{S - M}{M},&S>M\end{cases}$$                                                    | Holds 1 until **S**=**M**, then falls linearly to 0 by **S**=2·**M**.          |
+| **Relative Baseline** | $$\text{score}= \mathrm{clamp}_{0..1}\!\Bigl(0.5 + \frac{\text{baseline} - S}{2\,\text{baseline}}\Bigr)$$                              | Centers at 0.5 when **S** matches baseline; >0.5 improvement, <0.5 regression. |
+| **Sigmoid Soft-Cap**  | $$\text{score}= \frac{1}{1 + e^{\,k\,(S - M)}}$$                                                                                       | Near 1 well below **M**, exactly 0.5 at **S**=**M**, near 0 well above it.     |
+| **Logarithmic Decay** | $$\text{score}= \max\!\Bigl(0,\,1 - \frac{\log_{10}(1+S)}{\log_{10}(1 + M\,k)}\Bigr)$$                                                 | Drops fastest at small **S**, then flattens; hits 0 at **S**=**M**·k.          |
+| **Tiered Grading**    | $$\text{score}= \begin{cases}1,&P\ge0.9\\[2pt]0.75,&P\ge0.75\\[2pt]0.5,&P\ge0.5\\[2pt]0,&\text{otherwise}\end{cases},\;P=\frac{S}{M}$$ | Maps **S/M** into A/B/C/F bands for clear grade visualization.                 |
+
+---
+
+### Percent Used Details
+
+Directly reflects remaining budget fraction.
+
+**Pros:**
+
+- Intuitive (% headroom).
+- Simple to compute.
+
+**Cons:**
+
+- Flatlines at 0 once budget is exceeded.
+
+**Implementation:**
+
+```ts
+function percentUsed(current: number, max: number): number {
+  return Math.max(0, 1 - current / max);
+}
+```
+
+**When to use it:**  
+Ideal for any strict size or time budget (bundle size, build time) where you care about how much headroom remains. Simple dashboards and alerts benefit from a direct “% remaining” metric.
+
+**Examples:**
+
+- **Bundle Stats:** Track JS bundle headroom under a 500 KB limit.
+- **Test Coverage:** % lines covered vs 80 % threshold.
+- **Page Load (LCP):** Remaining ms under a 2 s budget.
+
+**Chart:**
+
+```mermaid
+xychart-beta
+    title "Percent Used (M=10)"
+    x-axis [0,1,2,3,4,5,6,7,8,9,10]
+    y-axis "Score" 0 --> 1
+    line Percent [1,0.9,0.8,0.7,0.6,0.5,0.4,0.3,0.2,0.1,0]
+```
+
+---
+
+### Linear Overshoot Details
+
+Penalty grows linearly with overshoot.
+
+**Pros:**
+
+- Proportional and predictable.
+- Easy to explain to stakeholders.
+
+**Cons:**
+
+- Can go negative if not clamped.
+
+**Implementation:**
+
+```ts
+function linearOvershoot(current: number, max: number): number {
+  if (current <= max) {
+    return 1;
+  }
+  return Math.max(0, 1 - (current - max) / max);
+}
+```
+
+**When to use it:**  
+When you want to maintain proportional visibility into how far a metric has exceeded its limit (e.g., memory usage, API latency). Works well if you clamp at 0 to avoid negatives.
+
+**Examples:**
+
+- **API Latency:** Overshoot above 200 ms threshold.
+- **Memory Usage:** RAM above configured max.
+- **CSS Sizes:** Style sheets above a 50 KB cap.
+
+**Chart:**
+
+```mermaid
+xychart-beta
+    title "Linear Overshoot (M=10)"
+    x-axis [0,2,4,6,8,10,12,14,16,18,20]
+    y-axis "Score" 0 --> 1
+    line Linear [1,1,1,1,1,1,0.8,0.6,0.4,0.2,0]
+```
+
+---
+
+### Relative Baseline Details
+
+Compares current value to a historical baseline.
+
+**Pros:**
+
+- Symmetric: 0.5 = “no change.”
+- Highlights improvements/regressions.
+
+**Cons:**
+
+- Requires reliable baseline data.
+
+**Implementation:**
+
+```ts
+function relativeBaseline(current: number, baseline: number): number {
+  const score = 0.5 + (baseline - current) / (2 * baseline);
+  return Math.max(0, Math.min(1, score));
+}
+```
+
+**When to use it:**  
+Perfect for tracking trends across releases (bundle size, test performance, code complexity). Shifts focus from absolute budgets to continuous improvement.
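
To make the 0.5 midpoint concrete, here is a small worked example. It repeats the `relativeBaseline` formula from the implementation above so that it runs on its own. The 500 KB baseline and the sample sizes are made-up illustration values, not measurements.

```ts
// Same formula as relativeBaseline above, repeated so the snippet runs on its own.
function relativeBaseline(current: number, baseline: number): number {
  const score = 0.5 + (baseline - current) / (2 * baseline);
  return Math.max(0, Math.min(1, score));
}

// Hypothetical bundle sizes in KB, compared against a hypothetical 500 KB baseline.
const baselineKb = 500;
const samplesKb = [250, 450, 500, 550, 1000];

for (const sizeKb of samplesKb) {
  const score = relativeBaseline(sizeKb, baselineKb);
  console.log(`${sizeKb} KB -> ${score.toFixed(2)}`);
}
// 250 KB -> 0.75   (half the baseline)
// 450 KB -> 0.55   (10 % smaller)
// 500 KB -> 0.50   (no change)
// 550 KB -> 0.45   (10 % larger)
// 1000 KB -> 0.00  (double the baseline, clamped at 0)
```

Under this formula the score only reaches 1 when the value drops to 0, and it is already clamped to 0 once the value is double the baseline.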
+
+**Examples:**
+
+- **Bundle Stats:** Compare today’s bundle size vs last release.
+- **Test Duration:** Measure speedup/regression of test suite.
+- **Doc Coverage:** Track improvements in Storybook coverage.
+
+**Chart:**
+
+```mermaid
+xychart-beta
+    title "Relative Baseline (baseline=10)"
+    x-axis [0,1,2,3,4,5,6,7,8,9,10]
+    y-axis "Score" 0 --> 1
+    line Delta [1,0.95,0.9,0.85,0.8,0.75,0.7,0.65,0.6,0.55,0.5]
+```
+
+---
+
+### Sigmoid Soft-Cap Details
+
+Soft threshold with tunable steepness.
+
+**Pros:**
+
+- Smooth transition around the cap instead of a hard cutoff.
+- Never negative or >1.
+
+**Cons:**
+
+- Requires selecting a constant **k**.
+- Less intuitive without plotting.
+
+**Implementation:**
+
+```ts
+function sigmoidSoftCap(current: number, max: number, k: number = 0.5): number {
+  return 1 / (1 + Math.exp(k * (current - max)));
+}
+```
+
+**Parameter Details:**
+
+- **k** (steepness factor): Controls how quickly the score moves from 1 to 0 around the threshold. The score is 0.5 at **S**=**M** for every **k**.
+  - `k = 0.1`: Very gentle decline, scores stay close to 0.5 even with significant overshoot
+  - `k = 0.5`: Moderate decline (default), balanced between tolerance and penalty
+  - `k = 1.0`: Steep decline, harsh penalty for any overshoot
+  - `k = 2.0`: Very steep decline, almost binary behavior near the threshold
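
As a rough illustration of how **k** changes the result, the sketch below evaluates the same formula at the threshold and two units above it. The function body is copied from the implementation above; the budget of 5 and the overshoot of 2 are arbitrary illustration values.

```ts
// Same formula as sigmoidSoftCap above, repeated so the snippet runs on its own.
function sigmoidSoftCap(current: number, max: number, k: number = 0.5): number {
  return 1 / (1 + Math.exp(k * (current - max)));
}

// Hypothetical budget of 5 lint errors: compare 5 (on budget) with 7 (2 over).
const lintBudget = 5;
for (const k of [0.1, 0.5, 1.0, 2.0]) {
  const atCap = sigmoidSoftCap(lintBudget, lintBudget, k);
  const twoOver = sigmoidSoftCap(lintBudget + 2, lintBudget, k);
  console.log(`k=${k}: at cap ${atCap.toFixed(2)}, 2 over ${twoOver.toFixed(2)}`);
}
// k=0.1: at cap 0.50, 2 over 0.45
// k=0.5: at cap 0.50, 2 over 0.27
// k=1: at cap 0.50, 2 over 0.12
// k=2: at cap 0.50, 2 over 0.02
```

Because **k** multiplies the raw difference `current - max`, a sensible value depends on the unit of the metric: a **k** that suits error counts is likely far too steep for sizes measured in bytes.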
+
+**When to use it:**  
+Use when minor breaches are acceptable but you want strong pressure against large overruns (e.g. accessibility violations, deprecation warnings).
+
+**Examples:**
+
+- **ESLint Errors:** Small numbers of lint errors tolerated.
+- **TS Diagnostics:** Minor type mismatches vs critical errors.
+- **Security Audits:** A few low-severity alerts are OK; many are not.
+
+**Chart:**
+
+```mermaid
+xychart-beta
+    title "Sigmoid Soft-Cap (M=5, k=0.5)"
+    x-axis [0,1,2,3,4,5,6,7,8,9,10]
+    y-axis "Score" 0 --> 1
+    line Sigmoid [0.92,0.88,0.82,0.73,0.62,0.50,0.38,0.27,0.18,0.12,0.08]
+```
+
+---
+
+### Logarithmic Decay Details
+
+Log-scale penalty for wide-range data.
+
+**Pros:**
+
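+- Handles values that span orders of magnitude.
+- Large values are compressed instead of dominating the scale.
+
+**Cons:**
+
+- Formula less intuitive.
+- Score drops fastest at small **S**, so values well under budget already lose points.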
+
+**Implementation:**
+
+```ts
+function logarithmicDecay(current: number, max: number, k: number = 2): number {
+  const score = 1 - Math.log10(1 + current) / Math.log10(1 + max * k);
+  return Math.max(0, score);
+}
+```
+
+**Parameter Details:**
+
+- **k** (scaling factor): Sets where the score reaches 0, namely at **S**=**M**·**k**. A larger **k** stretches the curve and raises every score.
+  - `k = 1`: Score reaches 0 exactly at the budget (**S**=**M**)
+  - `k = 2`: Score reaches 0 at twice the budget (default)
+  - `k = 5`: Score reaches 0 at five times the budget, tolerant of moderate overruns
+  - `k = 10`: Score reaches 0 at ten times the budget, very tolerant of overruns
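
The effect of **k** can be checked numerically. The sketch below copies the formula from the implementation above and evaluates it at the budget and at `max * k`; the budget of 5 is an arbitrary illustration value.

```ts
// Same formula as logarithmicDecay above, repeated so the snippet runs on its own.
function logarithmicDecay(current: number, max: number, k: number = 2): number {
  const score = 1 - Math.log10(1 + current) / Math.log10(1 + max * k);
  return Math.max(0, score);
}

// Hypothetical budget M = 5: evaluate at S = M and at S = M * k for each k.
const budget = 5;
for (const k of [1, 2, 5, 10]) {
  const atBudget = logarithmicDecay(budget, budget, k);
  const atZeroPoint = logarithmicDecay(budget * k, budget, k);
  console.log(`k=${k}: S=M -> ${atBudget.toFixed(2)}, S=M*k -> ${atZeroPoint.toFixed(2)}`);
}
// k=1: S=M -> 0.00, S=M*k -> 0.00
// k=2: S=M -> 0.25, S=M*k -> 0.00
// k=5: S=M -> 0.45, S=M*k -> 0.00
// k=10: S=M -> 0.54, S=M*k -> 0.00
```

Even with a generous **k**, a value that sits exactly on budget scores well below 1, which is worth keeping in mind when choosing alert thresholds on top of this score.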
+
+**When to use it:**  
+Ideal for metrics spanning orders of magnitude (e.g. network RTT, bundle gzipped vs uncompressed sizes).
+
+**Examples:**
+
+- **CWV (FID):** First Input Delay distribution.
+- **Bundle Stats:** Large vs small chunks on log scale.
+- **API Throttling:** High‐tail latency events.
+
+**Chart:**
+
+```mermaid
+xychart-beta
+    title "Logarithmic Decay (M=5, k=2)"
+    x-axis [0,1,2,3,4,5,6,7,8,9,10]
+    y-axis "Score" 0 --> 1
+    line Log [1,0.71,0.54,0.42,0.33,0.25,0.19,0.13,0.08,0.04,0]
+```
+
+---
+
+### Tiered Grading Details
+
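+Discrete grade bands (A/B/C/F). Unlike the strategies above, a higher ratio **S**/**M** earns a higher score, so it suits metrics where more is better.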
+
+**Pros:**
+
+- Very readable.
+- No continuous math needed for viewers.
+
+**Cons:**
+
+- Abrupt jumps hide nuance.
+
+**Implementation:**
+
+```ts
+function tieredGrading(current: number, max: number): number {
+  const ratio = current / max;
+  if (ratio >= 0.9) return 1;
+  if (ratio >= 0.75) return 0.75;
+  if (ratio >= 0.5) return 0.5;
+  return 0;
+}
+```
+
+**When to use it:**  
+Best for executive dashboards or compliance checks where a letter grade is preferred over a percentage.
+
+**Examples:**
+
+- **Test Coverage:** A: ≥90 %, B: 75 – 89 %, C: 50 – 74 %, F: <50 %.
+- **Doc Coverage:** Storybook or JSDoc rates.
+- **Security Score:** CVSS banding.
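
The function above returns numbers rather than letters, so a dashboard needs one more step to show grades. The sketch below is one way to do that: `toLetterGrade` and the `LetterGrade` type are hypothetical helpers introduced here for illustration, and the coverage percentages are made up.

```ts
// Same logic as tieredGrading above, repeated so the snippet runs on its own.
function tieredGrading(current: number, max: number): number {
  const ratio = current / max;
  if (ratio >= 0.9) return 1;
  if (ratio >= 0.75) return 0.75;
  if (ratio >= 0.5) return 0.5;
  return 0;
}

type LetterGrade = 'A' | 'B' | 'C' | 'F';

// Hypothetical helper: maps the discrete score back to its letter band.
function toLetterGrade(score: number): LetterGrade {
  if (score >= 1) return 'A';
  if (score >= 0.75) return 'B';
  if (score >= 0.5) return 'C';
  return 'F';
}

// Hypothetical coverage percentages, graded against max = 100.
for (const coverage of [95, 80, 60, 30]) {
  console.log(`${coverage} % -> ${toLetterGrade(tieredGrading(coverage, 100))}`);
}
// 95 % -> A
// 80 % -> B
// 60 % -> C
// 30 % -> F
```

Keeping the numeric score and the letter separate lets the same score feed both charts and grade badges.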
+
+**Chart:**
+
+```mermaid
+xychart-beta
+    title "Tiered Grading (M=10)"
+    x-axis [0,1,2,3,4,5,6,7,8,9,10]
+    y-axis "Score" 0 --> 1
+    line Grade [0,0,0,0,0,0.5,0.5,0.5,0.75,1,1]
+```