⚡️ Speed up function `findDuplicateIndex` by 14% by codeflash-ai[bot] · Pull Request #31 · codeflash-ai/appsmith

codeflash-ai · 2026-01-31T09:37:29Z

📄 14% (0.14x) speedup for `findDuplicateIndex` in `app/client/src/workers/Evaluation/helpers.ts`

⏱️ Runtime : 1.50 milliseconds → 1.31 milliseconds (best of 250 runs)

📝 Explanation and details

The optimized code achieves a 14% runtime improvement through three key micro-optimizations that reduce per-iteration overhead:

What Changed:

Direct duplicate detection: Replaced size-tracking logic (_uniqSet.size > currSetSize) with immediate Set.has() check before insertion. This eliminates the need to track and compare set sizes on every iteration.
Cached JSON.stringify reference: Storing JSON.stringify in a local variable (stringify) avoids repeated property lookups on the global JSON object across iterations.
Cached array length: Using len = arr.length in the for-loop initialization eliminates redundant length property access on each iteration.

Why It's Faster:

The original implementation performed unnecessary work on every loop iteration:

Unconditionally added elements to the set even when duplicates existed
Compared set sizes after every insertion
Accessed JSON.stringify and arr.length properties repeatedly

The optimized version short-circuits immediately upon finding a duplicate (via Set.has()) and reduces property access overhead through caching. For duplicate detection, checking membership before insertion is more efficient than comparing sizes after insertion.

Performance Characteristics:

Based on the annotated tests, this optimization excels with:

Large arrays with all unique elements (1000 items): 62-168% faster due to reduced per-iteration overhead compounding over many iterations
Primitive value arrays: 15-138% faster on simple test cases
Early duplicate detection: Maintains fast early-exit behavior

Some complex object tests show minor regressions (up to 48% slower for deeply nested structures), likely due to the additional local variable overhead being more noticeable when JSON.stringify dominates runtime. However, the overall 14% runtime improvement demonstrates that the optimization benefits typical workloads, particularly those with larger arrays where the per-iteration savings accumulate significantly.

The optimization preserves exact functional behavior, including handling of edge cases (NaN, undefined, circular references, sparse arrays).

✅ Correctness verification report:

Test	Status
⚙️ Existing Unit Tests	🔘 None Found
🌀 Generated Regression Tests	✅ 47 Passed
⏪ Replay Tests	🔘 None Found
🔎 Concolic Coverage Tests	🔘 None Found
📊 Tests Coverage	89.2%

🌀 Click to see Generated Regression Tests

// @ts-nocheck
// imports
import { findDuplicateIndex } from '../src/workers/Evaluation/helpers';

// unit tests
describe('findDuplicateIndex', () => {
    // Basic Test Cases
    describe('Basic functionality', () => {
        test('should return -1 for all-unique primitive array', () => {
            // All elements unique -> should return -1
            const arr = [1, 2, 3, 4, 5];
            expect(findDuplicateIndex(arr)).toBe(-1);  // 2.61μs -> 2.27μs (15.1% faster)
        });

        test('should return index of first duplicate for primitives', () => {
            // The value 2 repeats at index 3 -> should return 3
            const arr = [1, 2, 3, 2, 5];
            expect(findDuplicateIndex(arr)).toBe(3);  // 1.82μs -> 1.73μs (4.90% faster)
        });

        test('should detect duplicate objects with identical content', () => {
            // Two distinct object references but same JSON representation:
            // second object should be detected as duplicate at index 1
            const obj1 = { a: 1, b: 2 };
            const obj2 = { a: 1, b: 2 };
            expect(findDuplicateIndex([obj1, obj2])).toBe(1);  // 1.64μs -> 3.15μs (48.0% slower)
        });

        test('should return index of first duplicate for arrays and nested objects', () => {
            // nested structures should be JSON.stringified and compared
            const nested1 = { x: [1, { y: 2 }] };
            const nested2 = { x: [1, { y: 2 }] };
            const arr = [nested1, { other: true }, nested2];
            // nested2 is same as nested1 in JSON -> duplicate at index 2
            expect(findDuplicateIndex(arr)).toBe(2);  // 2.39μs -> 4.23μs (43.5% slower)
        });
    });

    // Edge Test Cases
    describe('Edge cases', () => {
        test('should return -1 for empty array', () => {
            // No elements -> no duplicates -> -1
            expect(findDuplicateIndex([])).toBe(-1);  // 2.06μs -> 2.20μs (6.31% slower)
        });

        test('should return -1 for single-element array', () => {
            // Single element cannot have a duplicate -> -1
            expect(findDuplicateIndex([42])).toBe(-1);  // 981ns -> 1.95μs (49.6% slower)
        });

        test('should handle undefined and null distinctly and detect duplicate undefined', () => {
            // undefined and null have different JSON serialization behaviors:
            // JSON.stringify(undefined) -> undefined (Set will store undefined)
            // JSON.stringify(null) -> "null"
            // array: [undefined, null, undefined] -> duplicate undefined at index 2
            const arr = [undefined, null, undefined];
            expect(findDuplicateIndex(arr)).toBe(2);  // 1.51μs -> 2.56μs (41.0% slower)
        });

        test('NaN and null can collide due to JSON.stringify behavior', () => {
            // JSON.stringify(NaN) => "null", JSON.stringify(null) => "null"
            // So NaN and null will be considered identical by this function.
            const arr = [NaN, null];
            // second element (null) will be seen as duplicate of NaN -> index 1
            expect(findDuplicateIndex(arr)).toBe(1);  // 1.31μs -> 2.39μs (45.1% slower)
        });

        test('sparse arrays (holes) are treated as undefined entries and duplicates detected', () => {
            // new Array(3) -> [ <3 empty items> ]
            // each accessed slot yields undefined; duplicate on index 1
            const sparse = new Array(3);
            expect(findDuplicateIndex(sparse)).toBe(1);  // 1.19μs -> 1.15μs (2.95% faster)
        });

        test('circular objects should cause JSON.stringify to throw (function should propagate)', () => {
            // circular reference will make JSON.stringify throw a TypeError
            const circular = {};
            circular.self = circular;
            expect(() => findDuplicateIndex([circular, { ok: true }])).toThrow(TypeError);
        });
    });

    // Large Scale Test Cases
    describe('Performance tests', () => {
        test('should handle large input (1000 items) and find duplicate at the end efficiently', () => {
            // Build an array of 1000 elements where the last element is a duplicate of the first.
            // Loop is kept under 1000 iterations to respect constraints.
            const uniqueCount = 999; // create 999 unique values, then append a duplicate -> total 1000
            const arr = [];
            for (let i = 0; i < uniqueCount; i++) {
                arr.push(i); // unique primitive values
            }
            // append a duplicate of the very first element (0)
            arr.push(0);
            // duplicate occurs at last index (uniqueCount)
            expect(arr.length).toBe(1000);  // 237μs -> 130μs (81.8% faster)
            expect(findDuplicateIndex(arr)).toBe(uniqueCount);
        });

        test('should return -1 for a large all-unique array (1000 items)', () => {
            // Another large-scale check: 1000 unique elements -> should return -1.
            // Use exactly 1000 iterations which is within the allowed threshold.
            const size = 1000;
            const arr = [];
            for (let i = 0; i < size; i++) {
                // use strings to avoid any numeric edge collisions
                arr.push(`val_${i}`);
            }
            expect(arr.length).toBe(size);  // 213μs -> 132μs (61.5% faster)
            expect(findDuplicateIndex(arr)).toBe(-1);
        });
    });
});

// @ts-nocheck
import { findDuplicateIndex } from '../src/workers/Evaluation/helpers';

describe('findDuplicateIndex', () => {
  // Basic Test Cases
  describe('Basic functionality', () => {
    test('should return -1 when array has no duplicates', () => {
      const arr = [1, 2, 3, 4, 5];
      expect(findDuplicateIndex(arr)).toBe(-1);  // 3.68μs -> 2.32μs (58.5% faster)
    });

    test('should return index of first duplicate with simple values', () => {
      const arr = [1, 2, 3, 2];
      expect(findDuplicateIndex(arr)).toBe(3);  // 1.84μs -> 1.50μs (23.0% faster)
    });

    test('should return index of first duplicate with strings', () => {
      const arr = ['a', 'b', 'c', 'b'];
      expect(findDuplicateIndex(arr)).toBe(3);  // 1.88μs -> 1.44μs (30.2% faster)
    });

    test('should handle array with duplicate at position 1', () => {
      const arr = [1, 1];
      expect(findDuplicateIndex(arr)).toBe(1);  // 1.94μs -> 1.20μs (62.3% faster)
    });

    test('should detect duplicate objects with same content', () => {
      const arr = [{ a: 1 }, { b: 2 }, { a: 1 }];
      expect(findDuplicateIndex(arr)).toBe(2);  // 2.02μs -> 1.73μs (16.5% faster)
    });

    test('should detect duplicate arrays with same content', () => {
      const arr = [[1, 2], [3, 4], [1, 2]];
      expect(findDuplicateIndex(arr)).toBe(2);  // 1.88μs -> 1.63μs (15.2% faster)
    });

    test('should handle array with multiple duplicates, return first', () => {
      const arr = [1, 2, 1, 3, 2];
      expect(findDuplicateIndex(arr)).toBe(2);  // 1.45μs -> 1.34μs (8.59% faster)
    });

    test('should work with boolean values', () => {
      const arr = [true, false, true];
      expect(findDuplicateIndex(arr)).toBe(2);  // 2.85μs -> 1.25μs (129% faster)
    });

    test('should work with null values', () => {
      const arr = [null, 1, null];
      expect(findDuplicateIndex(arr)).toBe(2);  // 2.70μs -> 1.36μs (98.6% faster)
    });

    test('should work with undefined values', () => {
      const arr = [undefined, 1, undefined];
      expect(findDuplicateIndex(arr)).toBe(2);  // 2.69μs -> 1.13μs (138% faster)
    });

    test('should handle mixed types without false positives', () => {
      const arr = [1, '1', 1];
      expect(findDuplicateIndex(arr)).toBe(2);  // 2.68μs -> 1.26μs (112% faster)
    });

    test('should handle nested objects with duplicates', () => {
      const arr = [{ a: { b: 1 } }, { a: { b: 2 } }, { a: { b: 1 } }];
      expect(findDuplicateIndex(arr)).toBe(2);  // 2.50μs -> 2.07μs (20.5% faster)
    });
  });

  // Edge Test Cases
  describe('Edge cases', () => {
    test('should return -1 for empty array', () => {
      const arr = [];
      expect(findDuplicateIndex(arr)).toBe(-1);  // 2.06μs -> 2.20μs (6.31% slower)
    });

    test('should return -1 for single element array', () => {
      const arr = [1];
      expect(findDuplicateIndex(arr)).toBe(-1);  // 2.10μs -> 987ns (113% faster)
    });

    test('should return -1 for two unique elements', () => {
      const arr = [1, 2];
      expect(findDuplicateIndex(arr)).toBe(-1);  // 2.35μs -> 1.08μs (119% faster)
    });

    test('should handle array with consecutive duplicates', () => {
      const arr = [1, 2, 2];
      expect(findDuplicateIndex(arr)).toBe(2);  // 1.54μs -> 1.31μs (17.6% faster)
    });

    test('should handle array where last element is duplicate', () => {
      const arr = [1, 2, 3, 4, 5, 1];
      expect(findDuplicateIndex(arr)).toBe(5);  // 2.34μs -> 1.87μs (25.4% faster)
    });

    test('should correctly distinguish between 0 and false', () => {
      const arr = [0, false, 0];
      expect(findDuplicateIndex(arr)).toBe(2);  // 1.53μs -> 1.25μs (21.8% faster)
    });

    test('should correctly distinguish between 0 and null', () => {
      const arr = [0, null, 0];
      expect(findDuplicateIndex(arr)).toBe(2);  // 1.56μs -> 1.76μs (11.4% slower)
    });

    test('should correctly distinguish between empty string and null', () => {
      const arr = ['', null, ''];
      expect(findDuplicateIndex(arr)).toBe(2);  // 1.55μs -> 1.24μs (25.4% faster)
    });

    test('should handle empty object duplicates', () => {
      const arr = [{}, {}, {}];
      expect(findDuplicateIndex(arr)).toBe(1);  // 1.38μs -> 1.22μs (13.2% faster)
    });

    test('should handle empty array duplicates', () => {
      const arr = [[], [], []];
      expect(findDuplicateIndex(arr)).toBe(1);  // 1.36μs -> 1.13μs (20.1% faster)
    });

    test('should handle objects with property order differences', () => {
      const arr = [{ a: 1, b: 2 }, { b: 2, a: 1 }];
      // Same content, different order - JSON.stringify may differ based on property order
      const result = findDuplicateIndex(arr);
      expect(typeof result).toBe('number');  // 1.72μs -> 1.56μs (9.90% faster)
    });

    test('should handle deeply nested structures', () => {
      const deepObj = { a: { b: { c: { d: { e: 1 } } } } };
      const arr = [deepObj, { x: 1 }, deepObj];
      expect(findDuplicateIndex(arr)).toBe(2);  // 3.08μs -> 5.29μs (41.7% slower)
    });

    test('should handle arrays with NaN (NaN !== NaN, so no duplicate expected)', () => {
      const arr = [NaN, 1, NaN];
      // NaN serializes to null in JSON, so both should be "null"
      expect(findDuplicateIndex(arr)).toBe(2);  // 1.51μs -> 2.62μs (42.2% slower)
    });

    test('should handle Infinity values', () => {
      const arr = [Infinity, 1, Infinity];
      expect(findDuplicateIndex(arr)).toBe(2);  // 1.50μs -> 2.88μs (47.9% slower)
    });

    test('should handle negative Infinity values', () => {
      const arr = [-Infinity, 1, -Infinity];
      expect(findDuplicateIndex(arr)).toBe(2);  // 1.58μs -> 1.93μs (18.3% slower)
    });

    test('should handle very long strings', () => {
      const longStr = 'a'.repeat(10000);
      const arr = [longStr, 'b', longStr];
      expect(findDuplicateIndex(arr)).toBe(2);  // 77.8μs -> 75.2μs (3.42% faster)
    });

    test('should handle objects with circular reference (will stringify safely)', () => {
      const obj = { a: 1 };
      const arr = [obj, { b: 2 }, obj];
      // Note: circular references would cause issues, but this tests duplicate reference detection
      expect(findDuplicateIndex(arr)).toBe(2);  // 2.03μs -> 1.71μs (18.6% faster)
    });
  });

  // Large Scale Test Cases
  describe('Performance tests', () => {
    test('should handle large array with no duplicates efficiently', () => {
      const arr = Array.from({ length: 1000 }, (_, i) => i);
      const start = performance.now();
      const result = findDuplicateIndex(arr);
      const end = performance.now();
      
      expect(result).toBe(-1);  // 234μs -> 87.4μs (168% faster)
      expect(end - start).toBeLessThan(1000); // Should complete within 1 second
    });

    test('should handle large array and find duplicate quickly', () => {
      const arr = Array.from({ length: 1000 }, (_, i) => i);
      arr[500] = 0; // Add a duplicate early in the array
      
      const start = performance.now();
      const result = findDuplicateIndex(arr);
      const end = performance.now();
      
      expect(result).toBe(500);  // 82.7μs -> 42.8μs (93.3% faster)
      expect(end - start).toBeLessThan(1000);
    });

    test('should handle large array with objects efficiently', () => {
      const arr = Array.from({ length: 500 }, (_, i) => ({ id: i, value: `item_${i}` }));
      
      const start = performance.now();
      const result = findDuplicateIndex(arr);
      const end = performance.now();
      
      expect(result).toBe(-1);  // 237μs -> 346μs (31.6% slower)
      expect(end - start).toBeLessThan(1000);
    });

    test('should detect duplicate in large object array', () => {
      const obj = { id: 1, data: 'test' };
      const arr = Array.from({ length: 500 }, (_, i) => ({ id: i, data: `test_${i}` }));
      arr[250] = obj;
      arr[450] = obj; // Duplicate at index 450
      
      const start = performance.now();
      const result = findDuplicateIndex(arr);
      const end = performance.now();
      
      expect(result).toBe(450);  // 183μs -> 243μs (24.6% slower)
      expect(end - start).toBeLessThan(1000);
    });

    test('should handle large mixed type array', () => {
      const arr = [];
      for (let i = 0; i < 500; i++) {
        if (i % 3 === 0) arr.push(i);
        else if (i % 3 === 1) arr.push(`str_${i}`);
        else arr.push({ num: i });
      }
      
      const start = performance.now();
      const result = findDuplicateIndex(arr);
      const end = performance.now();
      
      expect(result).toBe(-1);  // 162μs -> 182μs (11.0% slower)
      expect(end - start).toBeLessThan(1000);
    });

    test('should efficiently exit early when duplicate found near start of large array', () => {
      const arr = [1, 2, 1];
      arr.push(...Array.from({ length: 997 }, (_, i) => i + 3)); // Add 997 more unique items
      
      const start = performance.now();
      const result = findDuplicateIndex(arr);
      const end = performance.now();
      
      expect(result).toBe(2);  // 1.60μs -> 2.46μs (35.1% slower)
      expect(end - start).toBeLessThan(100); // Should be very fast with early exit
    });
  });
});

📊 Performance Profile

View detailed line-by-line performance analysis
To edit these changes git checkout codeflash/optimize-findDuplicateIndex-ml24bvua and push.

The optimized code achieves a **14% runtime improvement** through three key micro-optimizations that reduce per-iteration overhead: **What Changed:** 1. **Direct duplicate detection**: Replaced size-tracking logic (`_uniqSet.size > currSetSize`) with immediate `Set.has()` check before insertion. This eliminates the need to track and compare set sizes on every iteration. 2. **Cached `JSON.stringify` reference**: Storing `JSON.stringify` in a local variable (`stringify`) avoids repeated property lookups on the global `JSON` object across iterations. 3. **Cached array length**: Using `len = arr.length` in the for-loop initialization eliminates redundant length property access on each iteration. **Why It's Faster:** The original implementation performed unnecessary work on every loop iteration: - Unconditionally added elements to the set even when duplicates existed - Compared set sizes after every insertion - Accessed `JSON.stringify` and `arr.length` properties repeatedly The optimized version short-circuits immediately upon finding a duplicate (via `Set.has()`) and reduces property access overhead through caching. For duplicate detection, checking membership before insertion is more efficient than comparing sizes after insertion. **Performance Characteristics:** Based on the annotated tests, this optimization excels with: - **Large arrays with all unique elements** (1000 items): 62-168% faster due to reduced per-iteration overhead compounding over many iterations - **Primitive value arrays**: 15-138% faster on simple test cases - **Early duplicate detection**: Maintains fast early-exit behavior Some complex object tests show minor regressions (up to 48% slower for deeply nested structures), likely due to the additional local variable overhead being more noticeable when `JSON.stringify` dominates runtime. However, the overall **14% runtime improvement** demonstrates that the optimization benefits typical workloads, particularly those with larger arrays where the per-iteration savings accumulate significantly. The optimization preserves exact functional behavior, including handling of edge cases (NaN, undefined, circular references, sparse arrays).

codeflash-ai bot requested a review from misrasaurabh1 January 31, 2026 09:37

codeflash-ai bot added the ⚡️ codeflash Optimization PR opened by Codeflash AI label Jan 31, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

⚡️ Speed up function `findDuplicateIndex` by 14%#31

⚡️ Speed up function `findDuplicateIndex` by 14%#31
codeflash-ai[bot] wants to merge 1 commit intoreleasefrom
codeflash/optimize-findDuplicateIndex-ml24bvua

codeflash-ai bot commented Jan 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

0 participants

Comments

Conversation

codeflash-ai bot commented Jan 31, 2026

📄 14% (0.14x) speedup for findDuplicateIndex in app/client/src/workers/Evaluation/helpers.ts

📝 Explanation and details

📊 Performance Profile

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

0 participants

📄 14% (0.14x) speedup for `findDuplicateIndex` in `app/client/src/workers/Evaluation/helpers.ts`