
⚡️ Speed up function encode_query by 19% #13

Open
codeflash-ai[bot] wants to merge 1 commit into main from codeflash/optimize-encode_query-mgul9vz0

Conversation


@codeflash-ai codeflash-ai bot commented Oct 17, 2025

📄 19% (1.19x) speedup for `encode_query` in `src/deepgram/core/query_encoder.py`

⏱️ Runtime: 5.96 milliseconds → 5.02 milliseconds (best of 145 runs)

📝 Explanation and details

Impact: high
Impact_explanation: Looking at this optimization report, I need to assess several key factors:

## Performance Analysis

**Overall Runtime Details:**

- Original: 5.96ms → Optimized: 5.02ms
- 18.65% speedup with millisecond-level runtimes indicates a meaningful improvement

**Generated Tests Results:**
The optimization shows strong performance gains, particularly for large-scale operations:

- `test_encode_query_large_list_of_primitives`: 100% speedup (232μs → 116μs)
- `test_encode_query_large_list_of_dicts`: 59.4% speedup (667μs → 419μs)
- `test_encode_query_large_flat_dict`: 30.5% speedup (472μs → 362μs)
- Most other tests show consistent 5-15% improvements

**Existing Tests Results:**

- Most tests show positive speedups ranging from 2.36% to 14.4%
- Only very small edge cases (empty query, None query) show minor regressions of 2-20%, but these are in the nanosecond range and negligible in practice
- The complex and realistic test cases show solid 9-14% improvements

**Replay Tests Results:**

- Mixed results with some regressions, but the positive cases show meaningful 4-8% improvements
- The regressions appear on very small workloads (microsecond range)

## Hot Path Analysis

The `calling_fn_details` shows that `encode_query` is called in the HTTP client's `request()` and `stream()` methods - these are core networking functions that are likely called frequently in any application using this SDK. This places the function squarely in a hot path where even modest improvements get multiplied across many invocations.
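
For context, here is a minimal sketch of how such key/value pairs are typically passed to an HTTP client (httpx is shown purely as an illustration, and the endpoint URL below is made up; the SDK's actual request wiring may differ):

```python
import httpx

# encode_query-style output: a flat list of (key, value) pairs. httpx accepts
# a list of two-tuples directly as query parameters, preserving repeated keys.
params = [("tags", "x"), ("tags", "y"), ("outer[inner]", 42)]

request = httpx.Request("GET", "https://api.example.com/v1/listen", params=params)
print(request.url)
# https://api.example.com/v1/listen?tags=x&tags=y&outer%5Binner%5D=42
```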

## Assessment

This optimization demonstrates:

1. **Consistent meaningful speedups** (18.65% overall) on realistic workloads
2. **Exceptional performance on large-scale operations** (59-100% improvements)
3. **Location in a critical hot path** (HTTP request processing)
4. **Asymptotic benefits** that scale with data size
5. **Minimal negative impact** on edge cases

The combination of solid percentage improvements, hot path location, and scaling benefits for larger workloads makes this a high-impact optimization.


The optimized code achieves an 18% speedup through several micro-optimizations that reduce Python's overhead in hot code paths:

**Key Optimizations:**

1. **Cached type/function lookups**: The code binds frequently used types and methods (`_BaseModel`, `_isBaseModel`, `dict_type`, `result_append`, `result_extend`) as local variables to avoid repeated attribute lookups during execution. In Python, local variable access is faster than attribute access.

2. **Streamlined type checking**: Instead of separate `isinstance` calls for `pydantic.BaseModel` and `dict`, the code uses a more efficient conditional expression pattern that reduces redundant type checks.

3. **Method binding for list operations**: By binding `result.append` and `result.extend` to local variables, the code avoids method lookup overhead in tight loops where these operations are called frequently.
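
To make these patterns concrete, here is a minimal, self-contained sketch (assumptions: `encode_query_sketch` is a simplified stand-in that ignores pydantic models and the recursive `single_query_encoder` path; it is not the SDK's actual implementation):

```python
from typing import Any, Dict, List, Tuple

def encode_query_sketch(query: Dict[str, Any]) -> List[Tuple[str, Any]]:
    """Simplified stand-in for encode_query, showing the micro-optimizations."""
    result: List[Tuple[str, Any]] = []

    # (1)/(3) Bind attributes and bound methods to locals once; a local
    # variable load is cheaper than an attribute lookup on every iteration.
    result_append = result.append
    result_extend = result.extend
    dict_type = dict

    for key, value in query.items():
        # (2) A single check against a cached type object per branch instead
        # of repeated isinstance calls against freshly looked-up globals.
        if isinstance(value, dict_type):
            # Flatten nested dicts as key[subkey] pairs.
            result_extend((f"{key}[{sub_key}]", sub_value)
                          for sub_key, sub_value in value.items())
        elif isinstance(value, list):
            result_extend((key, item) for item in value)
        else:
            result_append((key, value))
    return result

print(encode_query_sketch({"a": 1, "tags": ["x", "y"], "outer": {"inner": 42}}))
# -> [('a', 1), ('tags', 'x'), ('tags', 'y'), ('outer[inner]', 42)]
```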

**Performance Impact by Test Case:**

- **Large-scale operations benefit most**: `test_encode_query_large_list_of_primitives` shows a 100% speedup and `test_encode_query_large_list_of_dicts` shows a 59% speedup, indicating the optimizations are particularly effective for bulk operations.
- **Recursive/nested structures**: Tests with nested dicts and lists of dicts show 10-15% improvements, benefiting from reduced overhead in recursive calls.
- **Simple cases see modest gains**: Basic flat dictionaries show 5-12% improvements, demonstrating the optimizations don't hurt simple cases while providing significant benefits for complex ones.

The optimizations are most effective for workloads with large lists, nested structures, or frequent recursive calls to `single_query_encoder`, which matches the 43.7% of time spent in `.dict()` calls shown in the original profiler results.
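
The local-binding effect behind optimizations 1 and 3 is easy to reproduce in isolation; a quick micro-benchmark sketch (illustrative only, not from the PR) shows the gap:

```python
import timeit

setup = "data = list(range(1000))"

attribute_lookup = """
result = []
for x in data:
    result.append(x)  # attribute lookup on every iteration
"""

bound_method = """
result = []
append = result.append  # bind the method once
for x in data:
    append(x)
"""

print("attribute lookup:", timeit.timeit(attribute_lookup, setup, number=5000))
print("bound method:   ", timeit.timeit(bound_method, setup, number=5000))
```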

**Correctness verification report:**

| Test | Status |
| --- | --- |
| ⚙️ Existing Unit Tests | 29 Passed |
| 🌀 Generated Regression Tests | 23 Passed |
| ⏪ Replay Tests | 25 Passed |
| 🔎 Concolic Coverage Tests | 2 Passed |
| 📊 Tests Coverage | 100.0% |

**⚙️ Existing Unit Tests and Runtime**

| Test File::Test Function | Original ⏱️ | Optimized ⏱️ | Speedup |
| --- | --- | --- | --- |
| unit/test_core_query_encoder.py::TestEncodeQuery.test_complex_query | 14.0μs | 12.8μs | 9.90% ✅ |
| unit/test_core_query_encoder.py::TestEncodeQuery.test_empty_query | 1.13μs | 1.42μs | -20.5% ⚠️ |
| unit/test_core_query_encoder.py::TestEncodeQuery.test_none_query | 589ns | 607ns | -2.97% ⚠️ |
| unit/test_core_query_encoder.py::TestEncodeQuery.test_query_with_pydantic_models | 28.3μs | 27.3μs | 3.43% ✅ |
| unit/test_core_query_encoder.py::TestEncodeQuery.test_query_with_special_values | 7.51μs | 6.88μs | 9.13% ✅ |
| unit/test_core_query_encoder.py::TestEncodeQuery.test_simple_query | 4.67μs | 4.23μs | 10.3% ✅ |
| unit/test_core_query_encoder.py::TestQueryEncoderEdgeCases.test_circular_reference_protection | 4.66μs | 4.19μs | 11.3% ✅ |
| unit/test_core_query_encoder.py::TestQueryEncoderEdgeCases.test_unicode_and_special_characters | 7.21μs | 6.30μs | 14.4% ✅ |
| utils/test_query_encoding.py::test_encode_query_with_none | 604ns | 588ns | 2.72% ✅ |
| utils/test_query_encoding.py::test_query_encoding_deep_object_arrays | 14.4μs | 12.9μs | 11.9% ✅ |
| utils/test_query_encoding.py::test_query_encoding_deep_objects | 9.94μs | 9.71μs | 2.36% ✅ |

**🌀 Generated Regression Tests and Runtime**

```python
# imports
from src.deepgram.core.query_encoder import encode_query

# unit tests

# ---- Basic Test Cases ----

def test_encode_query_none():
    # Should return None for None input
    codeflash_output = encode_query(None) # 601ns -> 640ns (6.09% slower)
    assert codeflash_output is None

def test_encode_query_empty_dict():
    # Should return empty list for empty dict
    codeflash_output = encode_query({}) # 1.11μs -> 1.41μs (21.3% slower)
    assert codeflash_output == []

def test_encode_query_simple_flat_dict():
    # Should encode a flat dict with primitive values
    query = {"a": 1, "b": "foo", "c": True}
    expected = [("a", 1), ("b", "foo"), ("c", True)]
    codeflash_output = sorted(encode_query(query)) # 4.67μs -> 4.13μs (12.8% faster)
    assert codeflash_output == sorted(expected)

def test_encode_query_list_of_primitives():
    # Should encode a list of primitives under a key
    query = {"tags": ["x", "y", "z"]}
    expected = [("tags", "x"), ("tags", "y"), ("tags", "z")]
    codeflash_output = sorted(encode_query(query)) # 4.27μs -> 3.90μs (9.47% faster)
    assert codeflash_output == sorted(expected)

def test_encode_query_nested_dict():
    # Should encode a nested dict
    query = {"outer": {"inner": 42}}
    expected = [("outer[inner]", 42)]
    codeflash_output = encode_query(query) # 4.37μs -> 3.94μs (10.9% faster)
    assert codeflash_output == expected

def test_encode_query_list_of_dicts():
    # Should encode a list of dicts under a key
    query = {"items": [{"id": 1}, {"id": 2}]}
    expected = [("items[id]", 1), ("items[id]", 2)]
    codeflash_output = sorted(encode_query(query)) # 7.46μs -> 6.48μs (15.3% faster)
    assert codeflash_output == sorted(expected)

def test_encode_query_mixed_list():
    # Should encode a list with mixed primitives and dicts
    query = {"data": [1, {"a": "b"}, 2]}
    expected = [("data", 1), ("data[a]", "b"), ("data", 2)]
    codeflash_output = sorted(encode_query(query)) # 6.69μs -> 5.98μs (11.9% faster)
    assert codeflash_output == sorted(expected)

def test_encode_query_empty_list():
    # Should handle empty list under a key
    query = {"arr": []}
    expected = []
    codeflash_output = encode_query(query) # 3.28μs -> 2.99μs (9.59% faster)
    assert codeflash_output == expected

def test_encode_query_empty_dict_in_list():
    # Should handle empty dict in a list
    query = {"arr": [{}]}
    expected = []
    codeflash_output = encode_query(query) # 4.78μs -> 4.20μs (14.0% faster)
    assert codeflash_output == expected

def test_encode_query_nested_empty_dict():
    # Should handle nested empty dict
    query = {"outer": {"inner": {}}}
    expected = []
    codeflash_output = encode_query(query) # 4.40μs -> 4.12μs (6.80% faster)
    assert codeflash_output == expected

def test_encode_query_deeply_nested_dict():
    # Should encode a dict nested several layers deep
    query = {"a": {"b": {"c": {"d": 99}}}}
    expected = [("a[b][c][d]", 99)]
    codeflash_output = encode_query(query) # 5.87μs -> 5.49μs (6.93% faster)
    assert codeflash_output == expected

def test_encode_query_list_of_lists():
    # Should handle a list of lists under a key
    query = {"matrix": [[1, 2], [3, 4]]}
    expected = [("matrix", 1), ("matrix", 2), ("matrix", 3), ("matrix", 4)]
    codeflash_output = sorted(encode_query(query)) # 3.83μs -> 3.72μs (3.01% faster)
    assert codeflash_output == sorted(expected)

def test_encode_query_none_value():
    # Should encode None value as-is
    query = {"nada": None}
    expected = [("nada", None)]
    codeflash_output = encode_query(query) # 3.10μs -> 2.91μs (6.67% faster)
    assert codeflash_output == expected

def test_encode_query_dict_with_bool_and_none():
    # Should encode bool and None values correctly
    query = {"flag": True, "missing": None}
    expected = [("flag", True), ("missing", None)]
    codeflash_output = sorted(encode_query(query)) # 3.98μs -> 3.75μs (5.97% faster)
    assert codeflash_output == sorted(expected)

def test_encode_query_dict_with_int_keys():
    # Should encode dicts with int keys (converted to str)
    query = {1: "one", 2: "two"}
    expected = [("1", "one"), ("2", "two")]
    codeflash_output = sorted(encode_query(query)) # 3.83μs -> 3.50μs (9.30% faster)
    assert codeflash_output == sorted(expected)

def test_encode_query_dict_with_special_chars():
    # Should encode keys with special characters
    query = {"a.b": {"c-d": 5}}
    expected = [("a.b[c-d]", 5)]
    codeflash_output = encode_query(query) # 4.28μs -> 3.99μs (7.24% faster)
    assert codeflash_output == expected

def test_encode_query_dict_with_empty_string_key():
    # Should encode dict with empty string key; the key becomes '[x]'
    # because key_prefix is '' and k is 'x'
    query = {"": {"x": 1}}
    expected = [("[x]", 1)]
    codeflash_output = encode_query(query) # 4.30μs -> 3.97μs (8.35% faster)
    assert codeflash_output == expected

def test_encode_query_dict_with_empty_string_value():
    # Should encode dict with empty string value
    query = {"foo": ""}
    expected = [("foo", "")]
    codeflash_output = encode_query(query) # 3.11μs -> 2.92μs (6.22% faster)
    assert codeflash_output == expected

def test_encode_query_dict_with_list_of_dicts_and_primitives():
    # Should encode dict with list of dicts and primitives
    query = {"mixed": [{"x": 1}, 2, {"y": 3}]}
    expected = [("mixed[x]", 1), ("mixed", 2), ("mixed[y]", 3)]
    codeflash_output = sorted(encode_query(query)) # 8.20μs -> 7.13μs (15.1% faster)
    assert codeflash_output == sorted(expected)

# ---- Large Scale Test Cases ----

def test_encode_query_large_flat_dict():
    # Should handle a large flat dict
    query = {str(i): i for i in range(1000)}
    expected = [(str(i), i) for i in range(1000)]
    codeflash_output = encode_query(query); result = codeflash_output # 472μs -> 362μs (30.5% faster)
    assert result == expected

def test_encode_query_large_nested_dict():
    # Should handle a large nested dict
    query = {"outer": {str(i): i for i in range(500)}}
    expected = [("outer[" + str(i) + "]", i) for i in range(500)]
    codeflash_output = encode_query(query); result = codeflash_output # 130μs -> 132μs (1.24% slower)
    assert result == expected

def test_encode_query_large_list_of_dicts():
    # Should handle a large list of dicts
    query = {"items": [{"id": i} for i in range(500)]}
    expected = [("items[id]", i) for i in range(500)]
    codeflash_output = encode_query(query); result = codeflash_output # 667μs -> 419μs (59.4% faster)
    assert result == expected

def test_encode_query_large_list_of_primitives():
    # Should handle a large list of primitives
    query = {"nums": list(range(1000))}
    expected = [("nums", i) for i in range(1000)]
    codeflash_output = encode_query(query); result = codeflash_output # 232μs -> 116μs (100% faster)
    assert result == expected

#------------------------------------------------
from src.deepgram.core.query_encoder import encode_query

def test_encode_query():
    encode_query({'': {}})

def test_encode_query_2():
    encode_query(None)
```

**⏪ Replay Tests and Runtime**

| Test File::Test Function | Original ⏱️ | Optimized ⏱️ | Speedup |
| --- | --- | --- | --- |
| test_pytest_testsunittest_agent_v1_models_py_testsintegrationstest_advanced_features_py_testsutilstest_se__replay_test_0.py::test_src_deepgram_core_query_encoder_encode_query | 3.58μs | 4.16μs | -14.0% ⚠️ |
| test_pytest_testsunittest_core_query_encoder_py__replay_test_0.py::test_src_deepgram_core_query_encoder_encode_query | 56.0μs | 53.8μs | 4.08% ✅ |
| test_pytest_testsunittest_http_internals_py_testsintegrationstest_agent_client_py_testsunittest_telemetry__replay_test_0.py::test_src_deepgram_core_query_encoder_encode_query | 4.75μs | 5.44μs | -12.6% ⚠️ |
| test_pytest_testsunittest_listen_v1_models_py_testsunittest_telemetry_models_py_testsintegrationstest_rea__replay_test_0.py::test_src_deepgram_core_query_encoder_encode_query | 1.78μs | 2.15μs | -17.2% ⚠️ |
| test_pytest_testsutilstest_query_encoding_py_testsintegrationstest_auth_client_py_testsunittest_core_mode__replay_test_0.py::test_src_deepgram_core_query_encoder_encode_query | 23.2μs | 21.4μs | 8.75% ✅ |

**🔎 Concolic Coverage Tests and Runtime**

| Test File::Test Function | Original ⏱️ | Optimized ⏱️ | Speedup |
| --- | --- | --- | --- |
| codeflash_concolic_5p92pe1r/tmpb8127hc2/test_concolic_coverage.py::test_encode_query | 3.46μs | 3.17μs | 9.18% ✅ |
| codeflash_concolic_5p92pe1r/tmpb8127hc2/test_concolic_coverage.py::test_encode_query_2 | 564ns | 619ns | -8.89% ⚠️ |

To edit these changes, run `git checkout codeflash/optimize-encode_query-mgul9vz0` and push.

Codeflash

@codeflash-ai codeflash-ai bot requested a review from aseembits93 October 17, 2025 08:30
@codeflash-ai codeflash-ai bot added the ⚡️ codeflash Optimization PR opened by Codeflash AI label Oct 17, 2025
