Guide for test proxy troubleshooting
November 3, 2025 ยท View on GitHub
This guide details some common errors that can come up when migrating to and using the Azure SDK test proxy.
Documentation of test proxy's underlying functionality can be found here in the azure-sdk-tools
GitHub repository, but this isn't necessary to read for Python testing.
Table of contents
- Debugging tip
- ResourceNotFoundError: Playback failure
- Test collection failure
- Errors in tests using resource preparers
- Test failure during
record/startorplayback/startrequests - Playback failures from body matching errors
- Playback failures from inconsistent line breaks
- Playback failures from URL mismatches
- Playback failures from inconsistent test values
- Recordings not being produced
- ConnectionError during tests
- Different error than expected when using proxy
- Test setup failure in test pipeline
- Fixture not found error
- PermissionError during startup
- ServiceRequestError: Cannot connect to host
Debugging tip
To see more detailed output from tests, you can run pytest commands with the flags -s and --log-cli-level=DEBUG.
The former will output print statements and more logging, and the latter will expose DEBUG-level logs that are hidden
by default. For example:
pytest .\tests\test_client.py -s --log-cli-level=DEBUG
Additionally, the -k flag can be used to collect and run tests that have a specific name. For example, providing
-k "test_delete or test_upload" to the pytest command will only collect and execute tests that have method names
containing the strings test_delete or test_upload.
For more information about pytest invocations, refer to Usage and Invocations.
ResourceNotFoundError: Playback failure
Test playback errors typically raise with a message similar to the following:
FAILED test_client.py::TestClient::test_client_method - azure.core.exceptions.ResourceNotFoundError:
Playback failure -- for help resolving, see https://aka.ms/azsdk/python/test-proxy/troubleshoot. Error details:
Unable to find a record for the request POST https://fake_resource.service.azure.net?api-version=2025-09-01
This means that the test recording didn't contain a match for the incoming playback request. This usually just means that the test needs to be re-recorded to pick up library updates (e.g. a new service API version).
If playback errors persist after re-recording, you may need to modify session sanitizers or matchers. The following sections of this guide describe common scenarios:
- Playback failures from body matching errors
- Playback failures from inconsistent line breaks
- Playback failures from URL mismatches
- Playback failures from inconsistent test values
Test collection failure
Make sure that all test class names begin with "Test", and that all test method names begin with "test_". For more
information about pytest test collection, please refer to the docs.
Special case: using pytest.mark.parametrize with recorded tests
Parametrization allows you to share test code by re-running the same test with varying inputs. For
example, azure-keyvault-keys tests are parametrized to run with multiple API versions and
multiple Key Vault configurations.
Because of how the pytest.mark.parametrize mechanism works, the recorded_by_proxy(_async) decorators aren't
compatible without an additional decorator that handles the arguments we want to parametrize. The callable that
pytest.mark.parametrize decorates needs to have positional parameters that match the arguments we're parametrizing;
for example:
import pytest
from devtools_testutils import recorded_by_proxy
test_values = [
("first_value_a", "first_value_b"),
("second_value_a", "second_value_b"),
]
# Works because `parametrize` decorates a method with positional `a` and `b` parameters
@pytest.mark.parametrize("a, b", test_values)
def test_function(a, b, **kwargs):
...
# Doesn't work; raises collection error
# `recorded_by_proxy`'s wrapping function doesn't accept positional `a` and `b` parameters
@pytest.mark.parametrize("a, b", test_values)
@recorded_by_proxy
def test_recorded_function(a, b, **kwargs):
...
To parametrize recorded tests, we need a decorator between pytest.mark.parametrize and recorded_by_proxy that
accepts the expected arguments. We can do this by declaring a class with a custom __call__ method:
class ArgumentPasser:
def __call__(self, fn):
# _wrapper accepts the `a` and `b` arguments we want to parametrize with
def _wrapper(test_class, a, b, **kwargs):
fn(test_class, a, b, **kwargs)
return _wrapper
# Works because `ArgumentPasser.__call__`'s return value has the expected parameters
@pytest.mark.parametrize("a, b", test_values)
@ArgumentPasser()
@recorded_by_proxy
def test_recorded_function(a, b, **kwargs):
...
You can also introduce additional logic into the __call__ method of your intermediate decorator. In the aforementioned
azure-keyvault-keys test example, the decorator between parametrize and recorded_by_proxy
is actually a client preparer that creates a client based on the parametrized input and passes this
client to the test.
Errors in tests using resource preparers
Test suites that haven't fully migrated to using a test-resources.json file for test resource deployment might use
resource preparers, such as
ResourceGroupPreparer.
Resource preparers need a management client to function, so test classes that use them will need to inherit from
AzureMgmtRecordedTestCase instead of AzureRecordedTestCase.
Test failure during record/start or playback/start requests
If tests fail during startup, logs might indicate that POST requests to record/start or playback/start endpoints
are returning 500 responses. In a stack trace, these errors might be raised here or
here, respectively.
This suggests that the test proxy failed to fetch recordings from the assets repository. This likely comes from a
corrupted git configuration in azure-sdk-for-python/.assets. To resolve this:
- Upgrade your local version of
gitto at least 2.30.0 - Remove the
.assetsdirectory completely. To do this easily,cdinto the root ofazure-sdk-for-pythonwith PowerShell >= 7.0.0 and run
Remove-Item -Recurse -Force .\.assets\
After running tests again, a new .assets directory will be created and tests should run normally.
If the problem persists, try removing both the .assets folder and your local test proxy tool. cd into the root of
azure-sdk-for-python and run the following PowerShell commands:
Remove-Item -Recurse -Force .\.assets\
Remove-Item -Recurse -Force .\.proxy\
These folders will be freshly recreated the next time you run tests.
Playback failures from body matching errors
The test proxy system enables body matching by default. For example, if a test sends a request that includes the current Unix time in its body, the body will contain a new value when run in playback mode at a later time -- this request won't match the recording if body matching is enabled.
Body matching can be turned off with the test proxy by calling the set_bodiless_matcher method from
devtools_testutils/sanitizers.py at the very start of a test method. This matcher applies only to the
test method that set_bodiless_matcher is called from, so other tests in the pytest session will still have body
matching enabled by default.
Playback failures from inconsistent line breaks
Line breaks can vary between OSes and cause tests to fail on certain platforms, in which case it can help to specify a
particular format for test files by using .gitattributes.
A .gitattributes file can be placed at the root of a directory to apply git settings to each file under that
directory. If a test directory contains files that need to have consistent line breaks, for example LF breaks instead
of CRLF ones, you can create a .gitattributes file in the directory with the following content:
# Force git to checkout text files with LF (line feed) as the ending (vs CRLF)
# This allows us to consistently run tests that depend on the exact contents of a file
* text=auto eol=lf
For a real example, refer to https://github.com/Azure/azure-sdk-for-python/pull/29955.
Playback failures from URL mismatches
URL matching errors in playback tests can come from a variety of issues. This section lists common ones and how to resolve them.
Duplicated slash(es) in URLs
This most often appears at the end of the URL domain; for example:
Uri doesn't match:
request: https://fake_resource.service.azure.net/path
record: https://fake_resource.service.azure.net//path
This most often comes from an EnvironmentVariableLoader playback endpoint ending with a trailing slash (e.g.
https://fake_resource.service.azure.net/) while the live-mode URL doesn't (e.g.
https://fake_resource.service.azure.net). A slash gets added to the real endpoint during tests, and then the domain
-- without a trailing slash -- is sanitized with a URL that has an additional trailing slash.
Check the real values of endpoints in your .env file, and ensure the formatting of corresponding playback endpoint
values match in any sanitizer or EnvironmentVariableLoader uses.
Inconsistent query parameter ordering
By default, the test proxy tries to match URLs exactly. If there's a section of the URL that's indeterminately ordered, you may intermittently see matching errors. This often happens with query parameters; for example:
Uri doesn't match:
request: https://fake_resource.service.azure.net/?a=value1&b=value2
record: https://fake_resource.service.azure.net/?b=value2&a=value1
To match requests for query parameter content instead of exact ordering, you can use the
set_custom_default_matcher method from devtools_testutils with the keyword argument
ignore_query_ordering=True. Calling this method inside the body of a test function will update the matcher for only
that test, which is recommended.
Sanitization impacting request URL/body/headers
In some cases, a value in a response body is used in the following request as part of the URL, body, or headers. If this value is sanitized, the recorded request might differ than what is expected during playback. Common culprits include sanitization of "name", "id", and "Location" fields. To resolve this, you can either opt out of specific sanitization or add another sanitizer to align with the sanitized value.
Opt out
You can opt out of sanitization for the fields that are used for your requests by calling the remove_batch_sanitizer
method from devtools_testutils with the sanitizer IDs to exclude. Generally, this is done in
the conftest.py file, in the one of the session-scoped fixtures. Example:
from devtools_testutils import remove_batch_sanitizers, test_proxy
@pytest.fixture(scope="session", autouse=True)
def add_sanitizers(test_proxy):
...
# Remove the following body key sanitizer: AZSDK3493: $..name
remove_batch_sanitizers(["AZSDK3493"])
Some sanitizer IDs that are often opted out of are:
AZSDK2003:Location- Header regex sanitizerAZSDK3430:$..id- Body key sanitizerAZSDK3493:$..name- Body key sanitizer
However, please be mindful when opting out of a sanitizer, and ensure that no sensitive data is being exposed.
Add another sanitizer
Alternatively, you can add another sanitizer to align the recorded request with the expected request, modifying the URL, body, or headers as needed. Example:
from devtools_testutils import add_uri_regex_sanitizer
add_uri_regex_sanitizer(regex="(?<=https://.+/foo/bar/)(?<id>[^/?\\.]+)", group_for_replace="id", value="Sanitized")
Playback failures from inconsistent test values
To run recorded tests successfully when recorded values are inconsistent or random and can't be sanitized, the test
proxy provides a variables API. This makes it possible for a test to record the values of variables that were used
during recording and use the same values in playback mode without a sanitizer.
Note that the recorded variables must have string values. For example, trying to record an integer value for a variable will cause a test proxy error.
For example, imagine that a test uses a randomized table_uuid variable when creating resources. The same random value
for table_uuid can be used in playback mode by using this variables API.
There are two requirements for a test to use recorded variables. First, the test method should accept **kwargs.
Second, the test method should return a dictionary with any test variables that it wants to record. This dictionary
will be stored in the recording when the test is run live, and will be passed to the test as a variables keyword
argument when the test is run in playback.
Below is a code example of how a test method could use recorded variables:
from devtools_testutils import AzureRecordedTestCase, recorded_by_proxy
class TestExample(AzureRecordedTestCase):
@recorded_by_proxy
def test_example(self, **kwargs):
# In live mode, variables is an empty dictionary
# In playback mode, the value of variables is {"current_time": "<previously recorded time>"}
variables = kwargs.pop("variables", {})
# To fetch variable values, use the `setdefault` method to look for a key ("current_time")
# and set a real value for that key if it's not present (str(time.time()))
# Note that time.time() is converted from a float to a string to record it properly
current_time = variables.setdefault("current_time", str(time.time()))
...
# return the variables at the end of the test to record them
return variables
Note:
variableswill be passed as a named argument to any test that acceptskwargsby the test proxy. In environments that don't use the test proxy, though -- like live test pipelines --variableswon't be provided. To avoid a KeyError, providing an empty dictionary as the default value tokwargs.popis recommended.
Recordings not being produced
Ensure the environment variable AZURE_SKIP_LIVE_RECORDING isn't set to "true", and that AZURE_TEST_RUN_LIVE
is set to "true".
ConnectionError during tests
For example, you may see a requests.exceptions.ConnectionError when trying to make service or sanitizer setup
requests. This means that the test proxy tool never started correctly; ensure the test_proxy fixture is being invoked
during test startup so that the tool is available during tests.
Different error than expected when using proxy
Some tests intentionally trigger exceptions in order to validate error behavior. There are a few known cases where the exception returned will be different when using the test proxy vs. when sending requests to the service directly.
One such instance is in the case of a DNS lookup failure, which can occur when trying to contact a nonexistent endpoint. This issue describes an instance of this behavior. As described in the issue, the best way to work around this for the time being is to have tests expect either of two potential errors, to cover both cases. For example:
with pytest.raises((ServiceRequestError, HttpResponseError)) as exc_info:
# This request will raise a ServiceRequestError when sent directly
# When using the test proxy, we get an HttpResponseError instead
... # Request that triggers DNS lookup failure
# Make sure the HttpResponseError is raised for the same reason: DNS lookup failure
if exc_info.type is HttpResponseError:
response_content = json.loads(exc_info.value.response.content)
assert "Name does not resolve" in response_content["Message"]
Test setup failure in test pipeline
If the test proxy isn't configured correctly for pipeline tests, you may see each test fail with an error message
of test setup failure.
CI pipelines
To enable using the test proxy in CI, you need to set the parameter TestProxy: true in the ci.yml file in the
service-level folder. For example, in sdk/eventgrid/ci.yml:
extends:
template: ../../eng/pipelines/templates/stages/archetype-sdk-client.yml
parameters:
ServiceDirectory: eventgrid
+ TestProxy: true
...
Live test pipelines
For tests to succeed in live test pipelines, make sure environment variables AZURE_SKIP_LIVE_RECORDING and
AZURE_TEST_RUN_LIVE are set to True in the tests.yml file in the service-level folder. For example, in
sdk/textanalytics/tests.yml:
stages:
- template: ../../eng/pipelines/templates/stages/archetype-sdk-tests.yml
parameters:
...
EnvVars:
...
+ AZURE_SKIP_LIVE_RECORDING: 'True'
+ AZURE_TEST_RUN_LIVE: 'true'
Requests are made directly to the service instead of going through the proxy when live tests are run with recording
skipped, so the TestProxy parameter doesn't need to be set in tests.yml.
Fixture not found error
Tests that aren't recorded should omit the recorded_by_proxy decorator. However, if these unrecorded tests accept
parameters that are provided by a preparer like the devtools_testutils EnvironmentVariableLoader,
you may see a test setup error. For example, imagine a test is decorated with a preparer that provides a Key Vault URL
as a azure_keyvault_url parameter:
class TestExample(AzureRecordedTestCase):
@EnvironmentVariableLoader("keyvault", azure_keyvault_url="https://vaultname.vault.azure.net")
def test_example(self, azure_keyvault_url):
The above would work in the old test setup, but with the test proxy, running the test will yield
_______ ERROR at setup of TestExample.test_example _______
...
E fixture 'azure_keyvault_url' not found
This is because AzureRecordedTestCase doesn't inherit from unittest.TestCase; pytest assumes that any named
parameter in a test method is a reference to a fixture unless the test method is wrapped in a particular way. Wrapping
a test with the recorded_by_proxy decorator will permit using named parameters, but wrapping with decorators like
EnvironmentVariableLoader alone will not.
PermissionError during startup
While the test proxy is being invoked during the start of a test run, you may see an error such as
PermissionError: [Errno 13] Permission denied: '.../azure-sdk-for-python/.proxy/Azure.Sdk.Tools.TestProxy'
This can mean that the test proxy tool was successfully installed at the location in the error message, but we don't
have sufficient permissions to run it with the tool startup script. We can set the correct permissions on the file by
using chmod. (This is done automatically during tool setup, but could conceivably fail.) Using the tool path that was
provided in the PermissionError message, run the following command:
chmod +x .../azure-sdk-for-python/.proxy/Azure.Sdk.Tools.TestProxy
Alternatively, you can delete the installed tool and re-run your tests to automatically reinstall it correctly.
- Open Task Manager, search for a process named "Azure.Sdk.Tools.TestProxy", and end the task if one is running.
- Delete the
.proxyfolder at the root of your localazure-sdk-for-pythonclone. - Re-run your tests; the test proxy will be reinstalled and should correctly set file permissions.
ServiceRequestError: Cannot connect to host
When using HTTPS via PROXY_URL='https://localhost:5001', tests may fail during startup with the
following exception:
azure.core.exceptions.ServiceRequestError: Cannot connect to host localhost:5001
ssl:True [SSLCertVerificationError: (1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate
verify failed: self signed certificate (_ssl.c:1123)')]
This is caused by the test proxy's certificate being incorrectly configured. First, update your branch to include the
latest changes from main -- this ensures you have the latest certificate version (it needs to be occasionally
rotated).
If tests continue to fail, this is likely due to an async-specific environment issue. The certificate is automatically configured during proxy startup, but async environments can still nondeterministically fail.
To work around this, unset the PROXY_URL environment variable to default to HTTP, which doesn't require a certificate.
If your tests require an HTTPS endpoint, reach out to the Azure SDK team for assistance.