Dark Mode
Capec-71 Detail
Using Unicode Encoding to Bypass Validation Logic
Detailed Software Likelihood: Medium Typical Severity: High
Parents: 267
Threats: T62 T290 T291
An attacker may provide a Unicode string to a system component that is not Unicode aware and use that to circumvent the filter or cause the classifying mechanism to fail to properly understanding the request. That may allow the attacker to slip malicious data past the content filter and/or possibly cause the application to route the request incorrectly.
Not present
| External ID | Source | Link | Description |
|---|---|---|---|
| CAPEC-71 | capec | https://capec.mitre.org/data/definitions/71.html | |
| CWE-176 | cwe | http://cwe.mitre.org/data/definitions/176.html | |
| CWE-179 | cwe | http://cwe.mitre.org/data/definitions/179.html | |
| CWE-180 | cwe | http://cwe.mitre.org/data/definitions/180.html | |
| CWE-173 | cwe | http://cwe.mitre.org/data/definitions/173.html | |
| CWE-172 | cwe | http://cwe.mitre.org/data/definitions/172.html | |
| CWE-184 | cwe | http://cwe.mitre.org/data/definitions/184.html | |
| CWE-183 | cwe | http://cwe.mitre.org/data/definitions/183.html | |
| CWE-74 | cwe | http://cwe.mitre.org/data/definitions/74.html | |
| CWE-20 | cwe | http://cwe.mitre.org/data/definitions/20.html | |
| CWE-697 | cwe | http://cwe.mitre.org/data/definitions/697.html | |
| CWE-692 | cwe | http://cwe.mitre.org/data/definitions/692.html | |
| OWASP Attacks | https://owasp.org/www-community/attacks/Unicode_Encoding | Unicode Encoding | |
| REF-1 | reference_from_CAPEC | G. Hoglund, G. McGraw, Exploiting Software: How to Break Code, 2004--02, Addison-Wesley |
Explore
-
Survey the application for user-controllable inputs: Using a browser or an automated tool, an attacker follows all public links and actions on a web site. They record all the links, the forms, the resources accessed and all other potential entry-points for the web application.
| Techniques |
|---|
| Use a spidering tool to follow and record all links and analyze the web pages to find entry points. Make special note of any links that include parameters in the URL. |
| Use a proxy tool to record all user input entry points visited during a manual traversal of the web application. |
| Use a browser to manually explore the website and analyze how it is constructed. Many browsers' plugins are available to facilitate the analysis or automate the discovery. |
Experiment
-
Probe entry points to locate vulnerabilities: The attacker uses the entry points gathered in the "Explore" phase as a target list and injects various Unicode encoded payloads to determine if an entry point actually represents a vulnerability with insufficient validation logic and to characterize the extent to which the vulnerability can be exploited.
| Techniques |
|---|
| Try to use Unicode encoding of content in Scripts in order to bypass validation routines. |
| Try to use Unicode encoding of content in HTML in order to bypass validation routines. |
| Try to use Unicode encoding of content in CSS in order to bypass validation routines. |
- Filtering is performed on data that has not be properly canonicalized.
Not present
| Medium |
|---|
| An attacker needs to understand Unicode encodings and have an idea (or be able to find out) what system components may not be Unicode aware. |
| Integrity | Availability | Authorization | Access Control | Confidentiality |
|---|---|---|---|---|
| Execute Unauthorized Commands (Run Arbitrary Code) | Execute Unauthorized Commands (Run Arbitrary Code) | Bypass Protection Mechanism | Bypass Protection Mechanism | Bypass Protection Mechanism |
| Modify Data | Unreliable Execution | Execute Unauthorized Commands (Run Arbitrary Code) |
- A very common technique for a Unicode attack involves traversing directories looking for interesting files. An example of this idea applied to the Web is http://target.server/some_directory/../../../winnt In this case, the attacker is attempting to traverse to a directory that is not supposed to be part of standard Web services. The trick is fairly obvious, so many Web servers and scripts prevent it. However, using alternate encoding tricks, an attacker may be able to get around badly implemented request filters. In October 2000, an adversary publicly revealed that Microsoft's IIS server suffered from a variation of this problem. In the case of IIS, all the attacker had to do was provide alternate encodings for the dots and/or slashes found in a classic attack. The Unicode translations are . yields C0 AE/ yields C0 AF\ yields C1 9C Using this conversion, the previously displayed URL can be encoded as http://target.server/some_directory/%C0AE/%C0AE/%C0AE%C0AE/%C0AE%C0AE/winntSee also: CVE-2000-0884