.. Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. See the NOTICE file distributed with this work for additional information regarding copyright ownership. The ASF licenses this file to you under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0 Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License. .. include:: ../../common.defs .. _admin-plugins-regex-remap: Regex Remap Plugin ****************** This allows you to configure mapping rules based on regular expressions. This is similar to what you can accomplish using mod_rewrite in Apache httpd, but obviously not as flexible or sophisticated (yet). To use this plugin, configure a remap.config rule like :: map http://a.com http://b.com @plugin=regex_remap.so @pparam=maps.reg The file name parameter is always required. Unless an absolute path is specified, the file name is assumed to be a path relative to the Traffic Server configuration directory. The regular expressions listed in the configuration file are evaluated sequentially. When a regular expression is positively matched against a request URL, evaluation is stopped and the rewrite rule is applied. If none of the regular expressions are a match, the default destination URL is applied (``http://b.com`` in the example above). An optional argument (``@pparam``) with the string "``profile``\ " will enable profiling of this regex remap rule, e.g. :: ... @pparam=maps.reg @pparam=profile Profiling is very low overhead, and the information is dumped to ``traffic.out``, located in the log directory. This information is useful to optimize the order of your regular expression, such that the most common matches appears early in the file. In order to force a profile dump, you can do :: $ sudo touch remap.config $ sudo traffic_ctl config reload By default, this plugin operates on the post-remap URL (including any remappings done by preceding plugins in the remap rule). This behavior can be modified with the optional parameter :: @pparam=[no-]pristine [default: off] With ``@pparam=pristine``, the plugin will operate on the pre-remap, or pristine, URL. (But, if no regular expression in the config file is matched, the resulting URL will still be the post-remap URL.) By default, only the path and query string of the URL are provided for the regular expressions to match. The following optional parameters can be used to modify the plugin instance behavior :: @pparam=[no-]method [default: off] @pparam=[no-]query-string [default: on] @pparam=[no-]host [default: off] If you wish to match on the HTTP method used (e.g. "``GET``\ "), you must use the option ``@pparam=method``. e.g. :: ... @pparam=maps.reg @pparam=method With this enabled, the string that you will need to match will look like :: GET/path?query=bar The methods are always all upper-case, and always followed by one single space. There is no space between the method and the rest of the URL (or URI path). By default, the query string is part of the string that is matched again, to turn this off use the option 'no-query-string', e.g. :: ... @pparam=maps.reg @pparam=no-query-string Finally, to match on the host as well, use the option 'host', e.g. :: ... @pparam=maps.reg @pparam=host With this enabled, the string that you will need to match will look like :: //host/path?query=bar Note carefully that this includes the ``//`` prefix. This makes it possible to distinguish between the hostname and path components in cases where hostnames appear in the path. Consider, for example, a CDN that routes requests based on the origin hostname in the path, like this:: https://cdn.example.com/origin1.example.com/some/path The ``//`` prefix allows you to write a regex that reliably identifies the actual hostname versus a hostname-like string in the path. See the example for a demonstration of how to use the ``host`` option. A typical regex, without the above mentioned method or host options, would look like:: ^/(ogre.*)/more http://www.ogre.com/$h/$0/$1 The regular expression must not contain any white spaces! When the regular expression is matched, only the URL path + query string is matched (without any of the optional configuration options). The path will always start with a "/". Various substitution strings are allowed on the right hand side during evaluation :: $0 - The entire matched string $1-9 - Regular expression groups ($1 first group etc.) $h - The host as used in the "to" portion of the remap rule. For a long time it was the original host header from the request. $f - The host as used in the "from" portion of the remap rule $t - The host as used in the "to" portion of the remap rule $p - The original port number $s - The scheme (e.g. http) of the request $P - The entire path of the request $q - The query part of the request $r - The path parameters of the request (not implemented yet) $c - The cookie string from the request $i - The client IP for this request .. note:: The ``$0`` substitution expands to the characters that were matched by the regular expression, not to the entire string that the regular expression was matched against. You can also provide options, similar to how you configure your remap.config. The following options are available :: @status= - Force the response code to @active_timeout= - Active timeout (in ms) @no_activity_timeout= - No activity timeout (in ms) @connect_timeout= - Connect timeouts (in ms) @dns_timeout= - Connect timeouts (in ms) @caseless - Make regular expressions case insensitive @lowercase_substitutions - Turn on (enable) lower case substitutions @strategy - Specify a strategy from strategies.yaml. "null" or "" will clear the strategy. In addition to the options listed above, you can override any configuration variable from ``records.yaml`` on a per-rule basis. When a regex rule matches, any overridable configuration specified in that rule will be applied to the transaction. The syntax is ``@=``, for example :: @proxy.config.url_remap.pristine_host_hdr=0 @proxy.config.http.cache.http=1 These configuration overrides support integer, floating point, and string values, and will be automatically converted to the appropriate type. See :ref:`ts-overridable-config` for the complete list of configuration variables that can be overridden. This can be useful to force a particular response for some URLs, e.g. :: ^/(ogre.*)/bad http://www.example.com/ @status=404 Or, to force a 302 redirect :: ^/oldurl/(.*)$ http://news.example.com/new/$1 @status=302 Setting the status to 301 or 302 will force the new URL to be used as a redirect (Location:). You can also combine multiple options, including overridable configs :: ^/(.*)?$ https://example.com/sitemaps/$1 @proxy.config.url_remap.pristine_host_hdr=0 @status=301 Examples ~~~~~~~~ **Example: Basic path rewriting** By default, regex_remap matches against the URL path and query string, which always starts with ``/``. This example rewrites requests from an old URL structure to a new one:: # remap.config: map http://www.example.com http://backend.example.com @plugin=regex_remap.so @pparam=rewrites.conf # rewrites.conf: ^/old/(.*)$ http://backend.example.com/new/$1 A request to ``http://www.example.com/old/page.html`` will be rewritten to ``http://backend.example.com/new/page.html``. The ``$1`` substitution captures the first parenthesized group from the regex match. **Example: HTTP to HTTPS redirect with host matching** To redirect all HTTP requests to HTTPS while preserving the host and path, use the ``host`` option. With ``@pparam=host``, the match string is ``//host/path?query`` (note the ``//`` prefix), so the following rule captures the entire string and redirects to HTTPS. Because the captured group includes the ``//``, the substitution uses ``https:`` without ``//`` to avoid producing a malformed URL with four slashes:: # remap.config: regex_map http://(.*) https://unused.invalid @plugin=regex_remap.so @pparam=redirect.conf @pparam=host @pparam=pristine # redirect.conf: ^(.*)$ https:$1 @status=307 This will redirect any plaintext HTTP (``http://``) request for any host to the ``https://`` equivalent. For instance, a request to ``http://example.com/path`` would be redirected to ``https://example.com/path``. **Example: Method-based routing** To route requests differently based on the HTTP method, use the ``method`` option. With ``@pparam=method``, the match string is ``METHOD/path?query`` (note no space between method and path):: # remap.config: map http://api.example.com http://backend.example.com @plugin=regex_remap.so @pparam=api.conf @pparam=method # api.conf: ^GET/api/v1/(.*)$ http://read-replica.example.com/api/v1/$1 ^POST/api/v1/(.*)$ http://write-primary.example.com/api/v1/$1 This routes GET requests to a read replica and POST requests to the primary.