ClickHouse/tests
Azat Khuzhin 1d4a7c7290 Add support of !/* (exclamation/asterisk) in custom TLDs
Public suffix list may contain special characters (you may find format
here - [1]):
- asterisk (*)
- exclamation mark (!)

  [1]: https://github.com/publicsuffix/list/wiki/Format

It is easier to describe how it should be interpreted with an examples.

Consider the following part of the list:

    *.sch.uk
    *.kawasaki.jp
    !city.kawasaki.jp

And here are the results for `cutToFirstSignificantSubdomainCustom()`:

If you have only asterisk (*):

    foo.something.sheffield.sch.uk -> something.sheffield.sch.uk
    sheffield.sch.uk               -> sheffield.sch.uk

If you have exclamation mark (!) too:

    foo.kawasaki.jp                -> foo.kawasaki.jp
    foo.foo.kawasaki.jp            -> foo.foo.kawasaki.jp
    city.kawasaki.jp               -> city.kawasaki.jp
    some.city.kawasaki.jp          -> city.kawasaki.jp

TLDs had been verified wit the following script [2], to match with
python publicsuffix2 module.

  [2]: https://gist.github.com/azat/c1a7a9f1e3519793134ef4b1df5461a6

v2: fix StringHashTable padding requirements
Fixes: #39468
Follow-up for: #17748
Signed-off-by: Azat Khuzhin <a.khuzhin@semrush.com>
2022-07-26 08:34:30 +03:00
..
ci Merge pull request #39454 from ClickHouse/status-names 2022-07-24 17:39:16 +03:00
config
fuzz implementation of parseTimeDelta function 2022-07-19 09:33:02 +08:00
instructions
integration nats-py is missing also 2022-07-21 22:47:40 -04:00
jepsen.clickhouse-keeper
perf_drafts
performance
queries Add support of !/* (exclamation/asterisk) in custom TLDs 2022-07-26 08:34:30 +03:00
.gitignore
clickhouse-test Fix typo 2022-07-18 20:15:18 +02:00
CMakeLists.txt
msan_suppressions.txt
tsan_suppressions.txt
ubsan_suppressions.txt Add ubsan suppression for protobuf (#31835) 2021-11-26 11:36:36 +03:00