Robert Schulze
9c5017323d
Add comment about bf16 to CMake docs
2024-11-15 11:50:08 +00:00
Alexey Milovidov
62c94a7841
Maybe better
2024-11-10 16:40:24 +01:00
Robert Schulze
36fbc4ac32
Docs: Follow-up for #70585
2024-10-14 19:02:09 +00:00
Daniil Gentili
a1f715ecf4
Fix ARCH_NATIVE CMake flag
2024-10-11 11:36:44 +00:00
Robert Schulze
c51429b1ef
Simplify more
2023-11-17 17:02:52 +00:00
Robert Schulze
fcce5409f4
Simplify
2023-11-17 17:02:52 +00:00
Alexey Milovidov
80bbe7dcc6
Less CMake checks
2023-11-14 06:55:06 +01:00
Robert Schulze
ac7caace8a
Update comments
2023-03-01 09:34:46 +00:00
Robert Schulze
c78e99eb56
ARM: Activate LDAPR with -march flag instead of via -Xclang
The `-Xclang ... -Xclang ...` flag is internal, undocumented and also
needs `-Wno-unused-command-line-argument`. Enable LDAPR instructions on
ARM via the "official" -march flag. For more information, see
- https://developer.arm.com/documentation/101754/0619/armclang-Reference/armclang-Command-line-Options/-march?_ga=2.24245759.493510493.1644226748-1698312558.1644226748
- https://developer.arm.com/documentation/dui0801/g/A64-Data-Transfer-Instructions/LDAPR?lang=en
2023-02-27 12:46:58 +00:00
Robert Schulze
c34b4c8ed1
Fix grammar
2023-02-24 09:43:02 +00:00
Robert Schulze
032dae5d29
Quick-n-dirty --> best effort
2023-02-24 09:41:02 +00:00
Robert Schulze
329abe28ba
Same check for x86
2023-02-24 09:38:32 +00:00
Robert Schulze
2f47046fe1
CMake: Add quick-n-dirty check that the build machine isn't too old
2023-02-23 23:05:55 +00:00
Robert Schulze
d67a3b9faa
Document usage of check_cxx_source_compiles instead of check_cxx_source_runs
Cf. #42754
2023-01-02 22:20:20 +00:00
Alexey Milovidov
c48127534b
Merge pull request #41753 from pkit/morton_coding
add Morton Coding (ZCurve)
2022-10-25 06:10:37 +02:00
Constantine Peresypkin
5b3b11b517
add Morton Coding (ZCurve)
2022-10-19 15:59:25 +02:00
Boris Kuschel
e45c800cdf
Fix power8+ support add power9+ support
2022-10-18 17:14:43 -05:00
Daniel Kutenin
9825b4d1f9
Move unused command line arguments to a proper place
2022-09-26 20:56:52 +01:00
Daniel Kutenin
cec641a71b
Add ldapr for Arm instances
2022-09-26 13:00:18 +01:00
Robert Schulze
49c4f1f9c6
Document flags
2022-09-21 21:18:53 +00:00
Robert Schulze
cfd8d4e1f1
Add CRC32 to compat build
2022-09-21 14:52:27 +00:00
Robert Schulze
971cef8bd2
Provide x86 SIMD flag options only on x86
2022-09-21 09:30:51 +00:00
Robert Schulze
e1def723f8
Add special x86-SSE2-only build
2022-09-05 12:51:44 +00:00
Alexey Milovidov
6fbff6d1f6
Revert "Avx enablement"
2022-08-21 15:47:02 +03:00
Maksim Kita
64b41e8676
Performance check build AVX
2022-07-22 11:53:16 +00:00
mergify[bot]
62c62a2df7
Merge branch 'master' into lz4-decompress-vbmi
2022-07-11 14:23:17 +00:00
Danila Kutenin
08e3f77a9c
Optimize most important parts with NEON SIMD
First part: updated most UTF8, hashing, memory and codecs, except
utf8lower and upper, maybe a little later.
That includes a huge amount of research on dealing with movemask. Exact
details and blog post TBD.
2022-06-15 13:19:29 +00:00
Wangyang Guo
0cd2a6b9c3
Add ENABLE_AVX512_VBMI build flag
2022-06-06 13:45:25 +08:00
Alexander Gololobov
4350935377
Re-enable SSE2 for PowerPC
2022-05-20 08:51:48 +02:00
Robert Schulze
e06db56c67
Move ppc cpu features out of the x86 block
2022-05-15 09:48:48 +02:00
Alexander Gololobov
1dc3d04837
Removed __SSE2__ flag for PPC64
2022-05-12 16:20:19 +02:00
Alexey Milovidov
380f1a0052
Limit PowerPC code generation to Power8
2022-04-22 06:26:40 +02:00
hanqf-git
71ce4b46d8
Add x86 feature avx512 support for memcmpSmall. Co-authored-by: @vesslanjin jun.i.jin@intel.com
2022-01-17 16:13:49 +08:00
hanqf-git
d2bc77ac47
Add x86 feature avx512 support for memcmpSmall
2022-01-17 12:03:12 +08:00
Alexey Milovidov
541f3ae1f9
Fix build on AArch64
2021-11-21 14:30:36 +03:00
Alexey Milovidov
6246cec409
Fix build on AArch64
2021-11-21 14:30:04 +03:00
jasperzhu
0217ed09ac
revise compile flag
2021-10-27 05:19:15 +05:30
jasperzhu
3d0e595579
revise compile flag
2021-10-13 22:49:21 +05:30
jasperzhu
4b080489b5
revise flags definitions
2021-10-13 07:34:21 +05:30
jasperzhu
80760c0a0c
revise compile definitions
2021-10-13 05:59:20 +05:30
jasperzhu
f08dd0e8d8
modify flags definitions
2021-10-13 05:13:34 +05:30
jasperzhu
002c8d3a80
add disable vectorize flag due to llvm bug
2021-10-13 03:38:18 +05:30
jasperzhu
be28e94471
add compile flag -force-vector-width=16 due to llvm autovectorization bug with avx512
2021-10-12 06:30:56 +05:30
jasperzhu
0ef2624408
add specific flags for compiling filter operation source files
2021-10-12 04:04:27 +05:30
jasperzhu
a3d629a5b5
add x86 feature avx2/avx512 support for filter implementation
2021-10-11 22:51:13 +05:30
Alexey Milovidov
eda489dde8
Untangle CPU features specification in CMake
2021-08-10 02:58:51 +03:00
Ivan Lezhankin
ac47427b76
cmake/test_cpu → cmake/cpu_features
2019-09-13 17:54:22 +03:00