Commit Graph

47 Commits

Author SHA1 Message Date
Robert Schulze
9c5017323d
Add comment about bf16 to CMake docs 2024-11-15 11:50:08 +00:00
Alexey Milovidov
62c94a7841 Maybe better 2024-11-10 16:40:24 +01:00
Robert Schulze
36fbc4ac32
Docs: Follow-up for #70585 2024-10-14 19:02:09 +00:00
Daniil Gentili
a1f715ecf4 Fix ARCH_NATIVE CMake flag 2024-10-11 11:36:44 +00:00
Robert Schulze
c51429b1ef
Simplify more 2023-11-17 17:02:52 +00:00
Robert Schulze
fcce5409f4
Simplify 2023-11-17 17:02:52 +00:00
Alexey Milovidov
80bbe7dcc6 Less CMake checks 2023-11-14 06:55:06 +01:00
Robert Schulze
ac7caace8a
Update comments 2023-03-01 09:34:46 +00:00
Robert Schulze
c78e99eb56
ARM: Activate LDAPR with -march flag instead via -XClang
The `-XClang ... -Xclang ...` flag is internal, undocumented and also
needs `-Wno-unused-command-line-argument`. Enable LDAPR instructions on
ARM via "official" -march. For more informations, see

- https://developer.arm.com/documentation/101754/0619/armclang-Reference/armclang-Command-line-Options/-march?_ga=2.24245759.493510493.1644226748-1698312558.1644226748
- https://developer.arm.com/documentation/dui0801/g/A64-Data-Transfer-Instructions/LDAPR?lang=en
2023-02-27 12:46:58 +00:00
Robert Schulze
c34b4c8ed1
Fix grammar 2023-02-24 09:43:02 +00:00
Robert Schulze
032dae5d29
Quick-n-dirty --> best effort 2023-02-24 09:41:02 +00:00
Robert Schulze
329abe28ba
Same check for x86 2023-02-24 09:38:32 +00:00
Robert Schulze
2f47046fe1
CMake: Add quick-n-dirty check that the build machine isn't too old 2023-02-23 23:05:55 +00:00
Robert Schulze
d67a3b9faa
Document usage of check_cxx_source_compiles instead of check_cxx_source_runs
Cf. #42754
2023-01-02 22:20:20 +00:00
Alexey Milovidov
c48127534b
Merge pull request #41753 from pkit/morton_coding
add Morton Coding (ZCurve)
2022-10-25 06:10:37 +02:00
Constantine Peresypkin
5b3b11b517 add Morton Coding (ZCurve) 2022-10-19 15:59:25 +02:00
Boris Kuschel
e45c800cdf Fix power8+ support add power9+ support 2022-10-18 17:14:43 -05:00
Daniel Kutenin
9825b4d1f9
Move unused command line arguments to a proper place 2022-09-26 20:56:52 +01:00
Daniel Kutenin
cec641a71b
Add ldapr for Arm instances 2022-09-26 13:00:18 +01:00
Robert Schulze
49c4f1f9c6
Document flags 2022-09-21 21:18:53 +00:00
Robert Schulze
cfd8d4e1f1
Add CRC32 to compat build 2022-09-21 14:52:27 +00:00
Robert Schulze
971cef8bd2
Provide x86 SIMD flag options only on x86 2022-09-21 09:30:51 +00:00
Robert Schulze
e1def723f8
Add special x86-SSE2-only build 2022-09-05 12:51:44 +00:00
Alexey Milovidov
6fbff6d1f6
Revert "Avx enablement" 2022-08-21 15:47:02 +03:00
Maksim Kita
64b41e8676
Performance check build AVX 2022-07-22 11:53:16 +00:00
mergify[bot]
62c62a2df7
Merge branch 'master' into lz4-decompress-vbmi 2022-07-11 14:23:17 +00:00
Danila Kutenin
08e3f77a9c Optimize most important parts with NEON SIMD
First part, updated most UTF8, hashing, memory and codecs. Except
utf8lower and upper, maybe a little later.

That includes huge amount of research with movemask dealing. Exact
details and blog post TBD.
2022-06-15 13:19:29 +00:00
Wangyang Guo
0cd2a6b9c3 Add ENABLE_AVX512_VBMI build flag 2022-06-06 13:45:25 +08:00
Alexander Gololobov
4350935377 Re-enable SSE2 for PowerPC 2022-05-20 08:51:48 +02:00
Robert Schulze
e06db56c67
Move ppc cpu features out of the x86 block 2022-05-15 09:48:48 +02:00
Alexander Gololobov
1dc3d04837 Removed __SSE2__ flag for PPC64 2022-05-12 16:20:19 +02:00
Alexey Milovidov
380f1a0052 Limit PowerPC code generation to Power8 2022-04-22 06:26:40 +02:00
hanqf-git
71ce4b46d8 Add x86 feature avx512 support for memcmpSmall. Co-authored-by: @vesslanjin jun.i.jin@intel.com 2022-01-17 16:13:49 +08:00
hanqf-git
d2bc77ac47 Add x86 feature avx512 support for memcmpSmall 2022-01-17 12:03:12 +08:00
Alexey Milovidov
541f3ae1f9 Fix build on AArch64 2021-11-21 14:30:36 +03:00
Alexey Milovidov
6246cec409 Fix build on AArch64 2021-11-21 14:30:04 +03:00
jasperzhu
0217ed09ac revise compile flag 2021-10-27 05:19:15 +05:30
jasperzhu
3d0e595579 revise compile flag 2021-10-13 22:49:21 +05:30
jasperzhu
4b080489b5 revise flags definitions 2021-10-13 07:34:21 +05:30
jasperzhu
80760c0a0c revise compile definitions 2021-10-13 05:59:20 +05:30
jasperzhu
f08dd0e8d8 modify flags definitions 2021-10-13 05:13:34 +05:30
jasperzhu
002c8d3a80 add disable vectorize flag due to llvm bug 2021-10-13 03:38:18 +05:30
jasperzhu
be28e94471 add compile flag -force-vector-width=16 due to llvm autovectorization bug with avx512 2021-10-12 06:30:56 +05:30
jasperzhu
0ef2624408 add specific flags for compiling filter operation source files 2021-10-12 04:04:27 +05:30
jasperzhu
a3d629a5b5 add x86 feature avx2/avx512 support for filter implementation 2021-10-11 22:51:13 +05:30
Alexey Milovidov
eda489dde8 Untangle CPU features specification in CMake 2021-08-10 02:58:51 +03:00
Ivan Lezhankin
ac47427b76 cmake/test_cpu → cmake/cpu_features 2019-09-13 17:54:22 +03:00