SChernykh
644f4cc017
Fixed unaligned memory accesses
2022-01-26 17:18:18 +01:00
SChernykh
7f2f50a8d9
RandomX: don't restart mining threads when the seed changes
...
It helps to not loose huge pages when the seed changes (every 2048 blocks, ~2.8 days).
2021-12-25 13:39:15 +01:00
SChernykh
93081eb1f6
GhostRidere: set correct priority for helper threads
...
Fixes #2825
2021-12-20 12:05:17 +01:00
SChernykh
a4d5d0a75a
Added VAES support for Cryptonight variants
2021-11-28 20:49:54 +01:00
SChernykh
ceaebfd877
GhostRider algorithm (Raptoreum) support
2021-11-23 08:14:01 +01:00
SChernykh
3bece0ff40
Fix: use cn-heavy optimization only for Vermeer CPUs
...
Fixes #2680
2021-11-11 07:57:05 +01:00
Tony Butler
ef6011ac12
Fix compile warning when WITH_RANDOMX=OFF
2021-09-19 18:12:46 -06:00
XMRig
c7ac314110
Code cleanup based on Clang-Tidy.
2021-08-25 18:45:15 +07:00
SChernykh
1b4abe1e98
Fix: don't send miner signature during regular mining
2021-07-01 19:31:55 +02:00
SChernykh
cf104ebdc5
Update signing algorithm
2021-06-21 23:49:13 +02:00
SChernykh
a136790bee
Added support for solo mining with miner signatures (Wownero)
2021-06-16 18:07:36 +02:00
SChernykh
da7f5826cb
Added support for Uplexa (cn/upx2 algorithm)
2021-04-17 14:53:42 +02:00
SChernykh
94d2cac775
Fix GCC compilation
2021-03-04 10:45:39 +01:00
SChernykh
70cddc06ba
Fix crash in cn-heavy on Zen3 with manual thread count
2021-03-04 10:02:35 +01:00
SChernykh
e8a99809b6
Fixed crash when GPU mining cn-heavy on Zen3 system
2021-02-18 14:49:37 +01:00
SChernykh
8af8df25aa
Optimized cn-heavy for Zen3
...
- Uses scratchpad interleaving to access only the closest L3 slice from each CPU core.
- Also activates MSR mod for cn-heavy because CPU prefetchers get confused with interleaving
- 7-8% speedup on Zen3
2021-02-07 22:05:11 +01:00
XMRig
bd82b3c852
Added GpuWorker class.
2020-12-04 22:25:28 +07:00
XMRig
8686e08336
Fixed, benchmark validation on NUMA hardware produced incorrect results in some conditions.
2020-11-25 09:35:11 +07:00
XMRig
75f18c9b31
Use static RandomX seed for benchmark.
2020-11-20 08:15:04 +07:00
XMRig
a2a0defeef
Submit top benchmark diff.
2020-11-17 07:33:20 +07:00
XMRig
c1d99bfa09
Benchmark refactoring, zero delay submit and unified HTTP layer.
2020-11-16 16:22:34 +07:00
XMRig
926871cbe1
Removed non thread safe access to config.
2020-11-16 07:58:28 +07:00
SChernykh
c10ec90b60
Make single thread bench cheat-resistant
...
Each hash is dependent on the previous hash to make multi-threaded cheating impossible.
2020-11-15 20:38:27 +01:00
XMRig
7fc7b976bf
Removed m_bench field from Job class.
2020-10-16 20:29:11 +07:00
XMRig
a152d6be42
Added CMake option WITH_BENCHMARK.
2020-10-16 02:18:07 +07:00
SChernykh
2ecece7b3d
Added benchmark and stress test
...
Easy to use and zero configuration embedded benchmark/stress test.
2020-10-14 19:45:05 +02:00
xmrig
581d004568
Merge pull request #1890 from SChernykh/dev
...
Added argon2/chukwav2 algorithm
2020-10-13 14:54:34 +07:00
SChernykh
4f7186cb0e
Added argon2/chukwav2 algorithm
...
New Turtlecoin algorithm. Source: https://github.com/turtlecoin/turtlecoin/blob/development/src/crypto/hash.h#L57
2020-10-12 08:26:57 +02:00
cohcho
4a74ce3242
CPU: use raw counter
2020-10-10 13:28:14 +00:00
SChernykh
3fbf2ac3d4
More precise hashrate calculation
...
- Use only steady timestamp counters to guarantee correctness
- CPU backend: directly measure total hashrate using raw hash counters from each thread; update data more often on ARM CPUs because they're slower
- GPU backends: directly measure total hashrate too, but use interpolator with 4 second lag to fix variance from batches of hashes
Total hashrate is now measured directly (realtime for CPU, 4 seconds lag for GPU), so it might differ a bit from the sum of all thread hashrates because data points are taken at different moments in time.
Overhead is reduced a lot since it doesn't have to go through all threads to calculate max total hashrate on every timer tick (2 times a second).
2020-10-10 11:18:01 +02:00
cohcho
a705ab775b
RandomX: align args
...
tempHash/output must be 16-byte aligned for randomx_calculate_hash{,_first,_next}
2020-10-07 14:47:18 +00:00
SChernykh
abb78302b8
Try to allocate scratchpad from dataset's 1 GB huge pages, if normal huge pages are not available
2020-07-31 13:37:22 +02:00
SChernykh
7f00cb59d2
Conceal (CCX) support
2020-06-07 01:01:45 +02:00
SChernykh
22b937cc1c
KawPow WIP
2020-05-27 16:19:57 +02:00
SChernykh
680e4dd865
Fix code style
2020-04-09 14:31:42 +02:00
SChernykh
abb3340cc7
RandomX JIT refactoring
...
- Smaller memory footprint
- A bit faster overall
2020-04-09 14:24:54 +02:00
SChernykh
69cbfd682a
Use node number instead of affinity
2020-04-07 18:46:22 +02:00
SChernykh
6ae37a9519
Pooled allocation of RandomX VMs
...
+0.5% speedup on Zen2 when the whole L3 cache is used (16 threads on 3700X/3800X, 32 threads on 3950X).
2020-04-07 18:31:35 +02:00
XMRig
92a258f142
Added command line option --astrobwt-avx2
2020-03-12 00:04:07 +07:00
SChernykh
e22f798085
AVX2 optimized code for AstroBWT
...
Added "astrobwt-avx2" parameter in config.json, it's turned off ("false") by default.
4-5% speedup on CPUs with proper AVX2 support (AMD Ryzen starting with Zen2, Intel Core starting with Haswell).
There will be no speedup on the following CPUs:
- Intel Pentium/Celeron don't support AVX2
- AMD Zen/Zen+ have only half-speed AVX
GCC compiled version is faster without AVX2, MSVC compiled version is faster with AVX2
2020-03-10 22:35:14 +01:00
XMRig
6a45d5dcc9
Update year.
2020-03-06 12:57:21 +07:00
SChernykh
eeadea53e2
AstroBWT 20-50% speedup
...
Skips hashes with large stage 2 size. Added configurable `astrobwt-max-size` parameter, default value is 550, min 400, max 1200, optimal value ranges from 500 to 600 depending on CPU.
- Intel CPUs get 20-25% speedup
- 1st- and 2nd-gen Ryzens get 30% speedup
- 3rd-gen Ryzens get up to 50% speedup
2020-03-05 12:20:21 +01:00
SChernykh
14ef99ca67
AstroBWT algorithm (DERO) support
...
To test:
- Download https://github.com/deroproject/derosuite/releases/tag/AstroBWT
- Run daemon with `--testnet` in command line
In config.json:
- "coin":"dero"
- "url":"127.0.0.1:30306"
- "daemon:"true"
2020-02-29 22:41:24 +01:00
XMRig
c307433900
Fixed nicehash nonce overflow for CPU backend.
2020-02-06 17:19:08 +07:00
XMRig
402c44b547
Added "cn-pico/tlo".
2019-12-29 00:29:19 +07:00
XMRig
d32df84ca5
Memory allocation refactoring.
2019-12-08 23:17:39 +07:00
SChernykh
d0df824599
Optimized dataset read for Ryzen CPUs
...
Removed register dependency in dataset read, +0.8% speedup on average.
2019-12-08 16:14:02 +01:00
XMRig
c3fd5835c3
Added CPU option "yield".
2019-12-03 09:04:20 +07:00
SChernykh
84d7eb05f3
RandomX fixes
...
Intel JCC erratum fix and various other improvements, see more here: https://www.phoronix.com/scan.php?page=article&item=intel-jcc-microcode&num=1
2019-12-01 08:46:35 +01:00
XMRig
23ebcfb2db
Display backend for shares.
2019-10-29 15:43:13 +07:00