Author

Topic: How to split hex ranges in smaller parts (Read 326 times)

hero member
Activity: 630
Merit: 731
Bitcoin g33k
December 03, 2022, 10:30:19 AM
#19
That was my thought about it, too. A normal user would not use the switch -O or -OO or the PYTHONOPTIMIZE environment variable and if he does, then he knows the consequences and hopefully knows exactly what he is doing.

Thanks for your feedback.
hero member
Activity: 510
Merit: 4005
December 03, 2022, 10:15:20 AM
#18
EDIT: Done! Thanks for pointing to Python's assertion [1] [2] which was helpful. I am not sure if I did according to best-practice, especially following paragraph in [1] has made me a bit puzzled:

Quote
Assertions should *not* be used to test for failure cases that can occur because of bad user input or operating system/environment failures, such as a file not being found. Instead, you should raise an exception, or print an error message, or whatever is appropriate. One important reason why assertions should only be used for self-tests of the program is that assertions can be disabled at compile time.
Yup, that's the kind of dogma that I was referring to previously. Programmers love saying: "Don't *ever* do this/that" but often what they're talking about amounts to an inconsequential issue (in practice).

If you run your script like this:

Code:
python3 -O range_slicer.py

Then the assert statements won't make it into the compiled bytecode. So, to make sure that your input validation remains intact, regardless of how your script is invoked, you should replace the assert statements with something more resilient (like if statements that lead to an error message, or maybe your own fail_if function that raises an exception when it's passed something that evaluates to True, etc.)

I often don't bother to do this with my own scripts, but lots of people seem to think it's mightily important, so if you agree with their thinking or you don't want to be nagged about it in the future then you should probably do it.

Don't ignore "best practice", but also remember to cultivate a healthy disregard for it, too; it can easily become a crutch that prevents you from thinking deeply and developing your own style.
hero member
Activity: 630
Merit: 731
Bitcoin g33k
December 03, 2022, 03:17:44 AM
#17
@PowerGlove
Thanks a bunch! Really appreciate your feedback, as well as your offered assistance here and also for future questions. That's not a given and so I appreciate it very much. Thank you for your motivating words, which are very helpful.

Yes, absolutely right. I still need to validate the input to avoid weird effects and error messages. EDIT: Done! Thanks for pointing to Python's assertion [1] [2] which was helpful. I am not sure if I did according to best-practice, especially following paragraph in [1] has made me a bit puzzled:

Quote
Assertions should *not* be used to test for failure cases that can occur because of bad user input or operating system/environment failures, such as a file not being found. Instead, you should raise an exception, or print an error message, or whatever is appropriate. One important reason why assertions should only be used for self-tests of the program is that assertions can be disabled at compile time.

but it seems to work Smiley As I said, I'm still in my infancy and there is still a lot to learn.

[1] https://pythongeeks.org/assertion-in-python/
[2] https://wiki.python.org/moin/UsingAssertionsEffectively
hero member
Activity: 510
Merit: 4005
December 02, 2022, 08:54:52 PM
#16
@citb0in: Nice job. It's (very) impressive how quickly you're learning Python!

I like the solution you came up with, using the argparse module is sensible, and using the divmod function is a smart choice, too. I think it's clear that you're past the point of needing help (at least with this project). I'm sure that there are a few combinations of command-line arguments that will produce unexpected results (there always are), but I think you're capable of finding/fixing them all on your own (maybe a few assert statements, or similar, after parsing the arguments would help; making sure that some invariants hold, like: start >= 0, end > start, number > 0, number <= (end-start)+1, etc.)

Please bear with me if I have made mistakes and bad programming syntax, correct me accordingly so that I can learn and get better. Of course I am open for suggestions and corrections and constructive criticism is very welcome.
You're doing great. For a conscientious and self-reliant type like you, my only suggestions are to keep doing what you're doing and to be selective about the feedback that you internalize. Programmers are a dogmatic bunch and like to over-critique each other's code — only the really experienced ones seem to be able to restrict themselves to truly meaningful feedback. If someone has a whole bunch of stuff to say about your code then remember to pick and choose what's useful to you and ignore the rest; there's some kind of Dunning-Kruger effect where the least experienced programmers tend to be the ones who are brimming with the most advice.

If you ever get stuck on a programming-related problem and don't feel like posting about it, feel free to PM me. Wink
hero member
Activity: 630
Merit: 731
Bitcoin g33k
December 02, 2022, 03:18:23 PM
#15
Hi PowerGlove. Sorry for late reply.

The first way is using floating point division and the second way is using integer floor division (via GMP), so most combinations of input range arguments will lead to outputs that disagree.
Thanks for pointing that out.

You can tweak n0nce's script to do floor division by changing the / into a // (and maybe remove the two int() calls further on, which will then be unnecessary). This will also have the effect of letting it work with much bigger ranges (right now you'll run into problems if you try it with ranges that strain the limited precision of Python's float type; for example, if you run it with these arguments: 0 0x22222222222222222 2, it will produce "truncated" slices). However, by making this change you'll also ruin a nice property that this script has (automatically sizing the slices so that they always cover the entire input range); try running it with these arguments: 1 10 5, both before and after making the change and you'll see what I mean.
I did so and I understand what you mean, you're right. Also, as far as the big numbers are concerned, it behaves as you said. So I really need to get away from the normal float division and focus on floor division. For this I had studied various help pages on Python floor division. There are several libraries, especially in 'math' that allow to round up or down (e.g. math.ceil, math.floor) or round() and some more. However, I always prefer clean code so that you don't always have to rely on other libraries. So if there is an elegant way without additional libraries, I would prefer it. I stumbled over and noticed the speciality of how floor division behaves with negative numbers and I thought I could possibly take advantage of this to round up (eg (10-1)/5 to get "2" as solution. My initial approach was this one:
Code:
size = floor((end-start)*-1//n)*-1
But unfortunately I was completely wrong, it doesn't work like that. So I searched further and found various solutions that can effectively and simply determine ranges. My desired goal is that all ranges should have as identical sizes as possible. . The last range can be of any size and should serve as a residual container in which the rest can be placed.

I'm curious about the way the slices overlap in the examples you've shared — are you sure that's what you want? Wouldn't you prefer non-overlapping slices? By that, I mean, shouldn't each slice begin "ahead of" (instead of "on") the previous slice's end value? For example, if I had to divide the (inclusive) range {1..10} into 5 parts, I think it makes more sense to go: {1..2} {3..4} {5..6} {7..8} {9..10}.

You're basically right, of course. However, I would still prefer this range size for visual reasons, because it is simply clearer and easier to understand for the human eye. With decimal numbers this doesn't seem to be such a big problem, but with hex representation you may not always recognize at first glance whether the range runs without gaps.

Example:

In cases where the input range doesn't evenly divide, there's also the issue of what to do with the "remainder". This issue solves itself when using floating point (by automatically distributing it over the slices), but when doing integer divides, you have to decide yourself what to do with the remainder (e.g. ignore it, put it in the last slice, overshoot the input range like that C/C++ program sometimes does, etc.) I also think it makes more sense to calculate the length of a slice like this:
Code:
size = ((end-start)+1)//n
But that interacts with other design choices and I don't want to overwhelm you with too many considerations. I've given you some stuff to think about, but if you get stuck or would like a more verbose script to look at, let me know and I'll put something together for you.

I have tried a few things and ended up with a solution that evenly distributes the values across the ranges. I also changed the parameter query, the suggestion of n0nce was great but I like it a bit cleaner in the code to get rid of the repeating try: except: lines. Pythons' argument parser fitted perfectly. I have therefore modified the program as far as possible and created a first version. My current program allows two ways of execution. The range is mandatory and must always be entered as a parameter so that the program has something to do. By default, the program will use the "n slices" mode, where n is the number of slices desired. If no input is made for 'n', the default value 2 is used.

However, there are also cases where the user wants a certain size of the slices to be generated and doesn't care about the total number of slices to be generated. I integrated this "chunks" mode that can be used with the command line switch -c (for chunks).

Examples:

Range 20 - 90 will be sliced into 15 equal parts
Code:
./range_slicer.py -r 20 90 -n 15

Range 20 - 95 will be sliced into chunks with size 10
Code:
./range_slicer.py -r 20 95 -c -n 10

The current program version is not yet compatible with hex but only with decimal input. I will soon rebuild this so that interoperability is also ensured and so that the user can enter either decimal or hex and can even mix among themselves. I repeat and emphasize that I am an absolute Python beginner and want to make my first learning steps and gain experience. I also tried to create an account on github so that I can upload and manage code there in the future. Please bear with me if I have made mistakes and bad programming syntax, correct me accordingly so that I can learn and get better. Of course I am open for suggestions and corrections and constructive criticism is very welcome.

==> range_slicer by citb0in @github <==
hero member
Activity: 862
Merit: 662
November 25, 2022, 08:02:23 AM
#14
It seems that alberto's slicer tool sometimes creates one more additional range than user input was. Only the first range pair matches always between both programs.

The parameters need to have the prefix 0x for hexadecimal numbers.

the ranges need to be divisible by the given number, if not it will create an extra range with some values outside of the given range.

And yes I spotted some bug in the ranges, I will solve it
hero member
Activity: 510
Merit: 4005
November 25, 2022, 02:32:02 AM
#13
When I compare the output of both snippets, they differ. I have no clue what the reason therefore is. See here ..
The differences in output you're finding between the programs you're comparing is (mostly) due to how the slice length is being calculated.

One is doing this (in Python):

Code:
size = (end-start)/n

And the other is doing this (in C/C++):

Code:
mpz_fdiv_q(slice_value,diff,number_of_ranges);

The first way is using floating point division and the second way is using integer floor division (via GMP), so most combinations of input range arguments will lead to outputs that disagree.

You can tweak n0nce's script to do floor division by changing the / into a // (and maybe remove the two int() calls further on, which will then be unnecessary). This will also have the effect of letting it work with much bigger ranges (right now you'll run into problems if you try it with ranges that strain the limited precision of Python's float type; for example, if you run it with these arguments: 0 0x22222222222222222 2, it will produce "truncated" slices). However, by making this change you'll also ruin a nice property that this script has (automatically sizing the slices so that they always cover the entire input range); try running it with these arguments: 1 10 5, both before and after making the change and you'll see what I mean.

I'm curious about the way the slices overlap in the examples you've shared — are you sure that's what you want? Wouldn't you prefer non-overlapping slices? By that, I mean, shouldn't each slice begin "ahead of" (instead of "on") the previous slice's end value? For example, if I had to divide the (inclusive) range {1..10} into 5 parts, I think it makes more sense to go: {1..2} {3..4} {5..6} {7..8} {9..10}.

In cases where the input range doesn't evenly divide, there's also the issue of what to do with the "remainder". This issue solves itself when using floating point (by automatically distributing it over the slices), but when doing integer divides, you have to decide yourself what to do with the remainder (e.g. ignore it, put it in the last slice, overshoot the input range like that C/C++ program sometimes does, etc.)

I also think it makes more sense to calculate the length of a slice like this:

Code:
size = ((end-start)+1)//n

But that interacts with other design choices and I don't want to overwhelm you with too many considerations. I've given you some stuff to think about, but if you get stuck or would like a more verbose script to look at, let me know and I'll put something together for you.
newbie
Activity: 16
Merit: 1
legendary
Activity: 1568
Merit: 6660
bitcoincleanup.com / bitmixlist.org
November 16, 2022, 09:27:40 AM
#11
Based on SO posts it seems like you want to iterate over those numbers (key cracking?!) in which case you shouldn't create them like that, you should instead parallelize the process. It all takes one line in C# or two if you want to set the number of cores to use:
Code:
ParallelOptions op = new() { MaxDegreeOfParallelism = 2 };
Parallel.For(start, end, op, Action);

Parallelism is not needed for that particular operation because the action of splitting ranges is usually only done after some other range full of private keys has already been exhausted, so it will not be running frequently.
hero member
Activity: 630
Merit: 731
Bitcoin g33k
November 16, 2022, 03:36:31 AM
#10
@COBRAS
can you be more specific please? what do you mean by the link you posted?

EDIT: Oh I understood now. You were referring to the range slicer written in C++ by AlbertoBSD. There it is. Will try it ... thanks for pointing out

I slightly changed the output format of the python script that n0nce provided, so I get pairwise ranges per each line. I added two small functions for later use (converting dec<>hex).

Code:
#!/usr/bin/python3
import sys

try: start = int(sys.argv[1])
except:
  try: start = int(sys.argv[1], 16)
  except: exit(-1)
try: end = int(sys.argv[2])
except:
  try: end = int(sys.argv[2], 16)
  except: exit(-1)
try: n = int(sys.argv[3])
except:
  try: n = int(sys.argv[3], 16)
  except: exit(-1)

size = (end-start)/n
# output in hex
#print(list(map(lambda x: (hex(int(start+x*size)), hex(int(start+size*(x+1)))), range(n))))

# output in decimal
#print(list(map(lambda x: (int(start+x*size), int(start+size*(x+1))), range(n))))

for x in range(n):
    print((hex(int(start+x*size)), hex(int(start+size*(x+1)))))

def hex2int(hexdigits):
    return int(hexdigits, 16)

def int2hex(number):
    return hex(number)

When I compare the output of both snippets, they differ. I have no clue what the reason therefore is. See here ..

Quote
$ ./split_range.py 1 255 4

('0x1', '0x40')
('0x40', '0x80')
('0x80', '0xbf')
('0xbf', '0xff')


Quote
$ ./slicer 1 255 4
Range start: 0x1
Range end: 0xff
number: 0x4
slice value: 0x3f
1:40
40:7f
7f:be
be:fd
fd:13c

When user input is 2 slices the result matches between both programs:

Quote
$ ./split_range.py 1000 2000 2

('0x3e8', '0x5dc')
('0x5dc', '0x7d0')

Quote
$ ./slicer 1000 2000 2

Range start: 0x3e8
Range end: 0x7d0
number: 0x2
slice value: 0x1f4
3e8:5dc
5dc:7d0

Some more examples ...

Quote
$ ./split_range.py 1000 2000 7

('0x3e8', '0x476')
('0x476', '0x505')
('0x505', '0x594')
('0x594', '0x623')
('0x623', '0x6b2')
('0x6b2', '0x741')
('0x741', '0x7d0')

Quote
$ ./slicer 1000 2000 7

Range start: 0x3e8
Range end: 0x7d0
number: 0x7
slice value: 0x8e
3e8:476
476:504
504:592
592:620
620:6ae
6ae:73c
73c:7ca
7ca:858

here the output matches...

Quote
$ ./split_range.py 1000 2000 8

('0x3e8', '0x465')
('0x465', '0x4e2')
('0x4e2', '0x55f')
('0x55f', '0x5dc')
('0x5dc', '0x659')
('0x659', '0x6d6')
('0x6d6', '0x753')
('0x753', '0x7d0')

Quote
$ ./slicer 1000 2000 8

Range start: 0x3e8
Range end: 0x7d0
number: 0x8
slice value: 0x7d
3e8:465
465:4e2
4e2:55f
55f:5dc
5dc:659
659:6d6
6d6:753
753:7d0

I thought it might be the case when n is even or odd, but I find no relationship between odd/even numbers. See here, n=12 leads to different range pairs

Quote
$ ./split_range.py 1000 2000 12

('0x3e8', '0x43b')
('0x43b', '0x48e')
('0x48e', '0x4e2')
('0x4e2', '0x535')
('0x535', '0x588')
('0x588', '0x5dc')
('0x5dc', '0x62f')
('0x62f', '0x682')
('0x682', '0x6d6')
('0x6d6', '0x729')
('0x729', '0x77c')
('0x77c', '0x7d0')

Quote
$ ./slicer 1000 2000 12

Range start: 0x3e8
Range end: 0x7d0
number: 0xc
slice value: 0x53
3e8:43b
43b:48e
48e:4e1
4e1:534
534:587
587:5da
5da:62d
62d:680
680:6d3
6d3:726
726:779
779:7cc
7cc:81f

It seems that alberto's slicer tool sometimes creates one more additional range than user input was. Only the first range pair matches always between both programs.
member
Activity: 873
Merit: 22
$$P2P BTC BRUTE.JOIN NOW ! https://uclck.me/SQPJk
hero member
Activity: 630
Merit: 731
Bitcoin g33k
November 16, 2022, 01:44:40 AM
#8
Thanks for the hint, which certainly has meaning and may seem useful for parallel processing. In my case, however, I only need this in a small Python snippet to quickly build some ranges and manually feed a spreadsheet with them.

The snippet shown by n0nce helped me already, I just have to see how to rebuild it so that it still does INT and HEX output and also displays more info, but I have to learn Python first. The output formatting I would like to improve then, so that not everything is spit out in a line with commas but in a kind of matrix (with INC and DEC values accordingly).
legendary
Activity: 3472
Merit: 10611
November 15, 2022, 11:04:44 PM
#7
Based on SO posts it seems like you want to iterate over those numbers (key cracking?!) in which case you shouldn't create them like that, you should instead parallelize the process. It all takes one line in C# or two if you want to set the number of cores to use:
Code:
ParallelOptions op = new() { MaxDegreeOfParallelism = 2 };
Parallel.For(start, end, op, Action);
hero member
Activity: 630
Merit: 731
Bitcoin g33k
November 15, 2022, 04:04:00 PM
#6
Thanks a bunch @n0nce !!! Will try it soon ... meanwhile I stumbled over this, maybe it is helpful, maybe not ?
hero member
Activity: 924
Merit: 5950
not your keys, not your coins!
November 15, 2022, 03:43:11 PM
#5
Code:
start = 0x1000
end = 0x2000
n = 2
size = (end-start)/n

print(list(map(lambda x: (hex(int(start+x*size)), hex(int(start+size*(x+1)))), range(n))))

As script with hex and int parameters:
Code:
import sys

try: start = int(sys.argv[1])
except:
  try: start = int(sys.argv[1], 16)
  except: exit(-1)
try: end = int(sys.argv[2])
except:
  try: end = int(sys.argv[2], 16)
  except: exit(-1)
try: n = int(sys.argv[3])
except:
  try: n = int(sys.argv[3], 16)
  except: exit(-1)

size = (end-start)/n
print(list(map(lambda x: (hex(int(start+x*size)), hex(int(start+size*(x+1)))), range(n))))

Code:
$ python3 test.py 0x1000 0x2000 2
[('0x1000', '0x1800'), ('0x1800', '0x2000')]
$ python3 test.py 0x1000 0x2000 4
[('0x1000', '0x1400'), ('0x1400', '0x1800'), ('0x1800', '0x1c00'), ('0x1c00', '0x2000')]
$ python3 test.py 0x1000 0x2000 6
[('0x1000', '0x12aa'), ('0x12aa', '0x1555'), ('0x1555', '0x1800'), ('0x1800', '0x1aaa'), ('0x1aaa', '0x1d55'), ('0x1d55', '0x2000')]
$ python3 test.py 0x1000 0x2000 8
[('0x1000', '0x1200'), ('0x1200', '0x1400'), ('0x1400', '0x1600'), ('0x1600', '0x1800'), ('0x1800', '0x1a00'), ('0x1a00', '0x1c00'), ('0x1c00', '0x1e00'), ('0x1e00', '0x2000')]
hero member
Activity: 630
Merit: 731
Bitcoin g33k
November 15, 2022, 03:32:57 PM
#4
@n0nce: we cross-posted Smiley I was still editing. See my posting again

@NotATether: I'm sure, yes. Unfortunately I'm not good in Python (yet). I have written two small bash scripts that allow me to do dec<>hex conversions using linux command 'bc'. I wish I would be knowledgeable in Python, but it's still on my to-do-list Tongue that's why I am asking if anyone is aware of such a small python snippet.
legendary
Activity: 1568
Merit: 6660
bitcoincleanup.com / bitmixlist.org
November 15, 2022, 03:31:40 PM
#3
That looks like something that can be easily written in a single Python file. Unfortunately I do not know of any existing implementations.in the wild. If you are knowledgeable in Python then you can make your own script without even knowledge of cryptography being required.
hero member
Activity: 924
Merit: 5950
not your keys, not your coins!
November 15, 2022, 03:31:40 PM
#2
Since splitting a range into equal parts means that the first number in the next subrange is the last number of the previous subrange plus one, means you just need to divide (end-start) / n to get the size of subranges and add it over and over again to get the start of the next subrange.

Example:
Start = 0x1000 = 4096
End = 0x2000 = 8192
n = 2

Calculate subrange size: (0x2000-0x1000)/2 = 2048.
Meaning the first subrange goes from 4096+0*2048 to 4096+1*2048, and the second subrange goes from 4096+1*2048 to 4096+2*2048 = 0x2000.
hero member
Activity: 630
Merit: 731
Bitcoin g33k
November 15, 2022, 03:18:06 PM
#1
Is there any tool or python snippet available out there that can split a defined range into equal n parts and outputs the results in hex (omitting the 0x prefix for hex values) but as well as decimal ? It should also work vice-versa, so it should allow inputs in hex or in decimals. When entering 0x it should treat the user input as hex value, in all other cases as decimal value.


Example:

1st argument = start of the range
2nd argument = end of the range
3rd argument = create n equal parts

Code:
./split_range.py 0x1000000 0x2000000 2
Quote
User Input Range
Start (Hex): 1000000
End (Hex): 2000000
Span (Hex): 1000000

Start (Dec): 16777216
End (Dec): 33554432
Span (Dec): 16777216

2 Slices for that range

Slice1:

Start (Hex): 1000000
End (Hex): 1800000
Span (Hex): 800000

Start (Dec): 16777216
End (Dec): 25165824
Span (Dec): 8388608


Slice2:

Start (Hex): 1800000
End (Hex): 2000000
Span (Hex): 800000

Start (Dec): 25165824
End (Dec): 33554432
Span (Dec): 8388608
Jump to: