|
View:
New views
20 Messages
—
Rating Filter:
Alert me
|
| < Prev | 1 - 2 | Next > |
|
|
Most efficient way to "pre-grow" a list?In Perl one can assign a value to any element of an array, even to ones corresponding to indices greater or equal than the length of the array: my @arr; $arr[999] = 42; perl grows the array as needed to accommodate this assignment. In fact one common optimization in Perl is to "pre-grow" the array to its final size, rather than having perl grow it piecemeal as required by assignments like the one above: my @arr; $#arr = 999_999; After assigning to $#arr (the last index of @arr) as shown above, @arr has length 1,000,000, and all its elements are initialized to undef. In Python the most literal translation of the first code snippet above triggers an IndexError exception: >>> arr = list() >>> arr[999] = 42 Traceback (most recent call last): File "<stdin>", line 1, in <module> IndexError: list assignment index out of range In fact, one would need to pre-grow the list sufficiently to be able to make an assignment like this one. I.e. one needs the equivalent of the second Perl snippet above. The best I can come up with is this: arr = [None] * 1000000 Is this the most efficient way to achieve this result? TIA! kynn -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?On Nov 6, 12:12 pm, kj <no.em...@...> wrote:
[snip] > In fact, one would need to pre-grow the list sufficiently to be > able to make an assignment like this one. I.e. one needs the > equivalent of the second Perl snippet above. > > The best I can come up with is this: > > arr = [None] * 1000000 > > Is this the most efficient way to achieve this result? That's a good as it gets I think. If sparsely populated I might be tempted to use a dict (or maybe defaultdict): d = {999: 42, 10673: 123} for idx in xrange(1000000): # Treat it as though it's a list of 1,000,000 items... print 'index %d has a value of %d' % (idx, d.get(idx, None)) Efficiency completely untested. Jon. -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?On Fri, Nov 6, 2009 at 06:12, kj <no.email@...> wrote:
Is there a reason you need to pre-allocate the list other than the fact that you do it that way in Perl? -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?On Fri, Nov 6, 2009 at 1:12 PM, kj <no.email@...> wrote:
> > In Perl one can assign a value to any element of an array, even to > ones corresponding to indices greater or equal than the length of > the array: > > my @arr; > $arr[999] = 42; > > perl grows the array as needed to accommodate this assignment. In > fact one common optimization in Perl is to "pre-grow" the array to > its final size, rather than having perl grow it piecemeal as required > by assignments like the one above: > > my @arr; > $#arr = 999_999; > > After assigning to $#arr (the last index of @arr) as shown above, > @arr has length 1,000,000, and all its elements are initialized to > undef. > > In Python the most literal translation of the first code snippet > above triggers an IndexError exception: > >>>> arr = list() >>>> arr[999] = 42 > Traceback (most recent call last): > File "<stdin>", line 1, in <module> > IndexError: list assignment index out of range > > In fact, one would need to pre-grow the list sufficiently to be > able to make an assignment like this one. I.e. one needs the > equivalent of the second Perl snippet above. > > The best I can come up with is this: > > arr = [None] * 1000000 > > Is this the most efficient way to achieve this result? It depends - what do you want to do with it? My first hunch would be to use a dictionary instead of a list, then the whole problem disappears. If there is a reason you don't want to do that, what is it? -- André Engels, andreengels@... -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?[kj]
> In fact, one would need to pre-grow the list sufficiently to be > able to make an assignment like this one. I.e. one needs the > equivalent of the second Perl snippet above. > > The best I can come up with is this: > > arr = [None] * 1000000 > > Is this the most efficient way to achieve this result? Yes. Raymond -- http://mail.python.org/mailman/listinfo/python-list |
|
|
|
|
|
Re: Most efficient way to "pre-grow" a list?In <hd1os1$aqs$1@...> "Emily Rodgers" <emily@...> writes:
>"Andre Engels" <andreengels@...> wrote in message >news:mailman.2696.1257520404.2807.python-list@...... >>On Fri, Nov 6, 2009 at 1:12 PM, kj <no.email@...> wrote: >[snip] >>> arr = [None] * 1000000 >>> >>> Is this the most efficient way to achieve this result? >> >> It depends - what do you want to do with it? My first hunch would be >> to use a dictionary instead of a list, then the whole problem >> disappears. If there is a reason you don't want to do that, what is >> it? >I second this. It might seem a sensible thing to do in perl, but I can't >imagine what you would actually want to do it for! Seems like an odd thing >to want to do! As I said, this is considered an optimization, at least in Perl, because it lets the interpreter allocate all the required memory in one fell swoop, instead of having to reallocate it repeatedly as the array grows. (Of course, like with all optimizations, whether it's worth the bother is another question.) Another situation where one may want to do this is if one needs to initialize a non-sparse array in a non-sequential order, e.g. if that's the way the data is initially received by the code. Of course, there are many ways to skin such a cat; pre-allocating the space and using direct list indexing is just one of them. I happen to think it is a particularly straighforward one, but I realize that others (you, Andre, etc.) may not agree. kynn -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?On Nov 6, 6:12 am, kj <no.em...@...> wrote:
> In Perl one can assign a value to any element of an array, even to > ones corresponding to indices greater or equal than the length of > the array: > > my @arr; > $arr[999] = 42; > > perl grows the array as needed to accommodate this assignment. In > fact one common optimization in Perl is to "pre-grow" the array to > its final size, rather than having perl grow it piecemeal as required > by assignments like the one above: > > my @arr; > $#arr = 999_999; > > After assigning to $#arr (the last index of @arr) as shown above, > @arr has length 1,000,000, and all its elements are initialized to > undef. > > In Python the most literal translation of the first code snippet > above triggers an IndexError exception: > > >>> arr = list() > >>> arr[999] = 42 > > Traceback (most recent call last): > File "<stdin>", line 1, in <module> > IndexError: list assignment index out of range > > In fact, one would need to pre-grow the list sufficiently to be > able to make an assignment like this one. I.e. one needs the > equivalent of the second Perl snippet above. > > The best I can come up with is this: > > arr = [None] * 1000000 > > Is this the most efficient way to achieve this result? > > TIA! > > kynn I don't have the code with me, but for huge arrays, I have used something like: >>> arr[0] = initializer >>> for i in range N: >>> arr.extend(arr) This doubles the array every time through the loop, and you can add the powers of 2 to get the desired result. Gil -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?On Nov 6, 6:12 am, kj <no.em...@...> wrote:
> In Perl one can assign a value to any element of an array, even to > ones corresponding to indices greater or equal than the length of > the array: > > my @arr; > $arr[999] = 42; > > perl grows the array as needed to accommodate this assignment. In > fact one common optimization in Perl is to "pre-grow" the array to > its final size, rather than having perl grow it piecemeal as required > by assignments like the one above: > > my @arr; > $#arr = 999_999; > > After assigning to $#arr (the last index of @arr) as shown above, > @arr has length 1,000,000, and all its elements are initialized to > undef. > > In Python the most literal translation of the first code snippet > above triggers an IndexError exception: > > >>> arr = list() > >>> arr[999] = 42 > > Traceback (most recent call last): > File "<stdin>", line 1, in <module> > IndexError: list assignment index out of range > > In fact, one would need to pre-grow the list sufficiently to be > able to make an assignment like this one. I.e. one needs the > equivalent of the second Perl snippet above. > > The best I can come up with is this: > > arr = [None] * 1000000 > > Is this the most efficient way to achieve this result? > > TIA! > > kynn You mean sum'in like dis? class PerlishList(list): '''Hand holding list object for even the most demanding Perl hacker''' def __init__(self, dim=0): list.__init__(self) if dim: self.__setitem__(dim, None) def __setitem__(self, idx, v): lenkeys = len(self) sup = super(PerlishList, self) if idx > lenkeys: for idx in range(lenkeys, idx): sup.append(None) sup.__setitem__(idx, v) def __getitem__(self, idx): return self[idx] l = PerlishList(3) l.append('a') l.append('b') print l l[10] = 10 print l ;-) -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?On Fri, 06 Nov 2009 18:46:33 -0800, gil_johnson wrote:
> I don't have the code with me, but for huge arrays, I have used > something like: > >>>> arr[0] = initializer >>>> for i in range N: >>>> arr.extend(arr) > > This doubles the array every time through the loop, and you can add the > powers of 2 to get the desired result. Gil Why is it better to grow the list piecemeal instead of just allocating a list the size you want in one go? arr = [x]*size_wanted -- Steven -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?kj a écrit :
> > As I said, this is considered an optimization, at least in Perl, > because it lets the interpreter allocate all the required memory > in one fell swoop, instead of having to reallocate it repeatedly > as the array grows. IIRC, CPython has it's own way to optimize list growth. > (Of course, like with all optimizations, > whether it's worth the bother is another question.) My very humble opinion is that unless you spot a bottleneck (that is, you have real performance issues AND the profiler identified list growth as the culprit), the answer is a clear and obvious NO. > Another situation where one may want to do this is if one needs to > initialize a non-sparse array in a non-sequential order, Then use a dict. -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?Quoting Bruno Desthuilliers <bdesth.quelquechose@...>: > > Another situation where one may want to do this is if one needs to > > initialize a non-sparse array in a non-sequential order, > > Then use a dict. Ok, he has a dict. Now what? He needs a non-sparse array. -- Luis Zarrabeitia Facultad de Matemática y Computación, UH http://profesores.matcom.uh.cu/~kyrie -- Participe en Universidad 2010, del 8 al 12 de febrero de 2010 La Habana, Cuba http://www.universidad2010.cu -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?On Sat, Nov 7, 2009 at 8:25 PM, Luis Alberto Zarrabeitia Gomez
<kyrie@...> wrote: > > Quoting Bruno Desthuilliers <bdesth.quelquechose@...>: > >> > Another situation where one may want to do this is if one needs to >> > initialize a non-sparse array in a non-sequential order, >> >> Then use a dict. > > Ok, he has a dict. > > Now what? He needs a non-sparse array. Let d be your dict. Call the zeroeth place in your array d[0], the first d[1], the 10000th d[100000]. -- André Engels, andreengels@... -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?Quoting Andre Engels <andreengels@...>: > On Sat, Nov 7, 2009 at 8:25 PM, Luis Alberto Zarrabeitia Gomez > <kyrie@...> wrote: > > > > Ok, he has a dict. > > > > Now what? He needs a non-sparse array. > > Let d be your dict. > > Call the zeroeth place in your array d[0], the first d[1], the 10000th > d[100000]. Following that reasoning, we could get rid of lists and arrays altogether. Here's why that wouldn't work: for x,y in zip(d,other): ... do something ... Yes, we could also ignore zip and just use range/xrange to iterate for the indices... Lists and dictionaries have different semantics. One thing is to argue that you shouldn't be thinking on pre-growing a list for performance reasons before being sure that it is a bottleneck, and a very different one is to argue that because one operation (__setitem__) is the same with both structures, we should not use lists for what may need, depending on the problem, list semantics. ¿Have you ever tried to read list/matrix that you know it is not sparse, but you don't know the size, and it may not be in order? A "grow-able" array would just be the right thing to use - currently I have to settle with either hacking together my own grow-able array, or preloading the data into a dict, growing a list with the [0]*size trick, and updating that list. Not hard, not worthy of a PEP, but certainly not so easy to dismiss. -- Luis Zarrabeitia Facultad de Matemática y Computación, UH http://profesores.matcom.uh.cu/~kyrie -- Participe en Universidad 2010, del 8 al 12 de febrero de 2010 La Habana, Cuba http://www.universidad2010.cu -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?Steven D'Aprano wrote:
> On Fri, 06 Nov 2009 18:46:33 -0800, gil_johnson wrote: > >> I don't have the code with me, but for huge arrays, I have used >> something like: >> >>>>> arr[0] = initializer >>>>> for i in range N: >>>>> arr.extend(arr) >> This doubles the array every time through the loop, and you can add the >> powers of 2 to get the desired result. Gil > > Why is it better to grow the list piecemeal instead of just allocating a > list the size you want in one go? It isn't. > arr = [x]*size_wanted Is what I would do. -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?On Nov 6, 3:12 pm, kj <no.em...@...> wrote:
> The best I can come up with is this: > > arr = [None] * 1000000 > > Is this the most efficient way to achieve this result? It is the most efficient SAFE way to achieve this result. In fact, there IS the more efficient way, but it's dangerous, unsafe, unpythonic and plain evil: >>> import ctypes >>> ctypes.pythonapi.PyList_New.restype = ctypes.py_object >>> ctypes.pythonapi.PyList_New(100) -- Ivan -- http://mail.python.org/mailman/listinfo/python-list |
|
|
|
|
|
Re: Most efficient way to "pre-grow" a list?On 6 Nov, 13:12, kj <no.em...@...> wrote:
> > The best I can come up with is this: > > arr = [None] * 1000000 > > Is this the most efficient way to achieve this result? Yes, but why would you want to? Appending to a Python list has amortized O(1) complexity. I am not sure about Perl, but in MATLAB arrays are preallocated because resize has complexity O(n), instead of amortized O(1). You don't need to worry about that in Python. Python lists are resized with empty slots at the end, in proportion to the size of the list. On average, this has the same complexity as pre- allocation. -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?On 7 Nov, 03:46, gil_johnson <gil_john...@...> wrote:>
> I don't have the code with me, but for huge arrays, I have used > something like: > > >>> arr[0] = initializer > >>> for i in range N: > >>> arr.extend(arr) > > This doubles the array every time through the loop, and you can add > the powers of 2 to get the desired result. > Gil You should really use append instead of extend. The above code is O (N**2), with append it becomes O(N) on average. -- http://mail.python.org/mailman/listinfo/python-list |
|
|
Re: Most efficient way to "pre-grow" a list?On 6 Nov, 22:03, kj <no.em...@...> wrote:
> As I said, this is considered an optimization, at least in Perl, > because it lets the interpreter allocate all the required memory > in one fell swoop, instead of having to reallocate it repeatedly > as the array grows. Python does not need to reallocate repeatedly as a list grows. That is why it's called a 'list' and not an array. There will be empty slots at the end of a list you can append to, without reallocating. When the list is resized, it is reallocated with even more of these. Thus the need to reallocate becomes more and more rare as the list grows, and on average the complexity of appending is just O(1). -- http://mail.python.org/mailman/listinfo/python-list |
| < Prev | 1 - 2 | Next > |
| Free embeddable forum powered by Nabble | Forum Help |