[BETA RELEASE] PhoneME Advanced (Java Mobile) - Page 49 - maemo.org

Active Topics

Porting apps to Leste (32)
to Maemo 7 / Leste by Arno_11 - 18 hrs, 40 mins ago
[ANNOUNCE] CSSU-thumb thread - stable Thumb2 on N900 (2,264)
to Maemo 5 / Fremantle by panjgoori - 2 days, 12 hrs ago
Full linux distros on Sailfish OS (254)
to SailfishOS by qoh - 3 days, 11 hrs ago
more...

Page 49 of 85

Thread Tools

DavyP	2012-03-29 , 19:35
Posts: 306 \| Thanked: 603 times \| Joined on Jan 2012 @ Belgium	#481

Originally Posted by freemangordon

@DavyP - thanks for the last patches, now opera runs. In short - everything is fine

I've played a little bit with cvm code and the result so far:

JBenchmark2, no overclock, charger connected, using max score from several runs:

landscape:
DavyP cvm - 862
freemangordon cvm - 874

portrait:
DavyP cvm - 760
freemangordon cvm - 774

not and impressive speedup (only about 2%) , but yet

Next thing I will do is to try to get rid of that memcpy(by passing cvm allocated buffer as screen buffer to libcvm), it should give us additional performance boost.

Again, thanks a lot for making this port to maemo and sharing your experience with us.

EDIT:
any chance to PM me a link to download JBenchmark, google finds a site when a registration is needed, but I don't want my inbox full with spam.

If you would repeat the experiment multiple times, you will notice
that the 2% below is even below the error margin. So I am very
doubtful there is any improvement at all :-)

And about the memcpy: I am pretty sure you will bump into
synchronization issues between two event loops (one of phoneme
and one of the Qt4 frontend). The phoneME backend calls a
method in the Qt4 frontend to say it is time to paint the buffer. The
Qt4 front-end will emit an asynchronous signal. By the time this
signal is processed the original buffer has changed, resulting in
flickering and painting of intermediate frames if you use the
original buffer.

But feel free to experiment, you never know ... :-)

Davy

Quote & Reply |

The Following User Says Thank You to DavyP For This Useful Post:
Joseph9560

DavyP	2012-03-29 , 19:36
Posts: 306 \| Thanked: 603 times \| Joined on Jan 2012 @ Belgium	#482

JBenchmark:

http://davy.preuveneers.be/phoneme/p...mo/jbenchmark/

Note: only JBenchmark and JBenchmark2 work. The others require
more JSR support

Davy

Quote & Reply |

The Following User Says Thank You to DavyP For This Useful Post:
Estel

freemangordon	2012-03-29 , 19:37
Posts: 3,074 \| Thanked: 12,960 times \| Joined on Mar 2010 @ Sofia,Bulgaria	#483

Originally Posted by Estel

After first batch of optimization, maybe it will be time for extras-devel?

No way this could be swallowed by autobuilder

. The only way it can be pushed is by binary distribution.

Quote & Reply |

The Following User Says Thank You to freemangordon For This Useful Post:
Estel

freemangordon	2012-03-29 , 19:43
Posts: 3,074 \| Thanked: 12,960 times \| Joined on Mar 2010 @ Sofia,Bulgaria	#484

Originally Posted by DavyP

JBenchmark:

http://davy.preuveneers.be/phoneme/p...mo/jbenchmark/

Note: only JBenchmark and JBenchmark2 work. The others require
more JSR support

Davy

Thanks a lot.

OK, second optimization pass (memcpy is gone), JBenchmark2 result(same conditions)

landscape - 972
portrait - 804

going to try your benchmarks, will post theresults here.

Quote & Reply |

DavyP	2012-03-29 , 19:46
Posts: 306 \| Thanked: 603 times \| Joined on Jan 2012 @ Belgium	#485

Originally Posted by freemangordon

Thanks a lot.

OK, second optimization pass (memcpy is gone), JBenchmark2 result(same conditions)

landscape - 972
portrait - 804

going to try your benchmarks, will post theresults here.

Nice! And no flickering?

Davy

Quote & Reply |

freemangordon	2012-03-29 , 19:48
Posts: 3,074 \| Thanked: 12,960 times \| Joined on Mar 2010 @ Sofia,Bulgaria	#486

Originally Posted by DavyP

If you would repeat the experiment multiple times, you will notice
that the 2% below is even below the error margin. So I am very
doubtful there is any improvement at all :-)

And about the memcpy: I am pretty sure you will bump into
synchronization issues between two event loops (one of phoneme
and one of the Qt4 frontend). The phoneME backend calls a
method in the Qt4 frontend to say it is time to paint the buffer. The
Qt4 front-end will emit an asynchronous signal. By the time this
signal is processed the original buffer has changed, resulting in
flickering and painting of intermediate frames if you use the
original buffer.

But feel free to experiment, you never know ... :-)

Davy

actually your libcvm.so build is set-up to use double-buffering

so... no flickering

EDIT:
But anyway I will add some synchronization, just in case, it could lower the results a bit, but will make sure there will be no flicker under any conditions

EDIT2:
with DavyP's benchmarks:

JBenchmark, landscape only (seems does not like portrait) - 5845

JBenchmark2:
landscape - 978
portrait - 906

Last edited by freemangordon; 2012-03-29 at 20:03.

Quote & Reply |

The Following 2 Users Say Thank You to freemangordon For This Useful Post:
Estel, Joseph9560

DavyP	2012-03-29 , 20:31
Posts: 306 \| Thanked: 603 times \| Joined on Jan 2012 @ Belgium	#487

Another remark:

It is possible that the above results are not accurate for another reason. The performance results are measured based on what happens in the event loop of phoneME. This event loop triggers in the Qt4 front-end app a signal emit. So most of what you measure is computation related, not screen IO related. The actual repainting is done in the event-loop of the Qt4 front-end, and it is possible that in this event-loop multiple repaint events are coalesced into a single one (i.e. a single screen update). So the actual painting time in the Qt4 event loop is not included in the benchmark, and the actual screen updates could be lower than what you measure.

Davy

Quote & Reply |

freemangordon	2012-03-29 , 20:53
Posts: 3,074 \| Thanked: 12,960 times \| Joined on Mar 2010 @ Sofia,Bulgaria	#488

Originally Posted by DavyP

Another remark:

It is possible that the above results are not accurate for another reason. The performance results are measured based on what happens in the event loop of phoneME. This event loop triggers in the Qt4 front-end app a signal emit. So most of what you measure is computation related, not screen IO related. The actual repainting is done in the event-loop of the Qt4 front-end, and it is possible that in this event-loop multiple repaint events are coalesced into a single one (i.e. a single screen update). So the actual painting time in the Qt4 event loop is not included in the benchmark, and the actual screen updates could be lower than what you measure.

Davy

But the same could happen with your code, i.e. there is no check if the current buffer has been painted before doing memcpy over it. And it is less possible to happen without memcpy, as there are 2 screens buffers used by libcvm, against just 1 when memcpy is used. Anyway, i will implement some buffer synchronization, I promise

.

On the other hand both threads are CPU bound, so when there is less overhead (because there is no memcpy for every screen buffer), the computational thread is given more CPU shares to do its job, so I believe the benchmark results are pretty corect.

Quote & Reply |

The Following User Says Thank You to freemangordon For This Useful Post:
Estel

DavyP	2012-03-29 , 22:00
Posts: 306 \| Thanked: 603 times \| Joined on Jan 2012 @ Belgium	#489

Originally Posted by freemangordon

But the same could happen with your code, i.e. there is no check if the current buffer has been painted before doing memcpy over it. And it is less possible to happen without memcpy, as there are 2 screens buffers used by libcvm, against just 1 when memcpy is used. Anyway, i will implement some buffer synchronization, I promise .

On the other hand both threads are CPU bound, so when there is less overhead (because there is no memcpy for every screen buffer), the computational thread is given more CPU shares to do its job, so I believe the benchmark results are pretty corect.

True. I don't check before the memcpy either as I don't want to force paint every frame either if the device cannot keep up.

Anyway, I don't want to discourage you in any way. The only point I wanted to make is that I am always careful with interpreting benchmarks, especially if I don't know what they are measuring :-),
but any frame rate improvement you can squeeze out of it would be most welcome.

Cheers,
Davy

Quote & Reply |

The Following User Says Thank You to DavyP For This Useful Post:
Estel

freemangordon	2012-03-29 , 22:44
Posts: 3,074 \| Thanked: 12,960 times \| Joined on Mar 2010 @ Sofia,Bulgaria	#490

Originally Posted by DavyP

True. I don't check before the memcpy either as I don't want to force paint every frame either if the device cannot keep up.

So without memcpy it is better because of the 2 buffers.

Anyway, I don't want to discourage you in any way. The only point I wanted to make is that I am always careful with interpreting benchmarks, especially if I don't know what they are measuring :-),
but any frame rate improvement you can squeeze out of it would be most welcome.

Cheers,
Davy

Well, I am not the one who would understand conversation like that as discouraging. Not that I could be discouraged easy

. And yes, bencmarks could be pretty misleading, I fully agree with you, but while we lack a better tools, we have to stick to what we have