forums.ps2dev.org

hlide · Joined: 10 Sep 2006 Posts: 750

This topic is where we can share our VFPU diggins. This first message should grow more and more as our VFPU diggins make progress.

Raphael · Joined: 17 Jan 2006 Posts: 646 Location: Germany

hlide · Joined: 10 Sep 2006 Posts: 750

Added nearly all the instructions, but there is still a lot to be done :/

dot_blank · Joined: 28 Sep 2005 Posts: 498 Location: Brasil

I am glad somebody finally took it upon themselves
to start something like this... cheers hlide and Raphael
_________________
10011011 00101010 11010111 10001001 10111010

Raphael · Joined: 17 Jan 2006 Posts: 646 Location: Germany

Some things from the list I can complete/confirm:

hlide · Joined: 10 Sep 2006 Posts: 750

nice catch for vfad, i was clueless.

opc-mips.c :

There is only one vcrs.t and one vdet.p. If vdet.t exists, its opcode would probably be something like 0x67808000 + vd.t + (vs.t << 8). But in my opinion, the determinant for the 3D case is computed differently from the 2D case, which may explain this:

det([a]) = a
det([[a b][c d]]) = ad - bc.
det([[a b c][d e f][g h i]]) = aei + dhc + gbf - ceg - fha - ibd.
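
To make the difference in cost concrete, here is a minimal sketch of the 2x2 and 3x3 formulas above in plain C. It is not VFPU code; the names det2 and det3 are made up for illustration and do not come from pspsdk or opc-mips.c.

```c
/* Illustration only: plain C, not VFPU code.
   det2 and det3 are hypothetical helper names. */

/* det([[a b][c d]]) = ad - bc: two products, one subtraction. */
float det2(float a, float b, float c, float d)
{
    return a * d - b * c;
}

/* det([[a b c][d e f][g h i]]), with the terms in the same order
   as the formula above: six triple products. */
float det3(float a, float b, float c,
           float d, float e, float f,
           float g, float h, float i)
{
    return a * e * i + d * h * c + g * b * f
         - c * e * g - f * h * a - i * b * d;
}
```

The 3x3 case needs six triple products against two simple products for the 2x2 case, which fits the guess that a vdet.t would not be a simple extension of vdet.p.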

I will investigate vcrs.t as soon as I can.

I will add your diggins as soon as possible.

N.B.: Is the word "diggins" correct, or is it a pure invention of mine? I fail to find a French translation for this word.

hlide · Joined: 10 Sep 2006 Posts: 750

Raphael · Joined: 17 Jan 2006 Posts: 646 Location: Germany

hlide · Joined: 10 Sep 2006 Posts: 750

I'm digging into vcrs.t:

hlide · Joined: 10 Sep 2006 Posts: 750

I'm digging into vdet.p. We suppose it computes: vd.s = vs[0] x vt[1] - vs[1] x vt[0].
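
For comparing against hardware results, the supposed formula can be written as a small reference model in plain C. This is only a sketch of the assumption above, not confirmed behaviour of the instruction; vdet_p_ref is a made-up name, and vs and vt stand for the two pair operands.

```c
/* Hypothetical reference model of the SUPPOSED vdet.p behaviour:
   vd.s = vs[0] * vt[1] - vs[1] * vt[0].
   vdet_p_ref is an illustrative name, not a pspsdk function. */
float vdet_p_ref(const float vs[2], const float vt[2])
{
    return vs[0] * vt[1] - vs[1] * vt[0];
}
```

With vs = (1, 2) and vt = (3, 4) the model gives 1*4 - 2*3 = -2, and two parallel pairs such as (2, 4) and (1, 2) give 0.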

Some tests, just to check:

hlide · Joined: 10 Sep 2006 Posts: 750

LIST UPDATED !

hlide · Joined: 10 Sep 2006 Posts: 750

Raphael · Joined: 17 Jan 2006 Posts: 646 Location: Germany

hlide · Joined: 10 Sep 2006 Posts: 750

Raphael · Joined: 17 Jan 2006 Posts: 646 Location: Germany

hlide · Joined: 10 Sep 2006 Posts: 750

Raphael · Joined: 17 Jan 2006 Posts: 646 Location: Germany

Do you have any idea which version of binutils/pspsdk I need to have, to be able to use the vbtf1/2 ops? I just tried updating pspsdk but that didn't help yet, the ops still aren't recognized. I tried updating binutils, but somehow that failed, so I need to try again.
I'll have an update to the document soon. A few new ops decoded plus most clock ticks.
_________________
<Don't push the river, it flows.>
http://wordpress.fx-world.org - my devblog
http://wiki.fx-world.org - VFPU documentation wiki
Alexander Berl

hlide · Joined: 10 Sep 2006 Posts: 750

I'm using devkitPro with the latest devkitPSP, release 8.

http://sourceforge.net/project/showfiles.php?group_id=114505&package_id=157350

hlide · Joined: 10 Sep 2006 Posts: 750

Oh my! Shouldn't that be vbfy1/2?

I'm sorry, I DID misname them. I updated the text with the correct names.

Raphael · Joined: 17 Jan 2006 Posts: 646 Location: Germany

Raphael · Joined: 17 Jan 2006 Posts: 646 Location: Germany

Update to the document:
- added C counterparts for some ops (vi2c, vqmul, ...)
- added lv/sv ops for completeness
- added clock ticks for nearly all ops (only some for .t/.p/.s versions are missing)
- moved operand prefixes up to pretty much the top

hlide · Joined: 10 Sep 2006 Posts: 750

Raphael · Joined: 17 Jan 2006 Posts: 646 Location: Germany

Another update:
- added vsrt*, vsocp, vf2h/vh2f
- added prefix information from hlide's last post
- removed exec cycles from exec/latency column (better readability) and added missing latencies for .t/p/s variations
- added '?' where clock ticks information is missing

The only ops still missing are the vcmp versions, the byte to X extensions, vflush and vsync.

Next, the information should be put into a more readable format, a .pdf or something.

hlide · Joined: 10 Sep 2006 Posts: 750

vsrt1/2/3/4.q vd, vs are very tough ones, but I think I have discovered what they do:

hlide · Joined: 10 Sep 2006 Posts: 750

Oh, I missed your post, Raphael! Well, I can compare your additions with mine. :)

hlide · Joined: 10 Sep 2006 Posts: 750

Raphael:

OK, we found the same thing for vsrt1/2/3/4, so that should be okay.

I updated the text file in the first message, so I think you can erase your long text to reduce the number of pages to browse :).

By the way, groepaz plans to update his document with our findings.

Raphael · Joined: 17 Jan 2006 Posts: 646 Location: Germany

Heh, finally, he already said he'd update it when I first posted my VFPU clock cycles :P
EDIT: I think we can leave only your min/max code for vsrt*, it's shorter and easier to read
Oh, and do you know how you can seed the random generator for VFPU?

hlide · Joined: 10 Sep 2006 Posts: 750

Raphael · Joined: 17 Jan 2006 Posts: 646 Location: Germany

Yeah, just stumbled upon those too. Hm, unfortunately I have no clue how to use them. Would be nice though, seeing how the vector random generator only takes 3 cycles to generate one random number.

hlide · Joined: 10 Sep 2006 Posts: 750

vone.q and vzero.q take 3 cycles!?

Wouldn't vmov.q vd, vs[1, 1, 1, 1] and vmov.q vd, vs[0, 0, 0, 0] give us better timings (at least 2 cycles instead of 3)?

As for the random stuff, I'm trying to see how to use it.