And so my interesting but probably useless project is finally finished. It's a program that compiles each PGG member's posts and counts how often each word appears. It outputs the top words per member, which can be used to generate a word cloud.
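A minimal sketch of the counting idea described above, assuming the posts are already available as (member, text) pairs. The function name `top_words_per_member`, the input format, and the sample members are illustrative assumptions; the actual program is not shown in this thread.

```python
import re
from collections import Counter, defaultdict


def top_words_per_member(posts, n=3):
    """posts: iterable of (member, text) pairs (hypothetical input format)."""
    counts = defaultdict(Counter)
    for member, text in posts:
        # Lowercase, then pull out runs of letters, digits and apostrophes.
        words = re.findall(r"[a-z0-9']+", text.lower())
        counts[member].update(words)
    # Keep only the n most frequent words for each member.
    return {member: c.most_common(n) for member, c in counts.items()}


# Made-up sample data, not real forum posts.
posts = [
    ("alice", "Music music music, and more music."),
    ("alice", "I love music and cooking."),
    ("bob", "Running, running, football."),
]
result = top_words_per_member(posts, n=2)
```

The per-member (word, count) pairs are the kind of input a word cloud generator needs.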
			
			
			
				Awesome!
			
			
			
				Here's my "Politics, Philosophy and Religion" word cloud:
(http://i.imgur.com/xgA3Q.jpg)
			
			
			
				It's not perfect. Maybe I can tweak it as I go along. But for now...
			
			
			
				very nice!
i'm excited to see general chat topics' word cloud.
			
			
			
				So, how much will this cost us to have one of those cool clouds? ;D
			
			
			
				Maybe you can play with the forum stats so that the usernames per category come out instead. That looks tedious, though.
nice one carp!
			
			
			
				Here's the "Politics, Philosophy and Religion" word cloud of the venerable ctan:
(http://i.imgur.com/vFDNY.png)
			
			
			
				CuhuwOoOoL...  :o :-X  
I don't think I'm consistent enough in one board to generate an interesting word cloud.  :(
			
			
			
				Oh no... mine would probably generate nothing but "masturbation" in the sex and relationship topic.. hehehehe...
			
			
			
				the "Politics, Philosophy and Religion" word cloud of the other doc, kilo:
(http://i.imgur.com/KH7UJ.png)
			
			
			
				More! More! Can you do the other boards too, carp? Like Hobbies?
			
			
			
				fox's "Politics, Philosophy and Religion" word cloud. He sure enjoys life.
(http://i.imgur.com/OJS7n.png)
			
			
			
				A notable poster in "Politics, Philosophy and Religion", judE_Law:
(http://i.imgur.com/gIgoh.png)
			
			
			
				Quote from: carpediem on May 22, 2011, 09:55:21 PM
A notable poster in "Politics, Philosophy and Religion", judE_Law:
(http://i.imgur.com/gIgoh.png)
wow! hmm... carpediem and brusko are in my word count. Nice!
Does the biggest word mean it's the one that was mentioned the most times?
			
 
			
			
				^ correct.
Here's pinoybrusko's "Politics, Philosophy and Religion" word cloud. His love for his country and his dedication to work are apparent.
(http://i.imgur.com/Lje28.png)
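On the size question: one simple way a word cloud tool could map counts to font sizes is linear scaling between a minimum and a maximum size. This is only a sketch; the function name `font_sizes`, the point sizes, and the sample counts are made up for illustration, and the tool actually used may scale differently.

```python
def font_sizes(freqs, min_pt=10, max_pt=72):
    """Map each word's count to a font size by linear scaling.

    Assumes freqs is a non-empty dict of word -> count.
    """
    lo, hi = min(freqs.values()), max(freqs.values())
    span = hi - lo
    sizes = {}
    for word, count in freqs.items():
        # If every count is equal, fall back to the midpoint size.
        scale = (count - lo) / span if span else 0.5
        sizes[word] = round(min_pt + scale * (max_pt - min_pt))
    return sizes


# Hypothetical counts, chosen only to show the scaling.
sizes = font_sizes({"country": 40, "work": 25, "pgg": 10})
```

The most frequent word gets the largest size and the least frequent the smallest.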
			
			
			
				hey carp! :-) thanks for that! That's fun. hehehe!
			
			
			
				Quote from: carpediem on May 22, 2011, 09:20:12 PM
And so my interesting but probably useless project is finally finished. It's a program that compiles each PGG member's posts and counts how often each word appears. It outputs the top words per member, which can be used to generate a word cloud.
and i would assume that the greater the mentions, the bigger the word is ?
			
 
			
			
				^ yes.
On to "Men's Style and Clothing".
First from the master himself, hiei:
(http://i.imgur.com/vKlN2.png)
			
			
			
				Another fashionista, angelo:
(http://i.imgur.com/HZJqW.png)
			
			
			
				^ thanks. nice to know.
			
			
			
				I'm laughing at some of the other members' word clouds hahaha ;D. But I won't post them unless they would like me to do so.
			
			
			
				pinoybrusko's "Men's Style and Clothing" word cloud:
(http://i.imgur.com/hHPen.png)
			
			
			
				Quote from: carpediem on May 22, 2011, 10:51:07 PM
I'm laughing at some of the other members' word clouds hahaha ;D. But I won't post them unless they would like me to do so.
be my guest :D
			
 
			
			
				hey carp! may i know my word cloud for every category? hehehehe!
			
			
			
				@eLgimiker0: wait, I haven't gotten to "Dating Women, Courtship and Sex" yet. Extraction takes a while when there are a lot of posts. So far I've only extracted:
- PGG Announcements
- Politics, Philosophy and Religion
- Men's Style and Clothing
- Body and Fitness - ongoing
Of course you'll be included once I get to "Dating Women, Courtship and Sex", together with marvin, junjaporms, and of course, fox. :D
@ctan: sure, but I probably won't do General Chat. It may take forever to extract hehehe. Unless I tweak something to exclude certain threads.
			
			
			
				hahaha! Nice!
			
			
			
				@carpediem: ahaha. thanks :D 
			
			
			
				@ctan: here you go. I therefore conclude that your favorite color is not red, but black and white.
And you really like skin.
(http://i.imgur.com/yXHjV.png)
			
			
			
				@ctan: show some skin. lolz
@carpediem: great work. congrats
			
			
			
				^ thanks :)
Here's Jon's "Men's Style and Clothing" word cloud:
(http://i.imgur.com/eJXS6.png)
			
			
			
				very nice carpediem. Thanks for my word cloud. More please!!! 
			
			
			
				^ hold on. Generating these is tiring too hehehe
Anyway, if anyone wants their own word cloud, I can give them their word frequencies (I'll either post them here or PM), and they can get their own from www.wordle.net
			
			
			
				Now to "Body and Fitness".
Of course, marvinofthefaintsmile's first:
(http://i.imgur.com/jwUR8.png)
			
			
			
				Are you using Revolutionsrad, carpe?
			
			
			
				^ no. What's that?
Here's "Body and Fitness" word cloud of mangkulas:
(http://i.imgur.com/nUzM3.png)
			
			
			
				Doc Kilo always thinks about health:
(http://i.imgur.com/cf7aC.png)
			
			
			
				angelo seems to like going off-topic in "Body and Fitness":
(http://i.imgur.com/upTPc.png)
			
			
			
				doc ctan's "Body and Fitness" cloud:
(http://i.imgur.com/VmWrI.png)
			
			
			
				That's it for now.
Before I go, here's "PGG Announcements" of Chris.
(http://i.imgur.com/DMpgh.png)
			
			
			
				Quote from: carpediem on May 23, 2011, 12:09:31 AM
Now to "Body and Fitness".
Of course, marvinofthefaintsmile's first:
(http://i.imgur.com/jwUR8.png)
hmm.. no other name is mentioned here except 'Wensha'.. There's no name of any PGG member either.. Hmm..
I wonder what the one for the sex category looks like?
I also wonder what fox's looks like in general. And val's?
			
 
			
			
				hi carpediem! 
:0 you're so hardworking. I thought of this before too, but I don't know how to collate user posts, ;)
congrats! this is so cool ;) super clap!
			
			
			
				Thanks all.
I thought people wouldn't care; after all, it's kinda useless.
@marvin: In your word cloud, I removed a lot of txtspeak, like "ung", "aq", etc. They slip past the filter words. hehehe
@darkstar: I'm only hardworking when something interests me. I tend to abandon what I've started if I lose interest.
The weekend project was not easy, but I wouldn't say it was hard either. There were some tricky parts: for example, I had to exclude text that is merely being quoted, and those quotes can be nested.
Other elements, like punctuation and Unicode characters, complicate things too.
Some "stopwords" (irrelevant words, like the ones ignored by Google search) obviously still pass through the filter. This is especially true for Filipino stopwords.
One notable word that gets filtered out is "us". Since I converted all words to lowercase, "US" is filtered out as well.
It also cannot handle compound words. That would be difficult to do and would require natural language processing.
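The cleanup steps described above can be sketched roughly as follows. This is not the actual program: it assumes quotes appear as BBCode-style `[quote ...]...[/quote]` tags (the thread never says what markup the extractor saw), and `strip_quotes`, `count_words`, and the tiny stopword list are hypothetical names and data for illustration.

```python
import re
from collections import Counter

# Assumed markup: BBCode-style quote tags, possibly with attributes.
QUOTE_TAG = re.compile(r"\[(/?)quote[^\]]*\]", re.IGNORECASE)

# Tiny illustrative stopword list; a real one would be far longer.
STOPWORDS = {"the", "a", "is", "us", "ang", "ng", "sa"}


def strip_quotes(text):
    """Drop everything inside quote tags, including nested quotes."""
    kept, depth, pos = [], 0, 0
    for tag in QUOTE_TAG.finditer(text):
        if depth == 0:
            kept.append(text[pos:tag.start()])  # text outside any quote
        if tag.group(1):                # closing tag
            depth = max(depth - 1, 0)
        else:                           # opening tag
            depth += 1
        pos = tag.end()
    if depth == 0:
        kept.append(text[pos:])         # trailing text after the last tag
    return "".join(kept)


def count_words(text):
    """Count the poster's own words: no quoted text, no stopwords."""
    # [^\W_]+ matches runs of Unicode letters and digits; punctuation is skipped.
    words = re.findall(r"[^\W_]+(?:'[^\W_]+)*", strip_quotes(text).lower())
    return Counter(w for w in words if w not in STOPWORDS)


sample = ("[quote author=a][quote author=b]old old[/quote]reply[/quote]"
          "My own words about the US")
counts = count_words(sample)
```

Because everything is lowercased before the stopword check, "US" is dropped along with "us", which is the side effect mentioned above.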
			
			
			
				kudos to you! you're so good ;)
			
			
			
				Quote from: carpediem on May 22, 2011, 10:51:07 PM
I'm laughing at some of the other members' word clouds hahaha ;D. But I won't post them unless they would like me to do so.
May I know mine? Hehehe...  :-*
			
 
			
			
				nice thread! :D
			
			
			
				This is so cool! Nice one carpediem.  :)
I'm amazed. Hehe!
			
			
			
				Quote from: Boomer23 on May 23, 2011, 12:26:08 PM
Quote from: carpediem on May 22, 2011, 10:51:07 PM
I'm laughing at some of the other members' word clouds hahaha ;D. But I won't post them unless they would like me to do so.
May I know mine? Hehehe...  :-*
Sure. But you only have a few posts so far, and most of them are in General Chat, which I don't want to extract because it's huge. The output looks bad when there are only a few posts, because the word counts come out almost the same.
			
 
			
			
				Extracting "Dating Women, Courtship and Sex" :P (Your thread is so long, marvin)
			
			
			
				carpe, don't edit out the critical words in marvin's  :D
			
			
			
				^ fox, I laughed at your word cloud for "Dating Women, Courtship and Sex" ;D
			
			
			
				^ (http://i.imgur.com/1Gv63.png)
			
			
			
				^ hold on, I haven't gotten there yet. Maybe later. There are a lot of posts, so the extraction is sure to take a while.
Here's marvin's "Dating Women, Courtship and Sex" word cloud with the txtspeak:
(http://i.imgur.com/i8cbK.png)
Here's his word cloud with most txtspeak removed, so that the relevant words are more visible:
(http://i.imgur.com/1iVa9.png)
			
			
			
				^ sex with jollibee lol
anyway, the rest later. I have to go for now.
			
			
			
				:) please queue my request too, carp. Hobbies thread ;) no pressure.
			
			
			
				So emoderator turns out to be all about bromance:
(http://i.imgur.com/ouoRE.png)
			
			
			
				me too carp, for hobbies and interest. But just PM it to me for now, because I feel like the things I've been saying are worthless. lol
			
			
			
				^ not at all; if the things you say are worthless, then so are everyone else's
junjaporm's "Dating Women, Courtship and Sex" word cloud. At first I was weirded out, and thought there was a bug in the program, but then I remembered the thread "I love you languages".
(http://i.imgur.com/LIDYh.png)
			
			
			
				Quote from: carpediem on May 23, 2011, 08:34:06 PM
So emoderator turns out to be all about bromance:
(http://i.imgur.com/ouoRE.png)
Bromance? ahahah. thanks carpediem.. more! more :D
			
 
			
			
				joshgroban's.
What's this? Sometimes wanting sex, sometimes wanting love? As long as it's hard, as long as it's the spouse? mwahaha
(http://i.imgur.com/arf8Z.png)
			
			
			
				ctan's
Of course his trademark "hahahaha" is still there
(http://i.imgur.com/6yykx.png)
			
			
			
				angelo's
Here's another "wants sex, needs love, as long as it's girls, kind of hard"
(http://i.imgur.com/0qL8q.png)
			
			
			
				Do I have one there in hobbies?
			
			
			
				^ yup of course, you're reserved for hobbies, together with Luc, bukojob, fox :)
			
			
			
				pinoybrusko's. I'm noticing a trend.
(http://i.imgur.com/Ev14X.jpg)
			
			
			
				Doc Kilo's.
Look for girl2girl.
(http://i.imgur.com/s2gP8.png)
			
			
			
				So fun!! keep 'em coming!
			
			
			
				Quote from: carpediem on May 23, 2011, 09:20:31 PM
angelo's
Here's another "wants sex, needs love, as long as it's girls, kind of hard"
(http://i.imgur.com/0qL8q.png)
It just reads like Chinese?!
			
 
			
			
				I feel like mine is all "XD" lol
			
			
			
				Quote from: carpediem on May 23, 2011, 09:07:35 PM
joshgroban's.
What's this? Sometimes wanting sex, sometimes wanting love? As long as it's hard, as long as it's the spouse? mwahaha
(http://i.imgur.com/arf8Z.png)
I like this... so much patience... thanks thanks
			
 
			
			
				Quote from: carpediem on May 23, 2011, 06:36:42 PM
^ sex with jollibee lol
anyway, the rest later. I have to go for now.
I laughed at Jollibee.. who would have thought it got squeezed in with sex..
			
 
			
			
				Quote from: pinoybrusko on May 23, 2011, 06:06:52 PM
carpe, don't edit out the critical words in marvin's  :D
hmm.. pb.. what's on your mind? hehehehe.
			
 
			
			
				As promised, "Hobbies and Men's Interest".
Mine. Seems to have a lot of youtube.
(http://i.imgur.com/CuVkV.png)
			
			
			
				^ Here's yours:
(http://i.imgur.com/65lt8.png)
			
			
			
				^ Maybe you did not mention bob dylan frequently enough in that board.
Here are your word frequencies, down to a count of 9: (dylan=10, bob=9)
song=119 love=113 music=98 year=84 time=76 cd=65 album=64 show=58 im=57 cds=53 gma=48 students=47 school=44 record=44 top=43 songs=42 fin=41 good=40 day=40 band=36 fan=36 live=35 carl=35 american=35 sobrang=35 rice=34 madonna=34 life=33 movie=33 10=33 math=33 pgg=33 years=31 glee=31 idol=30 banana=30 rock=29 tv=28 lady=28 high=28 2011=27 work=27 pop=27 c2=26 quaker=26 agree=26 free=26 metal=26 week=26 cooking=25 albums=25 oats=25 apple=25 series=24 release=24 country=24 long=24 money=24 thread=24 episode=23 night=23 concert=23 performance=23 kids=22 billboard=22 sm=22 world=22 hope=22 lunch=22 big=21 categories=21 bjork=21 hit=21 hits=21 grammy=21 ot=21 metallica=20 favorite=20 lyrics=20 dio=20 watch=20 awards=19 summer=19 cher=19 teacher=19 amazing=19 doc=19 picture=19 motion=19 feel=19 sabi=18 indie=18 friend=18 latest=18 michael=18 times=18 luc=18 artist=18 gaga=18 carpediem=18 ice=18 dinner=18 season=18 people=17 paul=17 post=17 death=17 march=17 dvd=17 city=17 sale=17 man=17 black=17 janet=17 sana=17 greatest=17 jazz=17 nice=17 category=17 april=17 nominated=16 guess=16 pinas=16 heavy=16 home=16 star=16 million=16 movies=16 interesting=16 12=16 14=16 grabe=16 recording=16 fans=16 christmas=16 place=16 bottom=15 happy=15 cream=15 soundtrack=15 sex=15 garbage=15 video=15 great=15 start=15 jam=15 remember=15 she's=15 friends=15 group=15 sound=15 dont=15 kuya=15 gusto=15 manila=15 academy=15 student=15 james=15 ost=14 comedy=14 nominations=14 early=14 eminem=14 artists=14 acting=14 version=14 fried=14 hot=13 actor=13 give=13 records=13 number=13 play=13 u2=13 sarah=13 born=13 true=13 extra=13 card=13 sales=13 family=13 announced=13 100=13 classic=13 11=13 mars=13 24=13 members=13 part=13 boys=13 classical=13 mr=13 days=13 local=13 naka=13 based=13 casey=13 george=13 kind=13 pm=13 waiting=13 date=12 mall=12 richie=12 sabbath=12 pizza=12 john=12 mtv=12 heart=12 title=12 sweldo=12 hell=12 genre=12 credit=12 lionel=12 minsan=12 girl=12 tour=12 medyo=12 hard=12 
00php=12 30=12 buy=12 queen=12 water=12 jude=12 story=12 ctan=12 watching=12 sad=11 original=11 experience=11 person=11 ginataang=11 magic=11 2010=11 nominees=11 award=11 word=11 las=11 network=11 bagong=11 strong=11 food=11 ratings=11 download=11 bassist=11 mind=11 sunday=11 13=11 15=11 16=11 barry=11 genres=11 kapuso=11 idea=11 prices=11 single=11 today=11 taylor=11 king=11 chapter=11 catholic=11 galing=11 singer=10 pinoy=10 played=10 things=10 filipino=10 tori=10 cheers=10 hey=10 found=10 set=10 fire=10 houston=10 perfect=10 system=10 edition=10 palayok=10 kitchen=10 air=10 reality=10 scotty=10 club=10 dream=10 miss=10 nyan=10 girls=10 points=10 judges=10 anthrax=10 pay=10 audience=10 share=10 yellowcab=10 collection=10 thesis=10 radio=10 news=10 dylan=10 cook=10 turn=10 actress=10 taste=10 kylie=10 sold=10 bukojob=10 hold=9 debut=9 holy=9 funeral=9 marvin=9 major=9 plan=9 gold=9 nope=9 smiths=9 magaling=9 ago=9 age=9 producer=9 halo=9 television=9 mathematics=9 barbell=9 brightman=9 wave=9 win=9 mac=9 universal=9 captain=9 room=9 stop=9 fields=9 super=9 afternoon=9 finally=9 ma=9 bob=9 reviews=9 pink=9 jackson=9 track=9 odyssey=9 find=9 2008=9 tom=9 social=9 amos=9 bata=9 philippines=9 science=9 fleetwood=9 white=9 college=9 streisand=9 ticket=9 higher=9 dvds=9 sinigang=9 ehem=9 tuloy=9
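The list above is in a plain `word=count` format. A minimal sketch of reading such a list back into a dictionary, for example to look up one word or keep only words above a cutoff; `parse_frequencies` is a hypothetical helper, and the short sample string reuses a few entries from the list.

```python
def parse_frequencies(text):
    """Parse whitespace-separated 'word=count' tokens into a dict."""
    freqs = {}
    for token in text.split():
        # Split on the last '=' so the count is always the final part.
        word, sep, count = token.rpartition("=")
        if sep and count.isdigit():
            freqs[word] = int(count)
    return freqs


sample = "song=119 love=113 music=98 dylan=10 bob=9"
freqs = parse_frequencies(sample)

# Words at or above a cutoff, highest count first.
frequent = sorted(
    (w for w, c in freqs.items() if c >= 10),
    key=lambda w: -freqs[w],
)
```

Tokens without a numeric count after the `=` are skipped instead of raising an error.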
			
			
			
				"Hobbies and Men's Interest" word cloud ni junjaporms:
(http://i.imgur.com/Cdikx.png)
			
			
			
				Quote from: bukojob on May 23, 2011, 11:44:07 PM
I feel like mine is all "XD" lol
Correct!!!
(http://i.imgur.com/h7Arp.png)
			
 
			
			
				Luc!
(http://i.imgur.com/kELN2.png)
			
			
			
				incognito is into something different - japanese, koreans, thais, dragons and gore
(http://i.imgur.com/Pscmy.png)
			
			
			
				The others' will come later...
			
			
			
				Thanks carp! Your project is great. It's fun. :)
			
			
			
				^ Thread level is fine; I just need to tweak the program a bit. Just not the whole General Chat. Not the Out of Topic Chatforum either.
I'll continue with some other members' "Hobbies and Men's Interest"...
			
			
			
				marvinofthefaintsmile. As usual I removed a lot of txtspeak to make the words in the word cloud relevant.
(http://i.imgur.com/XCb5W.png)
			
			
			
				Guess whose this is:
(http://i.imgur.com/r0wK2.png)
			
			
			
				"Hobbies and Men's Interest" word cloud of the most mysterious forumer:
(http://i.imgur.com/m2Ul1.png)
			
			
			
				Quote from: carpediem on May 24, 2011, 09:02:27 PM
Guess whose this is:
(http://i.imgur.com/r0wK2.png)
hahahaha? This is ctan's.
			
 
			
			
				darkstar13:
(http://i.imgur.com/igOCX.png)
MaRfZ
(http://i.imgur.com/7xg4b.png)
mang juan
(http://i.imgur.com/wRnyZ.png)
pinoybrusko
(http://i.imgur.com/0jndG.png)
			
			
			
				That's it. I'm getting lazy. Sorry if yours isn't included above; just ask and I'll post it. (Please just don't request one if you only have a few posts in a board, because the resulting word cloud won't look good.)
			
			
			
				junjaporm's word cloud in his own thread:
(http://i.imgur.com/8GRk4.png)
			
			
			
				carpediem is so hardworking.
 ;)
			
			
			
				Quote from: carpediem on May 22, 2011, 11:37:06 PM
^ thanks :)
Here's Jon's "Men's Style and Clothing" word cloud:
(http://i.imgur.com/eJXS6.png)
If you notice... "white" and "shorts" are big.
That's because I'm fond of those.
Thanks carpediem... this is seriously super great.
			
 
			
			
				and "mahal", "boxers" and "super".
"mahal" (expensive) because the stuff I use isn't expensive.
boxers, I'm into them.
super, just an expression.
			
			
			
				Anyway, I'm enjoying this project
			
			
			
				Quote from: carpediem on May 24, 2011, 06:15:55 PM
Quote from: bukojob on May 23, 2011, 11:44:07 PM
I feel like mine is all "XD" lol
Correct!!!
(http://i.imgur.com/h7Arp.png)
hahahaha! This made me laugh! I knew it! XD
			
 
			
			
				^^ stay tough on it during these hard times. ur not alone in there.
			
			
			
				Quote from: carpediem on May 24, 2011, 09:14:35 PM
darkstar13:
(http://i.imgur.com/igOCX.png)
thanks carpediem! ;) I don't have that many posts, hehe. It somewhat reminded me of the things I posted over the past year, ;)
			
 
			
			
				thanks, carp. Mine has variety. haha.
			
			
			
				thanks carpediem  :D
			
			
			
				i just saw this format presented to us by a media agency this morning.
they used this "word cloud" to present how much our brand is mentioned compared to the competition and what people say about the brand. (all coming from the famous social media platforms)
			
			
			
				Quote from: carpediem on May 24, 2011, 09:14:35 PM
MaRfZ
(http://i.imgur.com/7xg4b.png)
Thanks a lot for this carp.. So fun!  :)
I laughed, "kuya" really is the biggest word.. Haha! So funny!  :D
			
 
			
			
				carpediem, what about me? ;D
			
			
			
				^ Your "Hobbies and Men's Interest" word cloud:
(http://i.imgur.com/mrHZv.png)
			
			
			
				So I'm always saying LIE?! Oh no! it's a lie!
I always have a RUNNING TRIP, and FOOTBALL is my BREAK.
I was amused by KALA JAPANESE, SANDWHICH NAMAN.
			
			
			
				Quote from: noyskie on June 01, 2011, 10:47:26 AM
So I'm always saying LIE?! Oh no! it's a lie!
you just lied again ;D
remember "dalawampu't limang kasinungalingan" (twenty-five lies)?
			
 
			
			
				Quote from: carpediem on June 01, 2011, 11:01:12 AM
Quote from: noyskie on June 01, 2011, 10:47:26 AM
So I'm always saying LIE?! Oh no! it's a lie!
you just lied again ;D
remember "dalawampu't limang kasinungalingan" (twenty-five lies)?
waahh... I'm forgetful sometimes...
			
 
			
			
				Carpe!
I don't know where to post this, but I came across this place on my latest vacation and just had to take a pic xD
(http://farm3.static.flickr.com/2588/5810517538_30451d9d66_z.jpg)
xD
			
			
			
				hehehe Korea
			
			
			
				back then, i wondered if i would actually see you in that cafe xD