<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	
	>
<channel>
	<title>Comments on: 10 Tips to Improve your Text Classification Algorithm Accuracy and Performance</title>
	<atom:link href="http://thinknook.com/10-ways-to-improve-your-classification-algorithm-performance-2013-01-21/feed/" rel="self" type="application/rss+xml" />
	<link>http://thinknook.com/10-ways-to-improve-your-classification-algorithm-performance-2013-01-21/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=10-ways-to-improve-your-classification-algorithm-performance</link>
	<description>Because the world needs another Business Intelligence blog!</description>
	<lastBuildDate>Fri, 14 Sep 2018 19:30:18 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.41</generator>
	<item>
		<title>By: Links Naji</title>
		<link>http://thinknook.com/10-ways-to-improve-your-classification-algorithm-performance-2013-01-21/#comment-3389</link>
		<dc:creator><![CDATA[Links Naji]]></dc:creator>
		<pubDate>Wed, 28 Oct 2015 23:16:22 +0000</pubDate>
		<guid isPermaLink="false">http://thinknook.com/?p=934#comment-3389</guid>
		<description><![CDATA[This question on &lt;a href=&quot;http://www.researchgate.net/post/In_Text_Classification_how_do_you_find_the_correlation_between_a_feature_and_the_class_label&quot; rel=&quot;nofollow&quot;&gt;ResearchGate &lt;/a&gt;might be able to help you out, there are multiple approaches to doing this.]]></description>
		<content:encoded><![CDATA[<p>This question on <a href="http://www.researchgate.net/post/In_Text_Classification_how_do_you_find_the_correlation_between_a_feature_and_the_class_label" rel="nofollow">ResearchGate </a>might be able to help you out, there are multiple approaches to doing this.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rukayat Hussein</title>
		<link>http://thinknook.com/10-ways-to-improve-your-classification-algorithm-performance-2013-01-21/#comment-3387</link>
		<dc:creator><![CDATA[Rukayat Hussein]]></dc:creator>
		<pubDate>Mon, 26 Oct 2015 12:28:11 +0000</pubDate>
		<guid isPermaLink="false">http://thinknook.com/?p=934#comment-3387</guid>
		<description><![CDATA[How can i evaluate the content of  a data set(s) i.e to know if a data set is good for a specific domain or not without the use of any classifier algorithms.]]></description>
		<content:encoded><![CDATA[<p>How can i evaluate the content of  a data set(s) i.e to know if a data set is good for a specific domain or not without the use of any classifier algorithms.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rukayat Hussein</title>
		<link>http://thinknook.com/10-ways-to-improve-your-classification-algorithm-performance-2013-01-21/#comment-3386</link>
		<dc:creator><![CDATA[Rukayat Hussein]]></dc:creator>
		<pubDate>Mon, 26 Oct 2015 12:16:04 +0000</pubDate>
		<guid isPermaLink="false">http://thinknook.com/?p=934#comment-3386</guid>
		<description><![CDATA[please i want to know the  figure(value) or range of values  that determine the classifier accuracy is good or bad.]]></description>
		<content:encoded><![CDATA[<p>please i want to know the  figure(value) or range of values  that determine the classifier accuracy is good or bad.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: lana</title>
		<link>http://thinknook.com/10-ways-to-improve-your-classification-algorithm-performance-2013-01-21/#comment-3356</link>
		<dc:creator><![CDATA[lana]]></dc:creator>
		<pubDate>Wed, 01 Jul 2015 01:58:39 +0000</pubDate>
		<guid isPermaLink="false">http://thinknook.com/?p=934#comment-3356</guid>
		<description><![CDATA[Hello Ibrahim Naji, thank you for a great summary of tools and approaches. I am wondering what are the best emotion classifiers out there. Most are based on the Plutchik’s wheel. Do you know of any that go beyond that into Value Systems? I am considering having a unique classifier build for market research purposes. Would you recommend to use an existing one, or have one built specifically for my application? An example of a classifier &quot;output&quot; is on my website here (manual coding): http://www.heartbeat.marketing/report-example#600-women-1.
Thank you,
Lana]]></description>
		<content:encoded><![CDATA[<p>Hello Ibrahim Naji, thank you for a great summary of tools and approaches. I am wondering what are the best emotion classifiers out there. Most are based on the Plutchik’s wheel. Do you know of any that go beyond that into Value Systems? I am considering having a unique classifier build for market research purposes. Would you recommend to use an existing one, or have one built specifically for my application? An example of a classifier &#8220;output&#8221; is on my website here (manual coding): <a href="http://www.heartbeat.marketing/report-example#600-women-1" rel="nofollow">http://www.heartbeat.marketing/report-example#600-women-1</a>.<br />
Thank you,<br />
Lana</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Links Naji</title>
		<link>http://thinknook.com/10-ways-to-improve-your-classification-algorithm-performance-2013-01-21/#comment-3334</link>
		<dc:creator><![CDATA[Links Naji]]></dc:creator>
		<pubDate>Tue, 24 Feb 2015 22:08:31 +0000</pubDate>
		<guid isPermaLink="false">http://thinknook.com/?p=934#comment-3334</guid>
		<description><![CDATA[Thanks :)]]></description>
		<content:encoded><![CDATA[<p>Thanks <img src="http://thinknook.com/wp-includes/images/smilies/icon_smile.gif" alt=":)" class="wp-smiley" /></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Viacheslav</title>
		<link>http://thinknook.com/10-ways-to-improve-your-classification-algorithm-performance-2013-01-21/#comment-3333</link>
		<dc:creator><![CDATA[Viacheslav]]></dc:creator>
		<pubDate>Tue, 24 Feb 2015 13:35:47 +0000</pubDate>
		<guid isPermaLink="false">http://thinknook.com/?p=934#comment-3333</guid>
		<description><![CDATA[A LOT of value here. Thanks Ibrahim, thanks Manish.]]></description>
		<content:encoded><![CDATA[<p>A LOT of value here. Thanks Ibrahim, thanks Manish.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Johnc511</title>
		<link>http://thinknook.com/10-ways-to-improve-your-classification-algorithm-performance-2013-01-21/#comment-3282</link>
		<dc:creator><![CDATA[Johnc511]]></dc:creator>
		<pubDate>Thu, 04 Sep 2014 01:40:29 +0000</pubDate>
		<guid isPermaLink="false">http://thinknook.com/?p=934#comment-3282</guid>
		<description><![CDATA[I think  you have  remarked some very interesting points ,  appreciate it for the post. ckdfcbgdccck]]></description>
		<content:encoded><![CDATA[<p>I think  you have  remarked some very interesting points ,  appreciate it for the post. ckdfcbgdccck</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: john kim</title>
		<link>http://thinknook.com/10-ways-to-improve-your-classification-algorithm-performance-2013-01-21/#comment-3220</link>
		<dc:creator><![CDATA[john kim]]></dc:creator>
		<pubDate>Thu, 22 May 2014 10:05:33 +0000</pubDate>
		<guid isPermaLink="false">http://thinknook.com/?p=934#comment-3220</guid>
		<description><![CDATA[Good text analytic methosds]]></description>
		<content:encoded><![CDATA[<p>Good text analytic methosds</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Links Naji</title>
		<link>http://thinknook.com/10-ways-to-improve-your-classification-algorithm-performance-2013-01-21/#comment-2979</link>
		<dc:creator><![CDATA[Links Naji]]></dc:creator>
		<pubDate>Tue, 13 Aug 2013 08:27:51 +0000</pubDate>
		<guid isPermaLink="false">http://thinknook.com/?p=934#comment-2979</guid>
		<description><![CDATA[Hello Jayson,

Awesome, am really glad you found the articles useful.

Regarding your question, there are many approaches to doing emotion classification, each has some advantages and disadvantages, I strongly recommending reading a book on Data Mining to get a better holistic picture about the over-all discipline, and maybe that will also spark some unique ideas of your own, which will make for a great thesis!. There is a good book by &lt;a href=&quot;http://www.cs.uic.edu/~liub/WebMiningBook.html&quot; title=&quot;Web Data Mining by Ling Liu&quot; target=&quot;_blank&quot; rel=&quot;nofollow&quot;&gt;Bing Liu on Web Data Mining&lt;/a&gt; I would recommend for this task.

As far as your specific problem is concerned, it looks like you are looking at classifying based on &lt;a href=&quot;http://en.wikipedia.org/wiki/Plutchik%27s_Wheel_of_Emotions#Plutchik.27s_wheel_of_emotions&quot; title=&quot;Wheek of Emotion&quot; target=&quot;_blank&quot; rel=&quot;nofollow&quot;&gt;Plutchik&#039;s wheel of emotions&lt;/a&gt;, in which he identifies 8 primal emotions, and uses those to build more complex human emotions. There are already some classification APIs that returns results based on this wheel of emotion, I recommend looking at &lt;a href=&quot;https://developer.conveyapi.com/features.html&quot; title=&quot;Convey API&quot; target=&quot;_blank&quot; rel=&quot;nofollow&quot;&gt;ConveyAPI&#039;s Emotion classifier&lt;/a&gt; as a practical example.

As far as building a classifier is concerned, assuming you are building a &quot;supervised learning&quot; classifier, you will need 2 things:

&lt;strong&gt;1) A training corpus of already emotion classified text.&lt;/strong&gt;

There are a few ways you could build this corpus, for example:
&lt;ul&gt;
	&lt;li&gt;You could try cleaning up the data in the &lt;a href=&quot;http://www.experienceproject.com/&quot; title=&quot;Experience Project&quot; target=&quot;_blank&quot; rel=&quot;nofollow&quot;&gt;Experience Project&lt;/a&gt;.&lt;/li&gt;

	&lt;li&gt;Using transcripts on &lt;a href=&quot;http://www.ted.com/&quot; title=&quot;TED&quot; target=&quot;_blank&quot; rel=&quot;nofollow&quot;&gt;TED&lt;/a&gt;, you can search by emotions such as Courageous, Inspiring, Funny, etc.&lt;/li&gt;

	&lt;li&gt;Build your own corpus using &lt;a href=&quot;https://www.mturk.com/mturk/&quot; title=&quot;Mechanical Turk by Amazon&quot; target=&quot;_blank&quot; rel=&quot;nofollow&quot;&gt;Amazon&#039;s Mechanical Turk&lt;/a&gt;. Cheap and easier to setup than one might think.&lt;/li&gt;

	&lt;li&gt;Although not really recommended, but you could sign-up to a trial on one of the online APIs that offer emotion classification, and use your free trial to build a corpus. I am sure if you are using the data for research purpose it won&#039;t cause too many issues, but the draw back is that you are building on something with an inherent inaccuracy, this inaccuracy will potentially be compounded when you building your own classifier on top of that data-set.&lt;/li&gt;

	&lt;li&gt;Get creative! :)&lt;/li&gt;

&lt;/ul&gt;


&lt;strong&gt;2) A classification engine.&lt;/strong&gt;

There are many packages out there that really simplifies the whole process, for example &lt;a href=&quot;http://www.r-project.org/&quot; title=&quot;R Statistical Programming&quot; target=&quot;_blank&quot; rel=&quot;nofollow&quot;&gt;R&lt;/a&gt; (for advance users) or &lt;a href=&quot;http://nltk.org/&quot; title=&quot;NLTK&quot; target=&quot;_blank&quot; rel=&quot;nofollow&quot;&gt;NLTK &lt;/a&gt;(for beginners/mid-level users). Each of those packages have great examples on how to get started, and a huge user community with many blog posts that can provide you with already made code. If you decide to go for NLTK I really recommend going through &lt;a href=&quot;http://streamhacker.com/&quot; title=&quot;Steamhacker&quot; target=&quot;_blank&quot; rel=&quot;nofollow&quot;&gt;Jacob Perkins&#039; Streamhacker blog&lt;/a&gt;, he has so many examples on NLTK, and explains things very well.


Once you have those 2 elements, then begins the process of refining the algorithm and results to achieve the desired accuracy, my posts on improving your classification algorithm, or measuring accuracy using the confusion matrix, should aid you some of the way there, or at least give you some ideas on how to get started. There are many interesting articles online if you do some research.

It is important to note that all of this might help you get started, but in order to produce something totally unique and awesome, you will need to dig deep and understand the mathematical models and concepts these algorithms operate on, and read some of the very interesting research out there on various approaches and the results they produced.

I hope this all makes sense, please do not hesitate to hit me up if you need more information.

Goodluck with your thesis!

Cheers,]]></description>
		<content:encoded><![CDATA[<p>Hello Jayson,</p>
<p>Awesome, am really glad you found the articles useful.</p>
<p>Regarding your question, there are many approaches to doing emotion classification, each has some advantages and disadvantages, I strongly recommending reading a book on Data Mining to get a better holistic picture about the over-all discipline, and maybe that will also spark some unique ideas of your own, which will make for a great thesis!. There is a good book by <a href="http://www.cs.uic.edu/~liub/WebMiningBook.html" title="Web Data Mining by Ling Liu" target="_blank" rel="nofollow">Bing Liu on Web Data Mining</a> I would recommend for this task.</p>
<p>As far as your specific problem is concerned, it looks like you are looking at classifying based on <a href="http://en.wikipedia.org/wiki/Plutchik%27s_Wheel_of_Emotions#Plutchik.27s_wheel_of_emotions" title="Wheek of Emotion" target="_blank" rel="nofollow">Plutchik&#8217;s wheel of emotions</a>, in which he identifies 8 primal emotions, and uses those to build more complex human emotions. There are already some classification APIs that returns results based on this wheel of emotion, I recommend looking at <a href="https://developer.conveyapi.com/features.html" title="Convey API" target="_blank" rel="nofollow">ConveyAPI&#8217;s Emotion classifier</a> as a practical example.</p>
<p>As far as building a classifier is concerned, assuming you are building a &#8220;supervised learning&#8221; classifier, you will need 2 things:</p>
<p><strong>1) A training corpus of already emotion classified text.</strong></p>
<p>There are a few ways you could build this corpus, for example:</p>
<ul>
<li>You could try cleaning up the data in the <a href="http://www.experienceproject.com/" title="Experience Project" target="_blank" rel="nofollow">Experience Project</a>.</li>
<li>Using transcripts on <a href="http://www.ted.com/" title="TED" target="_blank" rel="nofollow">TED</a>, you can search by emotions such as Courageous, Inspiring, Funny, etc.</li>
<li>Build your own corpus using <a href="https://www.mturk.com/mturk/" title="Mechanical Turk by Amazon" target="_blank" rel="nofollow">Amazon&#8217;s Mechanical Turk</a>. Cheap and easier to setup than one might think.</li>
<li>Although not really recommended, but you could sign-up to a trial on one of the online APIs that offer emotion classification, and use your free trial to build a corpus. I am sure if you are using the data for research purpose it won&#8217;t cause too many issues, but the draw back is that you are building on something with an inherent inaccuracy, this inaccuracy will potentially be compounded when you building your own classifier on top of that data-set.</li>
<li>Get creative! <img src="http://thinknook.com/wp-includes/images/smilies/icon_smile.gif" alt=":)" class="wp-smiley" /></li>
</ul>
<p><strong>2) A classification engine.</strong></p>
<p>There are many packages out there that really simplifies the whole process, for example <a href="http://www.r-project.org/" title="R Statistical Programming" target="_blank" rel="nofollow">R</a> (for advance users) or <a href="http://nltk.org/" title="NLTK" target="_blank" rel="nofollow">NLTK </a>(for beginners/mid-level users). Each of those packages have great examples on how to get started, and a huge user community with many blog posts that can provide you with already made code. If you decide to go for NLTK I really recommend going through <a href="http://streamhacker.com/" title="Steamhacker" target="_blank" rel="nofollow">Jacob Perkins&#8217; Streamhacker blog</a>, he has so many examples on NLTK, and explains things very well.</p>
<p>Once you have those 2 elements, then begins the process of refining the algorithm and results to achieve the desired accuracy, my posts on improving your classification algorithm, or measuring accuracy using the confusion matrix, should aid you some of the way there, or at least give you some ideas on how to get started. There are many interesting articles online if you do some research.</p>
<p>It is important to note that all of this might help you get started, but in order to produce something totally unique and awesome, you will need to dig deep and understand the mathematical models and concepts these algorithms operate on, and read some of the very interesting research out there on various approaches and the results they produced.</p>
<p>I hope this all makes sense, please do not hesitate to hit me up if you need more information.</p>
<p>Goodluck with your thesis!</p>
<p>Cheers,</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jayson</title>
		<link>http://thinknook.com/10-ways-to-improve-your-classification-algorithm-performance-2013-01-21/#comment-2978</link>
		<dc:creator><![CDATA[Jayson]]></dc:creator>
		<pubDate>Tue, 13 Aug 2013 08:22:24 +0000</pubDate>
		<guid isPermaLink="false">http://thinknook.com/?p=934#comment-2978</guid>
		<description><![CDATA[Hello Sir Ibrahim Naji, reading your blog posts has let me learned many things, I thank you for that. I do have some questions too, I am currently doing a thesis on emotion analysis on disaster related tweets, and I have some problems, how do I incorporate the frequent words that have been collected then translate it into an emotion? For example, the tweet is, &quot;God is our refuge and strength so when you choose friends make sure that most of them are God fearing pray 4 them #bopha&quot; Thank you sir in advance.]]></description>
		<content:encoded><![CDATA[<p>Hello Sir Ibrahim Naji, reading your blog posts has let me learned many things, I thank you for that. I do have some questions too, I am currently doing a thesis on emotion analysis on disaster related tweets, and I have some problems, how do I incorporate the frequent words that have been collected then translate it into an emotion? For example, the tweet is, &#8220;God is our refuge and strength so when you choose friends make sure that most of them are God fearing pray 4 them #bopha&#8221; Thank you sir in advance.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.w3-edge.com/products/


Served from: thinknook.com @ 2026-05-27 00:25:11 by W3 Total Cache
-->