<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>bullsmind.com &#187; spanish</title>
	<atom:link href="http://www.bullsmind.com/tag/spanish/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.bullsmind.com</link>
	<description>Technology and entrepeneurship</description>
	<lastBuildDate>Tue, 24 May 2011 05:31:49 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3</generator>
		<item>
		<title>Spanish language and SMS messages</title>
		<link>http://www.bullsmind.com/technology/spanish-language-and-sms-messages/</link>
		<comments>http://www.bullsmind.com/technology/spanish-language-and-sms-messages/#comments</comments>
		<pubDate>Sat, 24 Jan 2009 20:12:41 +0000</pubDate>
		<dc:creator>David Grajal</dc:creator>
				<category><![CDATA[sms]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[spanish]]></category>
		<category><![CDATA[telecommunications]]></category>

		<guid isPermaLink="false">http://www.bullsmind.com/?p=38</guid>
		<description><![CDATA[At the end of the last decade everybody in Spain decided to buy a cell phone. As all foreigners know, Spaniards party and dance flamenco on the streets all day long so cells were very useful to call your friends and locate the next place to dance flamenco. However the voice service was very expensive [...]]]></description>
			<content:encoded><![CDATA[<p>At the end of the last decade everybody in  Spain decided to buy a cell phone. As all foreigners know, Spaniards <strong>party </strong>and <strong>dance flamenco</strong> on the streets all day long so cells were very useful to call your friends and locate the <strong>next place to dance flamenco</strong>. However the voice service was very expensive so the people started using the SMS messaging system. As we don&#8217;t have a lot of time to write messages between <strong>one glass of sangria and the next one</strong>, we created our Spanish SMS language.</p>
<p>The Spanish SMS language makes use of <strong>compressed words</strong> taking off vowels and shortening expressions. In consequence <strong>it is faster to write a message in SMS language and you can pack more information in each SMS</strong>.</p>
<p>This SMS language is seen as an aberration for Academics, which are seriously concerned about those simplifications finding their way onto regular Spanish. One of the most widespread problems is that <strong>nobody writes SMS messages with accents</strong>.</p>
<p>What not everybody knows is that if you want to write proper Spanish on your SMS messages, <strong>you will need to pay more for each message</strong>.</p>
<p><strong>The technical reason</strong></p>
<p>All the cells in Spain are configured to send messages of 160 characters. <strong>However, the real payload limit for an SMS message is 140 bytes.</strong> We can encode 160 characters on only 140 bytes using a 7 bit character set.</p>
<p>A bit after the dinosaurs ruled the world, the computers spoke to each other on complete 8 bit words. American people were not entirely happy with this solution. You have plenty of space to encode all English characters on 7 bits so <strong>using 8 bits per word was seen as a waste of space</strong>. One bit does not seem a lot, but it was actually a big deal when computers could just manage a few kilobytes of memory.</p>
<p>So they took off the first bit and the 7 bit characters sets were born. The most widespread is named ASCII. On 7 bit you can refer to 127 characters but instead of <strong>including foreign symbols</strong>, <strong>American people decided to use the free space for drawing symbols and to control printers.</strong> This was a decision what made easier to develop software but it also made sharing information encoded in different character sets a <strong>big</strong> headache.</p>
<p>On the other hand <strong>GSM was an European effort</strong>, a lot more concerned about interoperability on the entire region. Instead of using standard ASCII <strong>they decided to create a new 7 bit character set</strong> removing the useless symbols and control characters and <strong>including the special symbols of the European languages</strong>. They included German, French, Danish and Finish symbols, but they just included the Spanish symbol &#8220;<em>ñ</em>&#8220;, essential to write wonderful words as <em><a href="http://www.coño.es">coño</a>,maricastaña,ñoño, guiñapo or moño.<br />
</em></p>
<p>The GSM character set is spoken by all the GSM cellphones on earth. This is how it looks:</p>
<p style="text-align: center;"><img class="size-full wp-image-41 aligncenter" src="http://www.bullsmind.com/wp-content/uploads/2009/01/gsm_aphabet.png" alt="GSM Alphabet" width=" mce_" height="480" /></p>
<p>On the GSM character set there are all the lowercased French accents but only the lowercased Spanish <em>é</em>. Why they included<strong> only one</strong> Spanish accent is a mystery to me. It&#8217;s useless.</p>
<p>You have probably noticed that this character set has 255 spaces but I said that GSM is a 7-bit character set. It actually was at the beginning, but eventually they decided to extend the character set with a <strong>beautiful hack </strong>on the system. They allow referring the upper 127 symbols using 14 bits (one control symbol plus the upper symbol). <strong>That&#8217;s why when you write a &#8216;[',']&#8216; or <a title="Euro sign" href="http://en.wikipedia.org/wiki/Euro_sign">€</a> symbol on your cell it counts as 2 characters instead of one</strong>. As you see there is still lot of empty space. Why they didn&#8217;t seize the opportunity to include more symbols? <strong>No idea.</strong></p>
<p>GSM is also used out of Europe. Can Chinese, Japanese, Korean, Russian and Arabic people send messages in their languages? <strong>Yes, but they don&#8217;t use standard GSM character set.</strong> They use UTF-16/UCS2 (Unicode) to encode their symbols. Unicode solve all the problems because <strong>it includes on one character set all the symbols of all the present and future real languages  you can think of</strong>.  <em>Unicode is the character set equivalent to a nuclear power plant.</em></p>
<p>Unicode requires 2 bytes per character. As we have a maximum payload of 140 bytes, <strong>all those countries using Unicode have a limitation of 70 characters per message.</strong> This is not a big deal for Korean, Chinese and Japanese people because they can transmit very dense information with their symbols. I guess Russian people have the same problem as we have with Spanish.</p>
<p>What is the problem? Spanish is not a dense language. Using Unicode to write Spanish means we can only use 70 character messages. 70 characters is simply too short. Remember that <strong>we already have a very compressed SMS language to deal with the 160 character limitation.</strong></p>
<p>You can set up your phone to send messages in proper Spanish in Unicode. It&#8217;s very easy. <strong>Why the Spanish cells are not configured by default in Unicode?</strong> I guess it would be a nightmare for PR Operator&#8217;s department because thousands of people would start complaining about extra cost. <strong>Spanish operators charge concatenated messages as independent messages.</strong></p>
<p>There was a nice alternative which was also the <strong>ethically correct</strong> solution. The operators could have changed their billing processes to allow sending sequences of two or three Unicode-written concatenated messages at the cost of one message.<strong> That would have meant that sending 160 character messages of proper Spanish in Unicode would have cost the same as one message</strong>. That nice solution would have forced the Operators to assume extra transmission  costs. Let&#8217;s say they could lose 0,00001% less. <em>I got that number out of thin air but I don&#8217;t think it&#8217;s far from reality.</em></p>
<p>The operators chose between giving the possibility of write proper Spanish in concatenated short messages or ignore the problem and force the people to write bad Spanish.</p>
<p>Everybody knew what happened. In fact using SMS language was used by the operators on their marketing campaigns to attract young people. <strong>It was just cool and modern to use it. The operators forced the people to write <span style="text-decoration: line-through;">bad</span> cool Spanish.<br />
</strong></p>
<p>So if Spaniards start using SMS-based language <strong>while singing flamenco</strong>,  remember that part of the problem was that one executive working on a Spanish operator thought that it was a good idea to sacrifice our language in exchange of a bit of extra revenue.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bullsmind.com/technology/spanish-language-and-sms-messages/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

