<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Kindle Topaz Format .tpz</title>
	<atom:link href="http://ireaderreview.com/2009/06/24/kindle-topaz-format-tpz/feed/" rel="self" type="application/rss+xml" />
	<link>http://ireaderreview.com/2009/06/24/kindle-topaz-format-tpz/</link>
	<description>Kindle Review, Kindle Fire Review, New Kindle Review, Kindle 4 Review</description>
	<lastBuildDate>Mon, 13 Feb 2012 04:23:43 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: switch11</title>
		<link>http://ireaderreview.com/2009/06/24/kindle-topaz-format-tpz/#comment-35924</link>
		<dc:creator><![CDATA[switch11]]></dc:creator>
		<pubDate>Wed, 05 Oct 2011 06:51:30 +0000</pubDate>
		<guid isPermaLink="false">http://ireaderreview.com/?p=4269#comment-35924</guid>
		<description><![CDATA[Thanks. That&#039;s one of the most inelegant solutions I&#039;ve ever heard of. That they literally took an image per page and then split it into an image per word. That&#039;s crazy.]]></description>
		<content:encoded><![CDATA[<p>Thanks. That&#8217;s one of the most inelegant solutions I&#8217;ve ever heard of. That they literally took an image per page and then split it into an image per word. That&#8217;s crazy.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: AlexV</title>
		<link>http://ireaderreview.com/2009/06/24/kindle-topaz-format-tpz/#comment-35911</link>
		<dc:creator><![CDATA[AlexV]]></dc:creator>
		<pubDate>Wed, 05 Oct 2011 03:12:11 +0000</pubDate>
		<guid isPermaLink="false">http://ireaderreview.com/?p=4269#comment-35911</guid>
		<description><![CDATA[OK, here is what Topaz developer said step by step

1. the book is scanned so we have one image per page
2. each page&#039;s image is cut up so that each word goes into its own image so we have as many images per page as there are words in it.
3 the book in Topaz format is nothing else but container for those words&#039; images. 

Having each word in separate image allows for reflowing (by rearranging words/images) and resizing (by resizing the words&#039; images)

You can not change the font because there is no concept of font in this format - you see the font that was used in printed book. Hence no font embedding - there are no letters to apply the font to.

4. in addition the book is OCRd. The result is stored as _hidden_ information that is used for instance to search the text of the book - you never see the result of the OCR directly.

This was the first format that was developed for Kindle. This is what was shown to publishers when Amazon was just thinking about the whole e-reader business and had a prototype of the device. The main reason was that the conversion from printed book to a e-book which looked almost like the original was fully automated. 

Later MOBI became the principal format for Kindle, but to have the book in this new format it basically has to be redesigned almost from scratch (to have all the hyperlinks, etc in place). Hence MOBI came later when publishers got on board.

All that was not a speculation, I just repeated what the developer said about his work.]]></description>
		<content:encoded><![CDATA[<p>OK, here is what Topaz developer said step by step</p>
<p>1. the book is scanned so we have one image per page<br />
2. each page&#8217;s image is cut up so that each word goes into its own image so we have as many images per page as there are words in it.<br />
3 the book in Topaz format is nothing else but container for those words&#8217; images. </p>
<p>Having each word in separate image allows for reflowing (by rearranging words/images) and resizing (by resizing the words&#8217; images)</p>
<p>You can not change the font because there is no concept of font in this format &#8211; you see the font that was used in printed book. Hence no font embedding &#8211; there are no letters to apply the font to.</p>
<p>4. in addition the book is OCRd. The result is stored as _hidden_ information that is used for instance to search the text of the book &#8211; you never see the result of the OCR directly.</p>
<p>This was the first format that was developed for Kindle. This is what was shown to publishers when Amazon was just thinking about the whole e-reader business and had a prototype of the device. The main reason was that the conversion from printed book to a e-book which looked almost like the original was fully automated. </p>
<p>Later MOBI became the principal format for Kindle, but to have the book in this new format it basically has to be redesigned almost from scratch (to have all the hyperlinks, etc in place). Hence MOBI came later when publishers got on board.</p>
<p>All that was not a speculation, I just repeated what the developer said about his work.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: EdW</title>
		<link>http://ireaderreview.com/2009/06/24/kindle-topaz-format-tpz/#comment-33946</link>
		<dc:creator><![CDATA[EdW]]></dc:creator>
		<pubDate>Sat, 30 Jul 2011 05:35:58 +0000</pubDate>
		<guid isPermaLink="false">http://ireaderreview.com/?p=4269#comment-33946</guid>
		<description><![CDATA[@AlexV. I don&#039;t know what exactly are you saying. Topaz does support embed font. Do you mean it is image with OCR? I really don&#039;t understand what is your point.]]></description>
		<content:encoded><![CDATA[<p>@AlexV. I don&#8217;t know what exactly are you saying. Topaz does support embed font. Do you mean it is image with OCR? I really don&#8217;t understand what is your point.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: AlexV</title>
		<link>http://ireaderreview.com/2009/06/24/kindle-topaz-format-tpz/#comment-30663</link>
		<dc:creator><![CDATA[AlexV]]></dc:creator>
		<pubDate>Wed, 02 Mar 2011 17:55:33 +0000</pubDate>
		<guid isPermaLink="false">http://ireaderreview.com/?p=4269#comment-30663</guid>
		<description><![CDATA[According to this [link removed due to anti-DRM hack] ebook in topaz format is scanned printed book where individual words, mathematical formulas, charts, etc are placed in separate images to provide reflow with hidden layer of OCRd text to provide search and TTS (similarly to DJVU format). So the format is great for publishers because it does not require any additional work (Amazon scanned thousands of books already). On the other hand there is wide spread misinformation that topaz supports custom fonts - it does not, the text is just shown as in printed book.]]></description>
		<content:encoded><![CDATA[<p>According to this [link removed due to anti-DRM hack] ebook in topaz format is scanned printed book where individual words, mathematical formulas, charts, etc are placed in separate images to provide reflow with hidden layer of OCRd text to provide search and TTS (similarly to DJVU format). So the format is great for publishers because it does not require any additional work (Amazon scanned thousands of books already). On the other hand there is wide spread misinformation that topaz supports custom fonts &#8211; it does not, the text is just shown as in printed book.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: switch11</title>
		<link>http://ireaderreview.com/2009/06/24/kindle-topaz-format-tpz/#comment-9052</link>
		<dc:creator><![CDATA[switch11]]></dc:creator>
		<pubDate>Tue, 05 Jan 2010 23:11:06 +0000</pubDate>
		<guid isPermaLink="false">http://ireaderreview.com/?p=4269#comment-9052</guid>
		<description><![CDATA[please get in touch with Kovid Goyal who wrote the Calibre software. He or someone at MobileRead.com might be able to help you.]]></description>
		<content:encoded><![CDATA[<p>please get in touch with Kovid Goyal who wrote the Calibre software. He or someone at MobileRead.com might be able to help you.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: switch11</title>
		<link>http://ireaderreview.com/2009/06/24/kindle-topaz-format-tpz/#comment-5948</link>
		<dc:creator><![CDATA[switch11]]></dc:creator>
		<pubDate>Tue, 22 Sep 2009 05:03:14 +0000</pubDate>
		<guid isPermaLink="false">http://ireaderreview.com/?p=4269#comment-5948</guid>
		<description><![CDATA[really interesting. i&#039;ve been thinking about how something like this could be used to create a piracy inhibitor. thanks for your comment.]]></description>
		<content:encoded><![CDATA[<p>really interesting. i&#8217;ve been thinking about how something like this could be used to create a piracy inhibitor. thanks for your comment.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: denis</title>
		<link>http://ireaderreview.com/2009/06/24/kindle-topaz-format-tpz/#comment-5947</link>
		<dc:creator><![CDATA[denis]]></dc:creator>
		<pubDate>Tue, 22 Sep 2009 04:56:02 +0000</pubDate>
		<guid isPermaLink="false">http://ireaderreview.com/?p=4269#comment-5947</guid>
		<description><![CDATA[I have a topaz book in which a special font appears to have been used to scramble the text. For example, sometimes there is a word that begins with a capital H, and -- probably because of a typographical error -- the H appears divided into two halves that are separated by at least one space. If you highlight the word that begins with the H and then view the highlight under notes and bookmarks, the word shown in the highlight is completely different from the word actually highlighted, probably because of the typo of the space(s) separating the two halves of the H. I don&#039;t have any specialized knowledge of text encoding, but to me this seems to be some kind of extra DRM.]]></description>
		<content:encoded><![CDATA[<p>I have a topaz book in which a special font appears to have been used to scramble the text. For example, sometimes there is a word that begins with a capital H, and &#8212; probably because of a typographical error &#8212; the H appears divided into two halves that are separated by at least one space. If you highlight the word that begins with the H and then view the highlight under notes and bookmarks, the word shown in the highlight is completely different from the word actually highlighted, probably because of the typo of the space(s) separating the two halves of the H. I don&#8217;t have any specialized knowledge of text encoding, but to me this seems to be some kind of extra DRM.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: pidgeon92</title>
		<link>http://ireaderreview.com/2009/06/24/kindle-topaz-format-tpz/#comment-4329</link>
		<dc:creator><![CDATA[pidgeon92]]></dc:creator>
		<pubDate>Thu, 25 Jun 2009 04:18:06 +0000</pubDate>
		<guid isPermaLink="false">http://ireaderreview.com/?p=4269#comment-4329</guid>
		<description><![CDATA[New from 2008? It&#039;s not new.

Basically, it looks like a photocopy of a print book; that is, it doesn&#039;t look nice. You tend to end up with uneven fonts, and ink splotches on the page. Topaz is one of the reasons I check the sample of every Kindle book prior to purchase. Much of the formatting done for the Kindle has been extremely shoddy.]]></description>
		<content:encoded><![CDATA[<p>New from 2008? It&#8217;s not new.</p>
<p>Basically, it looks like a photocopy of a print book; that is, it doesn&#8217;t look nice. You tend to end up with uneven fonts, and ink splotches on the page. Topaz is one of the reasons I check the sample of every Kindle book prior to purchase. Much of the formatting done for the Kindle has been extremely shoddy.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: switch11</title>
		<link>http://ireaderreview.com/2009/06/24/kindle-topaz-format-tpz/#comment-4322</link>
		<dc:creator><![CDATA[switch11]]></dc:creator>
		<pubDate>Wed, 24 Jun 2009 20:52:59 +0000</pubDate>
		<guid isPermaLink="false">http://ireaderreview.com/?p=4269#comment-4322</guid>
		<description><![CDATA[thanks for the kind words. yes, that&#039;s probably what it is. The addition of the &#039;words per line&#039; option in thekindle dx is a good sign. Perhaps they add more options down the line like justification and fonts. One to &#039;bold&#039; text would be another good one to add.]]></description>
		<content:encoded><![CDATA[<p>thanks for the kind words. yes, that&#8217;s probably what it is. The addition of the &#8216;words per line&#8217; option in thekindle dx is a good sign. Perhaps they add more options down the line like justification and fonts. One to &#8216;bold&#8217; text would be another good one to add.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: radio_babylon</title>
		<link>http://ireaderreview.com/2009/06/24/kindle-topaz-format-tpz/#comment-4319</link>
		<dc:creator><![CDATA[radio_babylon]]></dc:creator>
		<pubDate>Wed, 24 Jun 2009 20:29:42 +0000</pubDate>
		<guid isPermaLink="false">http://ireaderreview.com/?p=4269#comment-4319</guid>
		<description><![CDATA[huh... i guess i never noticed. ill have to plug my kindle in to the pc and see if i have any of these new format books... although i dont much care either way, since like i said, i havent noticed any difference...]]></description>
		<content:encoded><![CDATA[<p>huh&#8230; i guess i never noticed. ill have to plug my kindle in to the pc and see if i have any of these new format books&#8230; although i dont much care either way, since like i said, i havent noticed any difference&#8230;</p>
]]></content:encoded>
	</item>
</channel>
</rss>

