<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Sakai CLE ElasticSearch</title>
	<atom:link href="http://blog.tfd.co.uk/2012/10/11/sakai-cle-elasticsearch/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.tfd.co.uk/2012/10/11/sakai-cle-elasticsearch/</link>
	<description>Open Source Open Thought</description>
	<lastBuildDate>Thu, 24 Jan 2013 05:22:45 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Ian</title>
		<link>http://blog.tfd.co.uk/2012/10/11/sakai-cle-elasticsearch/#comment-1763</link>
		<dc:creator><![CDATA[Ian]]></dc:creator>
		<pubDate>Thu, 11 Oct 2012 20:16:35 +0000</pubDate>
		<guid isPermaLink="false">http://blog.tfd.co.uk/?p=748#comment-1763</guid>
		<description><![CDATA[Yes, AuthZ is always a problem. Fortunately we have a scheme that started in CLE and evolved in OAE. In the original scheme in CLE we used the site realm as the metadata indicating a user could see the items, and that works well with minimal performance impact for most CLE use cases. In OAE it was harder, since the number of &quot;reader&quot; tokens owned by a user is far higher and performing 100s of AuthZ queries (via Lucene&#039;s bitmaps) for each query is noticeable. Initially in OAE this caused problems. We did eventually develop a custom Lucene Query class that was benchmarked with a flat query profile for very large numbers of user reader tokens. (I guess I should call those &quot;principals&quot;). Although this was coded for Solr, I believe we can use it inside ElasticSearch. Unlike OAE which eventually (mistakenly IMHO) relied 100% of Solr for all queries, CLE will only be using search for search, and consequently the queries will not be insane. The limiting factor in the AuthZ resolution will become the cardinality of the AuthZ tokens as that drives the size of the inverted index bitmaps, which in turn is related to heap usage and performance.]]></description>
		<content:encoded><![CDATA[<p>Yes, AuthZ is always a problem. Fortunately we have a scheme that started in CLE and evolved in OAE. In the original scheme in CLE we used the site realm as the metadata indicating a user could see the items, and that works well with minimal performance impact for most CLE use cases. In OAE it was harder, since the number of &#8220;reader&#8221; tokens owned by a user is far higher and performing 100s of AuthZ queries (via Lucene&#8217;s bitmaps) for each query is noticeable. Initially in OAE this caused problems. We did eventually develop a custom Lucene Query class that was benchmarked with a flat query profile for very large numbers of user reader tokens. (I guess I should call those &#8220;principals&#8221;). Although this was coded for Solr, I believe we can use it inside ElasticSearch. Unlike OAE which eventually (mistakenly IMHO) relied 100% of Solr for all queries, CLE will only be using search for search, and consequently the queries will not be insane. The limiting factor in the AuthZ resolution will become the cardinality of the AuthZ tokens as that drives the size of the inverted index bitmaps, which in turn is related to heap usage and performance.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Charles Severance</title>
		<link>http://blog.tfd.co.uk/2012/10/11/sakai-cle-elasticsearch/#comment-1761</link>
		<dc:creator><![CDATA[Charles Severance]]></dc:creator>
		<pubDate>Thu, 11 Oct 2012 12:07:54 +0000</pubDate>
		<guid isPermaLink="false">http://blog.tfd.co.uk/?p=748#comment-1761</guid>
		<description><![CDATA[This sounds great.  I am looking forward to how this all fits together.   Search is frankly one of the harder problems to solve in an LMS and this would be a nice competitive advantage for the Sakai CLE IMHO.  Search in an LMS is far more difficult than search in nearly any other application because of the AUTHZ intricacies that force scaling in another dimension not typically a problem in classic large-corpus search applications.]]></description>
		<content:encoded><![CDATA[<p>This sounds great.  I am looking forward to how this all fits together.   Search is frankly one of the harder problems to solve in an LMS and this would be a nice competitive advantage for the Sakai CLE IMHO.  Search in an LMS is far more difficult than search in nearly any other application because of the AUTHZ intricacies that force scaling in another dimension not typically a problem in classic large-corpus search applications.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
