The system's two main ingredients are Apache Solr and Alfresco.
When you submit a EULA to Youluh, your EULA becomes part of a document library in Alfresco that's assigned to you. You won't ever interact directly with Alfresco, it's used as the back end for document management.
The Alfresco we use will have some customization around search. Each new EULA will be sent to a custom version of Solr for indexing. The customizations of Alfresco and Solr are referred to as XAM, which is expected to be an open source project.
Note: we picked the name XAM before we'd ever heard of Xamarin, which is now on our radar, so we'll probably change the project name to avoid confusion.
The XAM indexing process will break the EULA down into parts, typically the paragraphs that you see when you read the EULA.
Each part will be further analyzed and converted into an "essence" that gives the meaning of the part in an abstract form.
The essence allows us to find similar or identical parts in EULAs submitted by other users, by searching a Solr index. Once we locate the similar parts, we can tell you which parts are common, which parts are rare, and which parts have been commented on by other users.
This method is U.S. patent pending, and the open source license for XAM will include an automatic license to the patent. Leave a comment if you'd like to know more about how patents and open source can live together. More details about the patent are here, contact us if you'd like access.