5. Conclusions and recommendations for future work
In the context of this study, the search for "Rio de Janeiro" followed by eight other words, considered "condition criteria”, exemplified a case of web content search. Adding condition criteria made it possible to obtain a more effective result and restricted. But still, the amount of URLs returned is significant (approximately 468,000). How to make the search results more effective? From the unstructured data that were returned by the search engine, it has become feasible to draw up a table with structured data, through the lifting of the citation frequency of condition criteria for each referenced URL summary. At this table, it was associated with a decision class ("information class"), where it was possible to expand it to a "decision table". Subsequently, the decision table associated to Dominance principle, which allow extracting "patterns" (or rules) and hence add information to "ranking" of URLs. In this case, a "core" of suggested condition criteria emphasized the importance in highlighting that subset of criteria that are essential to the information system (decision table) in the study, which could not be eliminated without impact (negative) to the system [8]. Of the 96 relevant URLs suggested by the search engine (“Google”), it is observed that the best positioned URLs do not always return the desired information – ex, the site referring to the URL “21” (www.riodejaneironow.com/cultura.htm) suggested by Rule “1”, shows as much as or more information about “Rio de Janeiro” than the site referring to the URL “1” (vejario.abril.com.br/materia/eventos/programacao-450-anos-rio). About the significant URLs in the form of "ranking", the search engine according to its own criteria, exemplified in these cases, as it may become costly to attempt to analyze manually, a considerable mass of unstructured text. Thus, the logical rules generated based on a "decision table", allowed reveal patterns on the set of URLs returned by the search engine, however the existence of other tools and decision support techniques on "web mining" and in particular under uncertainties – ex, "document clustering” and "web mining soft" [15], [16]; “rough association rules” [17]; “rough-fuzzy” and “rough-wavelet” [18].