Informations for developers

Contribute to Bayes-Swarm development. Use Bayes-Swarm data in your tools. Access free datasets. There are many ways you can use to get started with Bayes-Swarm!

  • Datasets

    Bayes-Swarm offers sample datasets that you can explore and use as a testbed for the tools you will create. Are you interested in accessing more data? Contact us.


    Jan 2009 Pagestore (230Mb): A complete archive of almost 200 webpages and rss feeds we crawled in the month of January 2009.


    Jan 2009 Bayes db (4M datapoints, 26Mb): A MySQL database dump containing word occurrences extracted from the above pagestore, already divided by source, language, date and position where the word was found in the original text.


    Find out more


    Note (June 12th, 2009): We want to make available a sample instance of the MeanMachine too, but it is not quite ready yet. Stay tuned for updates!

  • APIs

    Bayes-Swarm data are accessible in various forms. You can decide between a CSV format (available on each graphical analysis) you can import into your tools, or access our data source programmatically using the Google Visualization APIs. If you are a visualization writer, contact us to have you visualization embedded in Bayes-Swarm.


    Note (June 12th, 2009): the documentation to access our GViz datasource is not available yet. Check again soon for updates! In the meanwhile, you can use the CSV export functionality provided with every graphical analysis you create.


    Tools

    Bayes-Swarm offers various tools to interact with the data it collects. Access them from here.

    1. Word Lookup
  • Contribute

    Bayes-Swarm is an open-source project released under the GNU GPL v2 License. You can browse the project information and source code at the code project website. We are happy to discuss and integrate any useful idea you might have!

    Find out more