R

See the following -

3 Open Source Alternatives to MATLAB

For many students in mathematics, physical sciences, engineering, economics, and other fields with a heavy numeric component, MATLAB is their first introduction to programming or scientific computing in general. It can be a good tool for learning, although (in my experience) many of the things that students and researchers use MATLAB for are not particularly demanding calculations; rather they could easily be conducted with any number of basic scripting tools, with or without statistical or math-oriented packages. However, it does have a near ubiquity in many academic settings, bringing with it a large community of users familiar with the language, plugins, and capabilities in general...

3 Open Source Alternatives to MATLAB

For many students in mathematics, physical sciences, engineering, economics, and other fields with a heavy numeric component, MATLAB is their first introduction to programming or scientific computing in general. It can be a good tool for learning, although in my experience many of the things that students and researchers alike use MATLAB for are not particularly demanding calculations that easily could be conducted with any number of basic scripting tools, with or without statistical or math-oriented packages. However, it does have a near ubiquity in many academic settings, bringing with it a large community of users familiar with the the language, plugins, and capabilities in general...

Read More »

A New Meaning for Connected Health at 2016 Symposium (Part 3)

Andy Oram | EMR & HIPPA | October 7, 2016

The previous section of this article paused during a discussion of the accuracy and uses of devices. At a panel on patient generated data, a speaker said that one factor holding back the use of patient data was the lack of sophistication in EHRs. They must be enhanced to preserve the provenance of data: whether it came from a device or from a manual record by the patient, and whether the device was consumer-grade or a well-tested medical device. Doctors invest different levels of trust in different methods of collecting data: devices can provide more objective information than other ways of asking patients for data. A participant in the panel also pointed out that devices are more reliable in the lab than under real-world conditions. Consumers must be educated about the proper use of devices, such as whether to sit down and how to hold their arms when taking their blood pressure...

Read More »

Analyze, Collaborate, and Share Research with Open Source Tools

The most powerful free and open source (FOSS) statistics program, though, is R. Originally a FOSS version of the statistics language S, R has shown explosive growth over the last few years, with some 7,000 add-on packages available to handle nearly any statistical requirement and an increasing number of books, courses, and blogs (e.g. R-bloggers) focusing on practical usage. Some websites concentrate specifically on how to use R for psychological research—an example is William Revelle's Personality Project, which also offers an R package called psych, a toolbox for personality, psychometrics, and experimental psychology...

Christine Doig on Data Science as a Team Discipline

Srini Penchikala | Info Q | August 26, 2016

Data science is about the design and development of solutions to extract insights from data (structured and unstructured) using machine learning and predictive analytics techniques and tools. Data Science as a discipline and Data Scientist as a role have been getting lots of attention in the recent years to solve real world problems with solutions ranging from fraud detection to recommendation engines. Christine Doig, Senior Data Scientist at Continuum Analytics, spoke at this year’s OSCON Conference about data science as a team discipline and how to navigate the data science Python ecosystem.

Read More »

Cloudera Unveils Open Source Workbench to Accelerate Data Science and Machine Learning

Press Release | Cloudera | May 1, 2017

Cloudera, Inc., the provider of the leading modern platform for machine learning and advanced analytics built on the latest open source technologies, announced the general availability of the Cloudera Data Science Workbench, its self-service tool for data scientists. The workbench, announced in beta at Strata+Hadoop World San Jose 2017, enables fast, easy and secure self-service data science for the enterprise. "We are entering the golden age of machine learning and it's all about the data. However, data scientists continue to struggle to build and test new analytics projects as fast as they would like, particularly in large scale environments," said Charles Zedlewski, senior vice president, Products at Cloudera.

Read More »

Continuum Analytics Teams Up with Intel for Python Distribution Powered by Anaconda

Press Release | Continuum Analytics | September 8, 2016

Continuum Analytics, the creator and driving force behind Anaconda, the leading Open Data Science platform powered by Python, is pleased to announce a technical collaboration with Intel resulting in the Intel® Distribution for Python powered by Anaconda. Intel Distribution for Python powered by Anaconda was recently announced by Intel and will be delivered as part of Intel® Parallel Studio XE 2017 software development suite. With a common distribution for the Open Data Science community that increases Python and R performance up to 100X, Intel has empowered enterprises to build a new generation of intelligent applications that drive immediate business value...

Read More »

Data Notebook for Python, R Gets Reworked for the Web

Serdar Yegulalp | Info World | August 22, 2016

Jupyter, the interactive data notebook for visualization and analysis with languages like Python and R, is undergoing a quiet but major reworking into a new product, JupyterLab. Jupyter's popularity extends beyond scientific or technical applications. Developers routinely share code snippets, sample output, and discussions of that output via Jupyter notebooks exported to HTML...

Read More »

Health Catalyst Launches Open Source Machine Learning: healthcare.ai

Press Release | healthcare.ai, Health Catalyst | December 1, 2016

Use of machine learning and predictive analytics to improve health outcomes has so far been limited to highly-trained data scientists, mostly in the nation's top academic medical centers. No longer. healthcare.ai is on a mission to make machine learning accessible to the thousands of healthcare professionals who possess little or no data science skills but who share an interest in using the technology to improve patient care. By making its central repository of proven machine learning algorithms available for free, healthcare.ai enables a large, diverse group of technical healthcare professionals to quickly use machine learning tools to build accurate models...

Read More »

Hot Programming Trends from 2016

Technology is constantly moving forward—well, maybe not always forward, but always moving. Even for someone who keeps an eye on the trends and their effect on programmers, discerning exactly where things are headed can be a challenge. My clearest glimpse into open source programming trends always comes in the fall when I work with my fellow chairs, Kelsey Hightower and Scott Hanselman, and our fantastic programming committee to sculpt the coming year's OSCON (O'Reilly Open Source Convention). The proposals that we get and the number focused on specific topics turn out to be good indicators of hot trends in the open source world. What follows is an overview of the top programming trends we saw in 2016...

How Helsinki Became The Most Successful Open-Data City In The World

Olli Sulopuisto | The Atlantic Cities | April 29, 2014

...Helsinki Region Infoshare publishes all of its data in formats that make it easy for software developers, researchers, journalists and others to analyze, combine or turn into web-based or mobile applications that citizens may find useful. In four years of operation, the project has produced more than 1,000 "machine-readable" data sources such as a map of traffic noise levels, real-time locations of snow plows, and a database of corporate taxes...

Read More »

Is the Future of mHealth Based on SMS and Inexpensive Mobile Phones?

Earlier this month the Finnish mobile phone maker, Nokia, announced that they will be re-releasing an updated version of their legendary 3310 GSM phone. Since it was first released in the early 2000s, the Nokia 3310 has gained a cult following for its incredible durability, long battery life and compact design featuring an internal antenna. Many Europeans and Americans fondly remember the 3310 as their first mobile phone, a device that made meeting up with friends in a crowd easier and a device that provided endless hours of entertainment with the timeless game Snake...

Read More »

Python Versus R for Machine Learning and Data Analysis

Machine learning and data analysis are two areas where open source has become almost the de facto license for innovative new tools. Both the Python and R languages have developed robust ecosystems of open source tools and libraries that help data scientists of any skill level more easily perform analytical work. The distinction between machine learning and data analysis is a bit fluid, but the main idea is that machine learning prioritizes predictive accuracy over model interpretability, while data analysis emphasizes interpretability and statistical inference. Python, being more concerned with predictive accuracy, has developed a positive reputation in machine learning. R, as a language for statistical inference, has made its name in data analysis...

Recursion Pharmaceuticals Selects Anaconda to Create Innovative Next Generation Drug Discovery Assay Platform to Eradicate Rare Genetic Diseases

Press Release | Recursion Pharmaceuticals, Continuum Analytics | October 26, 2016

Continuum Analytics, the creator and driving force behind Anaconda, the leading open data science platform powered by Python, today announced that Recursion Pharmaceuticals, LLC, a drug discovery company focused on rare genetic diseases, has adopted Bokeh––a Continuum Analytics open source visualization framework that operates on the Anaconda platform. Bokeh on Anaconda makes it easy for biologists to identify genetic disease markers and assess drug efficacy when visualizing cell data, allowing for faster time-to-value for pharmaceutical companies...

Read More »

Transforming Scientific Research with OpenStack

Maxwell Cooter | Diginomica | November 17, 2016

A cloud-based approach is often heralded as the natural way forward when it comes to improving agility. And whilst many traditional enterprises have turned to the technology, other types of organizations are seeing the benefits too. The Naturalis Biodiversity Center, based in Leiden, Netherlands, is one of the largest centres in the world for the study of biological and geographical diversity...

Read More »