Data News and Articles
AI Doesn’t Have to Be Too Complicated or Expensive for Your Business —
For most companies that are interested in using AI, there isn’t a clear model to follow. Given advances in AI technology, these organizations should shift their focus from building the right model — a software-focused approach — to focusing getting good data, which clearly illustrates the concepts we need the AI to learn, and using new machine learning operations (MLOps) tools. Tags: AI, ML, Strong Data
Data-Driven Album Covers —
I conversation with Tiziana Alocci, a freelance information designer who has created a number of data-driven album covers, and Sean Peoples, who runs Atlantic Rhythms records, a label with a consistent approach to creating abstract artwork for album covers. What follows is a lightly-edited conversation between the three. Tags: Data, Music, Crossover
The Pentagon Is Bolstering Its AI Systems—by Hacking Itself —
A new “red team” will try to anticipate and thwart attacks on machine learning programs. Tags: AI, ML, Security, Government
Real Estate Pricing with Machine Learning & Non-Traditional Data Sources —
Real estate is the world’s largest asset class, worthing $277 trillion (that’s 277 followed by 12 zeros, in case you were wondering), three times the total value of all publicly traded companies. And Machine Learning applications have been accompanying its sector’s growth. Tags: ML, Data, Real Estate
How to Build an AI Unicorn in 6 Years —
Today, Tractable is worth $1 billion. Our AI is used by millions of people across America, Asia and Europe to recover faster from road accidents. It helps recycle as many cars as Tesla put on the road in 2019. And yet 6 years ago, Tractable was just me and Raz, two college grads coding in a London basement. A year before that I knew nothing about tech. If it’s happened to me, it can happen to others, so here’s the story & learnings. Tags: AI, Management
What Do Data-Driven Companies Have in Common? —
As more organizations see and understand their data, and some grow data use at enviable rates, Tableau and IDC felt it was useful to explore behaviors, characteristics, and key trends that set data-leading companies—ones with a strong Data Culture—apart. Tags: Data, Management
How Airbnb Built “Wall” to Prevent Data Bugs —
Gaining trust in data with extensive data quality, accuracy and anomaly checks. Tags: Data Integrity, Quality
Data Tools and Resources
The open-source alternative that turns any MySQL, PostgreSQL, SQL Server, SQLite, and MariaDB into a smart-spreadsheet. Tags: SQL, Spreadsheet
Datafuse is an open source elastic and scalable cloud warehouse, it offers blazing fast query and combines elasticity, simplicity, low cost of the cloud, built to make the Data Cloud easy. Tags: Elastic, Cloud
Peanut provides a REST API, Admin Dashboard and a command line tool to deploy and configure the commonly used services like databases, message brokers, graphing, tracing, caching tools ... etc. It perfectly suited for development, manual testing, automated testing pipelines where mocking is not possible and test drives. Tags: Dashboard
Continual is the easiest way to maintain predictions – from customer churn to inventory forecasts – directly in your cloud data warehouse. It’s built for modern data teams that want to leverage machine learning to drive revenue, streamline operations, and power innovative products and services without complex engineering.Tags: Cloud, ML, Management
JupyterLite is a JupyterLab distribution that runs entirely in the browser built from the ground-up using JupyterLab components and extensions. Tags: Jupyter
AgentPy is an open-source library for the development and analysis of agent-based models in Python. The framework integrates the tasks of model design, interactive simulations, numerical experiments, and data analysis within a single environment. The package is optimized for interactive computing with IPython, IPySimulate, and Jupyter. Tags: Python, Jupyter
PolarDB for PostgreSQL (PolarDB for short) is an open-source database system based on PostgreSQL. It extends PostgreSQL to become a share-nothing distributed database, which supports global data consistency and ACID across database nodes, distributed SQL processing, and data redundancy and high availability through Paxos based replication. Tags: Database
How To's and Tutorials
Awesome MLOps —
An awesome list of references for MLOps. Tags: MLOps
Modern Statistics with R —
The past decades have transformed the world of statistical data analysis, with new methods, new types of data, and new computational tools. The aim of Modern Statistics with R is to introduce you to key parts of the modern statistical toolkit. Tags: Statistical Analysis, Data
Machine Learning with Missing Values —
Here we use simulated data to understanding the fundamentals of statistical learning with missing values. Tags: ML, Missing Data
Share Your Project
Have you been working on a data project and are ready to share your methods, processes, or results? Contact us to get started.
Be a Do-Gooder
Are you looking for a way to get involved in the community and make an impact? Check out the volunteer opportunities with U.S. Digital Response
Book Review Opportunity
Are you interested in reviewing an O'Reilly book for the publisher and sharing your views with the world? As if that isn't enough, you get to take a book home to enjoy as well. Send us an email
and we'll get you started.
Data Analysis Volunteer Work to Support Baltimore City
Are you an expert with data and willing to mentor, or are you an up and coming hobbyist looking for a side project to work on? We have put together a group to focus on a few problems working with Baltimore City data and need your help. The current project focuses on data parsing and analysis for the Baltimore Board of Estimates. If interested, please send us an email
or join us on Slack
to discuss building a side project group.
Considering a Career Change?
Are you a software or system engineer, data scientist, analytic developer, or cybersecurity expert interested in learning about new opportunities?
Please send us an email
to learn about the opportunities available with our partners.
Are You Hiring?
If your company is looking for data scientists, data engineers, software engineers, and other data related experts, please reach out so that we can help our members find new opportunities.
Please send us an email
introducing your company and needs.
Get Involved with Data Works!
Want to be more involved in our data science community? If you have experience running workshops, hack-a-thons, curating newsletters, or are just interested in helping to grow the meetup, please send us an email
Erias has an immediate need for Software Engineers, System Engineers, Test Engineers, Data Scientists, and System Administrators. External referral bonuses are available. For more information, please contact us at firstname.lastname@example.org