July 2022

Upcoming Events

(Don't) Mind the Gap: Bridging the Worlds of People and IoT Devices

Presented by Roberto Yus

Jul 21 | 6:00pm | UMBC Technology Center
Join us in July as continue our in-person events with a discussion on IoT devices and the sheer amount of new data being made available.

Online: Data Analysis with Cyber Security

Presented by Jackson Welch

Aug 17 | 6:00pm | Online
Join us in August as discuss how data analysis can help with cyber security. This will be an online presentation featuring demonstrations of several tools used for cyber security data analysis.

Past Events

DAX was a full day of incredible speakers and collaboration. We are working on processing the videos and will share them via social media and in the newsletter when they are available. 
Thank you again to our speakers for providing their insight and knowledge to the DWMD Community. 
DAX wouldn't have been possible without our sponsors support. 

Data News and Articles

How Innovation in Baltimore Led to Catalyte’s New $15M State ContractMaryland has already innovated its own workforce pipeline by eliminating the requirement of a four-year degree for state jobs. This policy change is key to the 90 and 100 new jobs Catalyte wants to create. “Now that we’ve gotten rid of degree requirements, let’s create the talent base to produce and deliver. We all know there’re challenges in Baltimore but there are things that are happening here that are truly first time in the world. We should be shouting that from the mountaintop. We should be scaling these programs more.” Tags: OpenMD, Balitmore, Business, State, Government,

New AI Research Tool Turns Ideas Into ArtImagine creating a digital painting without ever picking up a paintbrush or instantly generating storybook illustrations to accompany the words. Today, we’re showcasing an exploratory artificial intelligence (AI) research concept called Make-A-Scene that will allow people to bring their visions to life. Tags: AI, Art, Innovation

Is Data Scientist Still the Sexiest Job of the 21st Century?The job of data scientist has become better institutionalized, its scope has been redefined, the technology it relies on has made huge strides, and the importance of non-technical expertise, such as ethics and change management, has grown. The many executives who recognize that data science is important to their businesses now need to create and oversee diverse data science teams rather than searching for data scientist unicorns. Tags: Data Science, Management

Measuring trends in Artificial IntelligenceThe AI Index is an independent initiative at the Stanford Institute for Human-Centered Artificial Intelligence (HAI), led by the AI Index Steering Committee, an interdisciplinary group of experts from across academia and industry. The annual report tracks, collates, distills, and visualizes data relating to artificial intelligence, enabling decision-makers to take meaningful action to advance AI responsibly and ethically with humans in mind. Tags: AI, Ethics

Founding an Analytics Engineering TeamIf your company is struggling to leverage analytics, dealing with an overgrown ecosystem of dashboards/databases or simply want to avoid the mistakes of others, this story is for you. In this article, I will walk through forming the first analytics engineering team at Smartsheet including how momentum built around forming the team, the challenges we faced, and the solutions we developed within the first year. Tags: Management, Engineering, System Engineer

Data Tools and Resources

Apache KibbleApache Kibble is a suite of tools for collecting, aggregating and visualizing activity in software projects. In follows a similar architecture to Gitana (and in fact to other several tools in the list), with a central Kibble server and a set of scanner applications specialized in working with a specific type of resource (a git repo, a mailing list, a JIRA instance, etc) and push compiled data objects to the Kibble Server. Tags: ML, Apache, Software

GitanaGitana, a project inspector that analyzes the support tools used in software projects and imports the information in a relational database, thus providing a central point to perform all kinds of cross-cutting analysis on project data. The current version of the tool provides support to inspect Git repositories, Bugzilla/GitHub issue trackers, Eclipse forums and Slack instant messages. To ensure efficiency, Gitana comes with an incremental propagation mechanism that refreshes the database content with the latest modifications available on the data sources. The approach also incorporates exporters to enable further data analysis with third-party tools.Tags: Software Analysis, Management

GH ArchiveOpen-source developers all over the world are working on millions of projects: writing code & documentation, fixing & submitting bugs, and so forth. GH Archive is a project to record the public GitHub timeline, archive it, and make it easily accessible for further analysis. Tags: Open Source, Git Analysis

How To's and Tutorials


4 Pandas Anti-Patterns to Avoid and How to Fix Thempandas is a powerful data analysis library with a rich API that offers multiple ways to perform any given data manipulation task. Some of these approaches are better than others, and pandas users often learn suboptimal coding practices that become their default workflows. Tags: Pandas, Data Analysis

Tidy Finance with RThis book aims to lift the curtain on reproducible finance by providing a fully transparent code base for many common financial applications. We hope to inspire others to share their code publicly and take part in our journey towards more reproducible research in the future. Tags: R, Finance, Open Source

Graph Machine Learning at AirbnbIn this blog post, we will explain the benefits of using graphs for machine learning, and show how leveraging graph information allows us to learn more about our users, in addition to building more contextual representations of them. We will then cover specific graph machine learning methods, such as Graph Convolutional Networks, that are being used at Airbnb to improve upon existing machine learning models. Tags: ML, Neural Networks, Graph ML


Share Your Project
Have you been working on a data project and are ready to share your methods, processes, or results? Contact us to get started.

Book Review Opportunity
Are you interested in reviewing an O'Reilly book for the publisher and sharing your views with the world? As if that isn't enough, you get to take a book home to enjoy as well. Send us an email and we'll get you started.

Considering a Career Change?
Are you a software or system engineer, data scientist, analytic developer, or cybersecurity expert interested in learning about new opportunities?
Please send us an email to learn about the opportunities available with our partners.

Are You Hiring?
If your company is looking for data scientists, data engineers, software engineers, and other data related experts, please reach out so that we can help our members find new opportunities.
Please send us an email introducing your company and needs.

Get Involved with Data Works!
Want to be more involved in our data science community? If you have experience running workshops, hack-a-thons, curating newsletters, or are just interested in helping to grow the meetup, please send us an email!


Our sponsors help us bring data analysis Meetups, conferences, and newsletters to Maryland data enthusiasts. If you're interested in joining this prestigious group, send us an email
If you are interested in speaking, hosting, or sponsoring a meetup, have opportunities to list, or local news to share, please email

This email was sent to <<Email Address>>
why did I get this?    unsubscribe from this list    update subscription preferences
Data Works MD · 101 W Dickman St · Baltimore, MD 21784-9239 · USA

Email Marketing Powered by Mailchimp