Data News and Articles

How Innovation in Baltimore Led to Catalyte’s New $15M State Contract — Maryland has already innovated its own workforce pipeline by eliminating the requirement of a four-year degree for state jobs. This policy change is key to the 90 and 100 new jobs Catalyte wants to create. “Now that we’ve gotten rid of degree requirements, let’s create the talent base to produce and deliver. We all know there’re challenges in Baltimore but there are things that are happening here that are truly first time in the world. We should be shouting that from the mountaintop. We should be scaling these programs more.” Tags: OpenMD, Balitmore, Business, State, Government,
New AI Research Tool Turns Ideas Into Art — Imagine creating a digital painting without ever picking up a paintbrush or instantly generating storybook illustrations to accompany the words. Today, we’re showcasing an exploratory artificial intelligence (AI) research concept called Make-A-Scene that will allow people to bring their visions to life. Tags: AI, Art, Innovation
Is Data Scientist Still the Sexiest Job of the 21st Century? — The job of data scientist has become better institutionalized, its scope has been redefined, the technology it relies on has made huge strides, and the importance of non-technical expertise, such as ethics and change management, has grown. The many executives who recognize that data science is important to their businesses now need to create and oversee diverse data science teams rather than searching for data scientist unicorns. Tags: Data Science, Management
Measuring trends in Artificial Intelligence — The AI Index is an independent initiative at the Stanford Institute for Human-Centered Artificial Intelligence (HAI), led by the AI Index Steering Committee, an interdisciplinary group of experts from across academia and industry. The annual report tracks, collates, distills, and visualizes data relating to artificial intelligence, enabling decision-makers to take meaningful action to advance AI responsibly and ethically with humans in mind. Tags: AI, Ethics
Founding an Analytics Engineering Team — If your company is struggling to leverage analytics, dealing with an overgrown ecosystem of dashboards/databases or simply want to avoid the mistakes of others, this story is for you. In this article, I will walk through forming the first analytics engineering team at Smartsheet including how momentum built around forming the team, the challenges we faced, and the solutions we developed within the first year. Tags: Management, Engineering, System Engineer
|
|
Data Tools and Resources
Apache Kibble — Apache Kibble is a suite of tools for collecting, aggregating and visualizing activity in software projects. In follows a similar architecture to Gitana (and in fact to other several tools in the list), with a central Kibble server and a set of scanner applications specialized in working with a specific type of resource (a git repo, a mailing list, a JIRA instance, etc) and push compiled data objects to the Kibble Server. Tags: ML, Apache, Software
Gitana — Gitana, a project inspector that analyzes the support tools used in software projects and imports the information in a relational database, thus providing a central point to perform all kinds of cross-cutting analysis on project data. The current version of the tool provides support to inspect Git repositories, Bugzilla/GitHub issue trackers, Eclipse forums and Slack instant messages. To ensure efficiency, Gitana comes with an incremental propagation mechanism that refreshes the database content with the latest modifications available on the data sources. The approach also incorporates exporters to enable further data analysis with third-party tools. Tags: Software Analysis, Management
GH Archive — Open-source developers all over the world are working on millions of projects: writing code & documentation, fixing & submitting bugs, and so forth. GH Archive is a project to record the public GitHub timeline, archive it, and make it easily accessible for further analysis. Tags: Open Source, Git Analysis
|
|
How To's and Tutorials
4 Pandas Anti-Patterns to Avoid and How to Fix Them — pandas is a powerful data analysis library with a rich API that offers multiple ways to perform any given data manipulation task. Some of these approaches are better than others, and pandas users often learn suboptimal coding practices that become their default workflows. Tags: Pandas, Data Analysis
Tidy Finance with R — This book aims to lift the curtain on reproducible finance by providing a fully transparent code base for many common financial applications. We hope to inspire others to share their code publicly and take part in our journey towards more reproducible research in the future. Tags: R, Finance, Open Source
Graph Machine Learning at Airbnb — In this blog post, we will explain the benefits of using graphs for machine learning, and show how leveraging graph information allows us to learn more about our users, in addition to building more contextual representations of them. We will then cover specific graph machine learning methods, such as Graph Convolutional Networks, that are being used at Airbnb to improve upon existing machine learning models. Tags: ML, Neural Networks, Graph ML
|
|
Opportunities
Share Your Project
Have you been working on a data project and are ready to share your methods, processes, or results? Contact us to get started.
Book Review Opportunity
Are you interested in reviewing an O'Reilly book for the publisher and sharing your views with the world? As if that isn't enough, you get to take a book home to enjoy as well. Send us an email and we'll get you started.
Considering a Career Change?
Are you a software or system engineer, data scientist, analytic developer, or cybersecurity expert interested in learning about new opportunities?
Please send us an email to learn about the opportunities available with our partners.
Are You Hiring?
If your company is looking for data scientists, data engineers, software engineers, and other data related experts, please reach out so that we can help our members find new opportunities.
Please send us an email introducing your company and needs.
Get Involved with Data Works!
Want to be more involved in our data science community? If you have experience running workshops, hack-a-thons, curating newsletters, or are just interested in helping to grow the meetup, please send us an email!
|
|
|
|