GitHub Releases New Research on Copilot Chat

New research from GitHub takes a look at Copilot Chat’s impact on authoring and reviewing code. The company conducted a study where 36 developers used GitHub Copilot and Copilot Chat to author and review code – it’s obviously a small sample size, in part due to how new the tool is, though early results are interesting.

Ethical Web Data Collection Initiative Launches Certification Program

The Ethical Web Data Collection Initiative (EWDCI) is an industry-led consortium of web data collectors focused on strengthening public trust, promoting ethical guidelines, and helping businesses and their customers make informed data extraction choices. The association aims to raise the bar for ethics in the process widely known as “data scraping” with the goal of enhancing trust—a key component of a free, fair, and open Internet. 

Survey: Massive Retooling Around Large Language Models Underway

A recent survey of data scientists and engineers revealed that over half (53.3%) of today’s machine learning (ML) teams are planning on deploying a large language model (LLM) application of their own into production “within the next 12 months” or “as soon as possible”. Perhaps even more startling, however, is the finding that nearly one in ten (8.3%) enterprise ML teams have already deployed an LLM application into production.

Video Highlights: Make Better Decisions with Data — with Dr. Allen Downey

In this video presentation, our good friend Jon Krohn, Co-Founder and Chief Data Scientist at the machine learning company Nebula, is joined by Dr. Allen Downey, renowned author and professor, who shares insights from his upcoming book ‘Probably Overthinking It,’ breaking down underused techniques like Survival Analysis, explaining common paradoxes, discussing the dynamic Overton Window, and how to be prepared for Black Swan events. Strap in for a data-driven journey that will help you learn how to make better decisions with data!

Revolutionizing Bioscience Research: Creating an Atlas of the Human Body

Making healthcare and life science (HCLS) discoveries is time-consuming and requires considerable amounts of data. HPC enterprise infrastructure with AI and edge to cloud capabilities is required for biomedical research to make creating a human atlas of the body possible. The HPE, NVIDIA and Flywheel collaboration using the latest technologies designed for HCLS promise to transform biomedical research.

AWS Announces Amazon DataZone GA to Simplify Data Discovery and Governance

Amazon Web Services (AWS) announced the general availability of Amazon DataZone, a data management service that enables customers to catalog, discover, govern, share, and analyze data at scale across organizational boundaries. With Amazon DataZone, users can share and access data throughout an organization via a unified data portal and add business context to data to make data discovery easier, while balancing access governance with pre-defined approvals.

Capital One’s New Forrester Study Reveals Key Challenges Data Leaders Face Deploying Self-service Data Strategies

Capital One Software released a new commissioned study, “Leverage Self-service Data Strategies to Drive Business Value,” by Forrester Consulting that reveals the biggest challenges data leaders face when implementing a self-service data strategy across their organization.

What Does the Commercialization of Generative AI Mean for Society?

ACM, the Association for Computing Machinery has released “TechBrief: Generative Artificial Intelligence.” It is the latest in the quarterly ACM TechBriefs series of short technical bulletins that present scientifically grounded perspectives on the impact and policy implications of specific technological developments in computing.

Video Highlights: Vicuña, Gorilla, Chatbot Arena and Socially Beneficial LLMs — with Prof. Joey Gonzalez

LLM Vicuña, Chatbot Arena, and the race to increase LLM context windows: In this video presentation, guest Joey Gonzalez joins our good friend Jon Krohn, Co-Founder and Chief Data Scientist at the machine learning company Nebula, to talk about developing models and platforms that leverage and improve LLMs, as well as the future of AI development and access.

Anaconda’s State of Data Science Report Report Reveals Surge in AI Upskilling Among Data and IT Professionals

Anaconda Inc., provider of the popular platform for data science and modern AI development, released its sixth annual State of Data Science report, surfacing insights into today’s vibrant data science community and the growth and usage of AI and open-source software.