Data GIS Blog
Duke University recently acquired access to the online version of its Current Population Statistics (CPS) CD-ROM collection to facilitate easy access to CPS data (Unicon’s CPS Utilities on the Web). This blog post will walk through the basic data extraction process. The interface is comparable to that provided by the CD, and users of this collection will find the interface and powerful. Please note that the instructions provided on the web site are very important to read, particularly for those unfamiliar with the CPS CD version.
Create an Account
When you visit the Unicon site (http://unicon.com/), click the “CPS on Web” link to the left, then click the Register button. You will have to enter some information to complete the registration process.
Once complete, submit the information. Once the registration window closes, choose the CPS series (or month) you wish to query, and log in to the system.
Once logged in, you will see a popup window like that shown in the image to the right. For a typical data extraction, the following steps are advised.
1) First, click the Set Option button and chang4e the timeout to at least 300 seconds. This will ensure successful data extraction.
2) Next, click the Make an Extraction button, followed by the Request Editor button on the next page. You should see a page similar to that below (all variables used in your prior extraction will be listed).
3) Remove any variables you do not need. Next, make certain the variable you wish to include is selected at the top and click “Add Variable(s).” Alternatively, if you already know the names of the variables, you may type them into the boxes provided on the page.
4) Once all variables are added to the selection, click Continue. On the following page, specify the output format for the dataset. Once complete, be certain to select one or more years (at the top). After you have selected years, click the Extract button.
5) On the following page, you will be presented with a list of variables by year. As variables change across years in some cases, not all selected variables may be present for each year. When selecting variables, checking the “View Documentation” checkbox at the top will allow for browsing of available years.
Other Useful Tools
- The Make a Table button allows for the construction of crosstabs of observations, means, and other statistics. This is helpful if the goal is to locate variables for analysis or if there is a choice between two or more variables.
- The Make a Graph button is also useful for data exploration. The program provides the ability to construct hsitograms, line charts, scatter ploys, pie charts, and bar charts. Basic summaries of a variable can also be generated from this page.
- If your data need to be weighted to represent the US population, be certain to select the appropriate weight under the Apply Weights button before extraction.
- Subsets of individuals can also be produced under the Specify Universe button. For example, a specific race or gender can be specified to reduce the sample to what you need.
The post Welcome to the Current Population Statistics on the Web appeared first on Duke Libraries Data & Visualization Services.
The fall of 2014 marks the completion of the first five years of the libraries’ Data and GIS Services Department. In 2009, when Mark Thomas and I formed the department, the name accurately reflected our staffing and services as Mark focused on GIS-related issues and I focused on data-related issues. As an increasing number of scholars have embraced data-driven research over the last five years , our services and staff have grown to support an increasingly diverse set of research needs at Duke.
In 2010-2011 academic year, the Libraries launched services around data management and sharing plans in anticipation of new funding rules surrounding research data. In 2012, the library expanded data services in collaboration with OIT’s Research Computing to offer one of the first data visualization consulting positions in the country. In 2013 and 2014, we expanded services and staff to include consultations on research computing and big data.
At this year’s Data and GIS Services annual retreat, we decided that the time has come to change the name of the department to reflect the broader range of staff and consulting services available. While we continue to support our traditional dimensions of data and GIS research, we intend to support a range of data needs across the following five themes:Data and Visualization Services Themes
Get the data you need. Data and Visualization Services consultants can help you locate and license a diverse range of data sources. We also provide long term storage for Duke data collections through Duke’s institutional repository.
Data Storage and Management
Need help on a data management plan, want advice on archiving, or struggling with “big data” analytics? We are happy to consult!
Data Cleaning and Analysis
From Google Refine to the command line, we can help with data cleaning and analysis.
Mapping and GIS
Mapping and spatial analysis remain a core service for the data and visualization program.
Our data visualization service can help with the most effective way to represent your data for both analysis and communication.
We appreciate the research community’s support as we’ve grown over the last five years. We look forward to working with you on a larger range of data challenges in the future!