add NSF data curation to website
cl117 committed Dec 9, 2024
1 parent 77e5bcb commit 1005bca
Showing 3 changed files with 75 additions and 0 deletions.
14 changes: 14 additions & 0 deletions content/Research/datacuration_about.md
@@ -0,0 +1,14 @@
+++
# About widget.
widget = "about_research" # See https://sourcethemes.com/academic/docs/page-builder/
headless = true # This file represents a page section.
active = true # Activate this widget? true/false
weight = 100 # Order that this section will appear in.

title = "Synthetic Biology Data Curation"

# Choose the user profile to display
# This should be the username of a profile in your `content/authors/` folder.
# See https://sourcethemes.com/academic/docs/get-started/#introduce-yourself
author = "NSF_Data_Curation"
+++
61 changes: 61 additions & 0 deletions content/authors/NSF_Data_Curation/_index.md
@@ -0,0 +1,61 @@
---
# Display name
title: Synthetic Biology Data Curation

# Use 1 for PI, 100 for current postdocs, 200 for current PhDs, 300 for current masters, 400 for current undergrads, 800 for alum postdocs, 810 for alum PhDs, 820 for alum masters, 830 for alum undergrads, 900 for tools, and 1000 for projects
weight: 1000

# Username (this should match the folder name)
authors:
- NSF_Data_Curation

# Is this the primary user of the site?
superuser: false

# Organizations/Affiliations
organizations:
- name: Genetic Design Automation
  url: ""
#- name: Formal Verification of Cyber-Physical Systems
# url: ""
#- name: Asynchronous Circuit Design and Verification
# url: ""
#- name: Analog Circuit Design and Verification
# url: ""

# Short bio (displayed in user profile at end of posts)
# bio: My research interests include distributed robotics, mobile computing and programmable matter.

# Social/Academic Networking
# For available icons, see: https://sourcethemes.com/academic/docs/page-builder/#icons
# For an email link, use "fas" icon pack, "envelope" icon, and a link in the
# form "mailto:[email protected]" or "#contact" for contact widget.
social:

# - icon: github
# icon_pack: fab
# link: https://github.com/synbioks
# - icon: file-alt
# icon_pack: fas
# link: https://synbioks.github.io/



# Link to a PDF of your resume/CV from the About widget.
# To enable, copy your resume/CV to `static/files/cv.pdf` and uncomment the lines below.
# - icon: cv
# icon_pack: ai
# link: files/cv.pdf

# Enter email to display Gravatar (if Gravatar enabled in Config)
email: ""


# Organizational groups that you belong to (for People widget)
# Set this to `[]` or comment out if you are not using People widget.
user_groups:
- Projects
- SBKS Project
---

Systems designed with synthetic biology have applications in areas including the environment, manufacturing, sensor development, defense, and medicine. However, progress in synthetic biology is currently impeded by the time required for literature studies and for replicating existing but poorly documented work. The Synthetic Biology Knowledge System (SBKS) project endeavored to address these challenges by integrating data from parts repositories with information extracted from the literature into a unified knowledge system. This form of post-hoc curation, however, requires curators who are separate from the original authors to extract knowledge from manuscript and supplemental text files after publication. To handle large amounts of data, machines scour the free text, attempt to recognize key words, and work out their meaning from context, which tests the limits of named entity recognition and entity classification. It also leaves ambiguous entities that only the original authors could disambiguate: for example, "yeast" may refer to many different strains. The SBKS project also extracted sequences provided as supplemental information in publications. Even when provided, these sequences are typically poorly annotated, incomplete, and in non-machine-readable formats. Taken together, the SBKS project demonstrated that reconstructing this important design information through post-hoc curation is extremely noisy and error-prone.
Binary file added content/authors/NSF_Data_Curation/avatar.png