利用DNA儲存數據(學生工作紙)Using DNA as a data storage (Student's worksheets)

STEM活動27.1STEM activity 27.1利用DNA儲存數據Using DNA as a data storage
學習技能:Learning skill: 解難能力Problem-solving skills 團隊合作Cooperation 運算思維Computational thinking
學習範圍:Learning areas: 0
  • DNA作為信息儲存的角色
  • Role of DNA as an information store
0
  • 認識DNA合成和序列測定在數據儲存的應用
  • Recognizing the applications of DNA synthesis and sequencing in data storage
0
  • 二進制代碼和DNA鹼基序列的轉換
  • Conversion between binary codes and DNA base sequences
0
  • 二進制數系統
  • Binary number system

簡介Introduction

在本活動中,你會探究如何以DNA作為數據儲存的媒介。你需要把文字訊息轉換成DNA密碼,並嘗試破解由其他同學設計的密碼。你還有機會展示團隊合作性、溝通技巧和解難能力。

In this activity, you will explore the use of DNA as a data storage medium. You will translate text messages into the DNA code and decode messages written by others. You will have a chance to demonstrate your team work, communication and problem-solving skills.



背景知識Background knowledge
1.
DNA的特性使它適合作為儲存遺傳信息的物質。
  • DNA是由大量核苷酸組成的大分子,能攜帶大量遺傳信息
  • DNA四種不同的核苷酸由四種含氮鹼基──腺嘌呤 (A)、胸腺嘧啶 (T)、胞嘧啶 (C) 和鳥嘌呤 (G) 形成。不同的鹼基序列可編碼無限多種不同的蛋白質。
  • DNA的分子穩定。兩條多核苷酸鏈通過互補鹼基之間的氫鍵連結,形成雙螺旋結構。鹼基朝向雙螺旋的內部,能保護遺傳信息,免受破壞。
1.
DNA is well suited to its function as a store of genetic information.
  • DNA is a large molecule made up of a huge number of nucleotides. This allows DNA to carry a large amount of genetic information.
  • There are four types of DNA nucleotides, each specified by one of four nitrogenous bases: adenine (A), thymine (T), cytosine (C) and guanine (G). Different sequences of bases code for a potentially infinite number of different protein molecules.
  • DNA is a stable molecule which exists as a double helix, with two polynucleotide chains held together by hydrogen bonds between complementary bases. The bases project towards the centre of the double helix. This protects the genetic information from damage.

2.
DNA能編碼並保存大量信息,啟發科學家研究以DNA作為數據儲存媒介。
DNA can encode and store a lot of information, which has inspired scientists to investigate its potential use as a data storage medium.

背景

在大數據時代,我們對數據儲存空間的需求急速增長。在不久的將來,傳統數碼儲存媒介,例如:硬碟、磁碟、光碟、可攜式硬碟及USB儲存裝置,或不足以滿足我們的需求。再者,這些媒介的壽命亦有限。以DNA作為數據儲存媒介是一個可能的解決方法。

Background information

In this era of big data, the demand for data storage capacity keeps growing rapidly. Traditional digital storage media, such as HDD (hard disk drive), floppy disks, CDs, DVDs, portable hard drives and USB streak drives, may soon not be able to meet the demand. Meanwhile, these media have limited span of life. DNA as a data storage medium may be a solution to these problems.

DNA儲存數據的原理

下圖顯示如何用DNA存取數據:

How DNA data storage works

The flow chart below outlines how to store data in DNA and how to retrieve it:

由數據編碼成DNA Encoding data in DNA
  1. 電腦和其他數碼電子設備用二進制數系統(只有「0」和「1」的數字系統)來儲存和處理數據。音訊檔案、文字、影片或圖像會在電腦系統中轉換成二進制代碼。
  2. 把二進制代碼會轉換成由四種鹼基 (A, T, C, G) 組合而成的DNA序列。
  3. 確定鹼基序列後,便可以合成DNA並儲存起來。一克的DNA能夠儲存高達2.15億GB的數據。
  1. Computers and other digital electronic devices store and process data with the binary number system that uses only two digital numbers ‘0’ and ‘1’. Audio files, text, videos or images are converted to binary codes in computer systems.
  2. These binary codes are converted to DNA sequences made up of combinations of four bases, A, T, C and G.
  3. Once the base sequence is determined, the DNA can be chemically synthesized and stored. One gram of DNA can store up to 215 million gigabytes (GB) of data.
從DNA取回數據 Retrieving data from DNA
  1. 要從DNA取回數據,需要進行DNA序列測定。
  2. 把DNA鹼基序列復原為二進制代碼。
  3. 電腦系統能夠處理二進制代碼並顯示原來的檔案。
  1. To retrieve the data, DNA sequencing is carried out.
  2. The DNA base sequence is converted back to binary codes.
  3. The computer interprets the binary codes as the original digital file.

有關科學本質的觀念

以DNA儲存數據的起源可以追溯至1988年,當時的科學家把圖像編碼到一個只有28個鹼基對的DNA片段。在2019年,科學家成功把16 GB的維基百科資料編碼到人工合成DNA上。DNA儲存容量之大,使它有着龐大的潛力作為儲存媒介(一克的DNA能夠儲存高達2.15億GB的數據),但是相關技術成本高,以及緩慢的讀取和寫入過程,導致現時的實際用途非常有限。由此可見,科學發展受當時的技術和設備所影響。

NOS ideas

One of the earliest uses of DNA data storage dates back to 1988 when scientists encoded graphic information in a DNA fragment consisting of 28 base pairs. In 2019, scientists reported that 16 GB of Wikipedia have been encoded into synthetic DNA. While DNA as a storage medium has enormous potential because of its high storage capacity—one gram of DNA can store up to 215 million GB of data, its practical use is currently severely limited because of its high cost and the slow processes of reading and writing data. This demonstrates that progress in science is influenced by the technology and the types of equipment available at the time.


活動Task
老師會把同學分成三至五人的小組。你會與其他組別比賽,模擬機器把數據編碼成DNA鹼基序列,以及從DNA鹼基序列中解碼的過程。 Your teacher will divide the class into groups of three to five. You will play a game which simulates how machines encode and decode data to and from DNA bases.

第一部分︰編碼Part I: Encoding
1.
與組員討論並寫出一個10個英文詞彙以內的信息。 Discuss with your group members to think of a message to encode. The message should be within 10 words.
2.
根據下頁的「二進制代碼表」,把信息轉換成一連串的二進制代碼。 Convert the message to a series of binary codes according to the chart on the next page.
字符 Character 二進制代碼 Binary Code
A01000001
B01000010
C01000011
D01000100
E01000101
F01000110
G01000111
H01001000
I01001001
J01001010
K01001011
L01001100
M01001101
字符 Character 二進制代碼 Binary Code
N01001110
O01001111
P01010000
Q01010001
R01010010
S01010011
T01010100
U01010101
V01010110
W01010111
X01011000
Y01011001
Z01011010
字符 Character 二進制代碼 Binary Code
.00101110
?00111111
(space)00100000
000110000
100110001
200110010
300110011
400110100
500110101
600110110
700110111
800111000
900111001
3.
為每個DNA鹼基配上一個二進制代碼。 Assign a binary code to each of the four DNA bases A, T, C and G.
A =     C =     T =     G =     A =     C =     T =     G =    
4.
把步驟2的二進制代碼轉換成DNA鹼基序列。 Convert the series of binary codes written in step 2 to a DNA base sequence.

第一部分︰編碼Part I: Encoding
5.
編碼後,把寫有DNA鹼基序列的p.5交給老師。 Hand in p.5 of your worksheet with the encoded DNA base sequence to your teacher.
6.
你和組員會收到其他組別的DNA鹼基序列,嘗試把DNA鹼基序列轉換成二進制代碼,然後再「解碼」成原來的信息。 Your group will receive the DNA base sequence from another group. Convert the DNA base sequence to binary codes, and then to text to retrieve the message.
7.
能在最短時間內把信息「解碼」的組別便勝出。 The group that can retrieve the message in the shortest time wins the game.

討論Discussion
1.
討論以DNA儲存數據的好處及限制。 Discuss about the advantages and current limitations of using DNA to store data.
2.
填寫下頁的「自我及組員評估表」,以評估自己和組員的表現。 Evaluate your group’s performance using the Self and peer evaluation form provided on the next page.
自我及組員評估表 Self and peer evaluation form
1.

把各組員的姓名寫在1–4的空格內,並為自己及各組員的表現評分。
分數:4 = 優異     3 = 良好     2 = 平平     1 = 有待改進

Write the names of your group members in the numbered boxes. Assign yourself and each group member a score for each attribute.
Score: 4 = Excellent     3 = Good     2 = Fair     1 = Need improvement

表現Attribute 組員Group member
Myself 1. 2. 3. 4.
積極提出意見Participation in developing ideas
願意聆聽組員的意見並作出回應Willingness to listen to and discuss other’s ideas
帶領/促進討論Participation in leading/facilitating discussion
對活動感興趣並投入其中Interest and enthusiasm in the activity
與組員的合作Cooperation with other group members
時間管理Time management
整體水平Overall contribution
2.

就團隊合作而言,你在這個活動中有甚麼得着?

What did you learn about working in a group from this activity?


3.

如果再有機會進行類似活動,你有哪些地方需要改進?

What would you have done differently, given the opportunity?


4.

你對這活動有甚麼評價和建議?

Do you have any other comments or suggestions about the activity?


延展部分Extension

進入以下連結,使用網上應用程式以轉換文字信息和DNA序列:
e-aristo.hk/r/bioccstem2701converter.c

Try the online programme to convert text to DNA sequences and vice versa:
e-aristo.hk/r/bioccstem2701converter.e

由文字轉換成DNA序列 From text to DNA sequence
由DNA序列轉換成文字 From DNA sequence to text
沒有使用權限No access right

你的學校未有使用本系列資源之權限。

Your school does not have the access right to open the resources of this series.